PulsarRPA icon indicating copy to clipboard operation
PulsarRPA copied to clipboard

Automate webpages at scale, scrape web data completely and accurately with high performance, distributed RPA.

Results 47 PulsarRPA issues
Sort by recently updated
recently updated
newest added

Some websites use selectors what do not match the standard. For example, `` the charactor "+" is not allowed in a class name so Jsoup throws a SelectorParseException, and pulsar-dom...

Mongodb is already closed before MiscMessageWriter.close in which WebDb.flush is called. This happens when embeded mongodb is started in Exotic. A possible solution is to remove the WebDb dependency by...

Add a new method canConnect or equivent to Pulsar Driver.

2022-05-22 16:49:04.067 WARN [r-worker-5] a.p.p.p.b.e.BrowserEmulator - Unexpected exception java.lang.StringIndexOutOfBoundsException: String index out of range: 15 at java.base/java.lang.StringLatin1.charAt(StringLatin1.java:48) at java.base/java.lang.String.charAt(String.java:711) at ai.platon.pulsar.common.HtmlUtils.isBlankBody(Htmls.kt:107) at ai.platon.pulsar.protocol.browser.emulator.EmulateEventHandler.checkHtmlIntegrity(EmulateEventHandler.kt:137) at ai.platon.pulsar.protocol.browser.emulator.EmulateEventHandler.onAfterNavigate(EmulateEventHandler.kt:89) at ai.platon.pulsar.protocol.browser.emulator.BrowserEmulator.browseWithMinorExceptionsHandled(BrowserEmulator.kt:180) at ai.platon.pulsar.protocol.browser.emulator.BrowserEmulator.access$browseWithMinorExceptionsHandled(BrowserEmulator.kt:34) at...

When run exotic-standalone on WSL, we see: 2022-05-20 15:51:31.107 INFO [r-worker-4] a.p.p.p.b.d.c.ChromeDevtoolsDriver - TypeError: document.body.HMNvqKforEach is not a function at Function.HMNvqKutils__.updatePulsarStat (:300:23) at Function.HMNvqKutils__.isActuallyReady (:237:19) at Function.HMNvqKutils__.checkPulsarStatus (:178:31) at Function.HMNvqKutils__.waitForReady...

Pulsar have to know the chrome home to play with it, a user specified path is required. By default, pulsar searches the following paths for google chrome: val CHROME_BINARY_SEARCH_PATHS =...

A typical error message: ``` This page isn’t working wrd.walmart.com didn’t send any data. ERR_EMPTY_RESPONSE ``` It seems that the proxy ip has been blocked by the website.

![image](https://github.com/platonai/PulsarRPA/assets/40849111/7857607f-3f04-4e25-9318-4cc155c6a2c9) 其他分支也看了 也没有这个包及文件

The manual login method is: 1. [Optional] Delete ~/.pulsar/browser 2. Run OpenPrototypeChrome.kt 3. Manually visit the target website and browse several more web pages to create a browsing context. 4....

wontfix

url:**https://www.ebay.com/itm/155607414510** ![image](https://github.com/platonai/PulsarRPA/assets/35439240/a6d19b47-6473-4766-9f3d-6f89af45ceba)