Yanlong Wang

Results 41 comments of Yanlong Wang
trafficstars

It is difficult to deploy it locally for now. This repo is only several days old. However, most of the code was already written before the CEO decided to set...

We are using the [@mozilla/readability](https://github.com/mozilla/readability) for content transformation instead of anything visual-related. It seems that the text in the `div` block was ignored because it lacks an explicit semantic meaning...

Oh indeed. Sorry. However it is true that `readability` ignored the headings. I can share the direct output of `readability` here: ```html In this quick start guide, we will build...

This was some other bug unrelated to Chinese characters. Was fixed in cbc13ec

The default return timing didn't work. You may manually specify your point of interest. Try with our new `x-target-selector` header: ```bash curl https://r.jina.ai/https://www.unibo.it/it/ateneo/organizzazione-e-sedi/servizi-di-ateneo/servizi-online/servizi-online-per-studenti/guida-servizi-online-studenti/liste-di-distribuzione-docenti-studenti -H 'x-target-selector: #content' ```

Try this: curl https://r.jina.ai/https://www.neu.edu.cn/xygk/lrld.htm -H 'x-respond-with: markdown'

It's not about iframe. It's the return timing. Our default return timing didn't work on this page. To properly crawl this kind of webpage, you need to know about its...

Replicate now has a runnable version: https://replicate.com/joehoover/instructblip-vicuna13b

这个格式readibility没有识别好, ```bash curl https://r.jina.ai/https://metatrend.ai/tweet-trends/v2\?date\=2024-05-18\&name\=AI\#section-3 -H 'x-respond-with: markdown' ``` 这样试试

这个服务器是美国的而且是谷歌云上面的所以它访问国内站点的时候有可能就会慢或者有点别的问题...