Which language would you recommend for web scraping?

The pages have simple content.
Basically it is data retrieval, with the results stored in a DB (any brand is OK).
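
For a simple page, the retrieve-and-store step can stay small. A minimal sketch, assuming Python with only the standard library: the sample HTML, the `LinkParser` class, the `store_links` helper, and the `links` table are all hypothetical names for illustration, and SQLite stands in for "any" database.

```python
import sqlite3
from html.parser import HTMLParser


class LinkParser(HTMLParser):
    """Collect (text, href) pairs from every <a href=...> tag in a page."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (text, href) pairs
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def store_links(conn, links):
    """Insert scraped rows into a hypothetical `links` table."""
    conn.execute("CREATE TABLE IF NOT EXISTS links (title TEXT, url TEXT)")
    conn.executemany("INSERT INTO links (title, url) VALUES (?, ?)", links)
    conn.commit()


# Hypothetical page content; a real run would download this first.
SAMPLE_HTML = (
    '<ul><li><a href="/a">First</a></li>'
    '<li><a href="/b">Second</a></li></ul>'
)

parser = LinkParser()
parser.feed(SAMPLE_HTML)

conn = sqlite3.connect(":memory:")  # pass a file path instead to persist
store_links(conn, parser.links)
rows = conn.execute("SELECT title, url FROM links ORDER BY url").fetchall()
```

For messier markup, a dedicated parsing library is usually more comfortable than `html.parser`, but the overall shape (parse, collect rows, insert) stays the same.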

1st phase - CSV format
2nd phase - web query UI for internal use (no authentication required)
3rd phase - provide API support for further integration (RESTful)
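
The 1st phase could be sketched with Python's standard `csv` module. The `rows_to_csv` helper, the field names, and the rows below are hypothetical sample data, not output from any real site.

```python
import csv
import io


def rows_to_csv(rows, fieldnames):
    """Serialize a list of dicts to CSV text with a header row."""
    buffer = io.StringIO(newline="")
    writer = csv.DictWriter(buffer, fieldnames=fieldnames)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical scraped records; the second one checks that commas get quoted.
sample_rows = [
    {"title": "First", "url": "/a"},
    {"title": "Second, with comma", "url": "/b"},
]
csv_text = rows_to_csv(sample_rows, ["title", "url"])
```

To write a file instead, the same `DictWriter` can wrap `open(path, "w", newline="", encoding="utf-8")`.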

I'm thinking of Python + Node.js + nginx. Does anyone have other suggestions?

The key point is that this is a one-off job: deliver it and be done, with no follow-up work afterwards.

Last edited by javacomhk on 2022-7-3 22:49

JavaScript works too.

https://www.hkepc.com/forum/redirect.php?goto=findpost&ptid=2662592&pid=41088700

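You can try this JSON API of mine. It updates automatically at 6 a.m. every day, so you don't need to run your own server, and the data is scraped with JavaScript.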

https://tinyurl.com/react9?request=getStock&stock=00001.HK&stock=00005.HK&stock=00700.HK
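
The URL above repeats the `stock` parameter once per symbol. A minimal sketch of building such a URL and fetching it with Python's standard library; `build_url` and `fetch_json` are hypothetical helper names, and since the shape of the returned JSON is not described in this thread, the sketch only decodes it without assuming any fields.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

BASE_URL = "https://tinyurl.com/react9"  # endpoint quoted in the post


def build_url(stocks):
    """Build the query string, repeating `stock` once per symbol."""
    params = [("request", "getStock")] + [("stock", s) for s in stocks]
    return BASE_URL + "?" + urlencode(params)


def fetch_json(url, timeout=10):
    """Download a URL and decode the body as JSON (structure not assumed)."""
    with urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


url = build_url(["00001.HK", "00005.HK", "00700.HK"])
```

Calling `fetch_json(url)` needs network access and assumes the service still responds.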


Reply to #1 cyrus_ho

Python (FastAPI) + nginx (React)
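
For the RESTful phase, a framework such as FastAPI would handle routing and validation. As a dependency-free illustration of the same idea (this is the standard library's WSGI interface, not FastAPI itself), here is a minimal sketch of a read-only JSON endpoint; the `/items` path, the `ITEMS` data, and the `serve` helper are hypothetical.

```python
import json
from wsgiref.simple_server import make_server

# Hypothetical scraped records; a real service would read them from the DB.
ITEMS = [{"title": "First", "url": "/a"}, {"title": "Second", "url": "/b"}]


def app(environ, start_response):
    """Answer GET /items with the records as JSON, anything else with 404."""
    is_get = environ.get("REQUEST_METHOD") == "GET"
    if is_get and environ.get("PATH_INFO") == "/items":
        body = json.dumps(ITEMS).encode("utf-8")
        status = "200 OK"
    else:
        body = json.dumps({"error": "not found"}).encode("utf-8")
        status = "404 Not Found"
    headers = [
        ("Content-Type", "application/json"),
        ("Content-Length", str(len(body))),
    ]
    start_response(status, headers)
    return [body]


def serve(port=8000):
    """Run the app locally; nginx could sit in front as a reverse proxy."""
    with make_server("127.0.0.1", port, app) as server:
        server.serve_forever()
```

Running `serve()` and opening `http://127.0.0.1:8000/items` should return the sample records as JSON.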


Reply to #1 cyrus_ho

I use Node.js with cheerio, though there may be better options.


Reply to #4 gamezz


+1 on cheerio. For complicated sites, use Puppeteer.
