Commit graph

14 commits

Author  SHA1  Message  Date
Geoff-Robin  4d5146c802  Added Documentation  2025-10-06 04:00:15 +05:30
Geoff-Robin  0f64f6804d  Done adding cron job web scraping  2025-10-06 03:45:09 +05:30
Geoff-Robin  e5633bc368  corrected F402 error pointed out by ruff check  2025-10-06 03:44:24 +05:30
Geoff-Robin  f449fce0f1  Done with scraping_task successfully  2025-10-06 02:27:20 +05:30
Geoff-Robin  f148b1df89  Added support for multiple base_url extraction  2025-10-05 20:13:44 +05:30
Geoff-Robin  c2aa95521c  removed structured argument  2025-10-05 20:00:19 +05:30
Geoff-Robin  2cba31a086  Tested and Debugged scraping usage in cognee.add() pipeline  2025-10-04 21:26:25 +05:30
Geoff-Robin  20fb77316c  Done with integration with add workflow when incremental_loading is set to False  2025-10-04 15:01:13 +05:30
Geoff-Robin  1ab9d24cf0  Changed bs4_connector.py to bs4_crawler.py  2025-10-03 12:33:13 +05:30
Geoff-Robin  edd119ef97  first iteration of bs4_connector.py done  2025-10-02 22:04:50 +05:30
Geoff-Robin  c283977035  switched httpx AsyncClient to fetch webpage  2025-10-02 02:01:46 +05:30
Geoff-Robin  60499c439c  Added logging  2025-10-02 01:54:56 +05:30
Geoff-Robin  925bd38195  Setup models.py and utils.py  2025-10-02 01:32:00 +05:30
Geoff-Robin  6348c9d8de  Created models.py  2025-09-30 20:46:26 +05:30