aboutsummaryrefslogtreecommitdiff
path: root/src/crawler
Commit message (Collapse)AuthorAge
* The project is now using Poetry.Cédric Bonhomme2020-02-26
|
* Improved the search of existing articles.Cédric Bonhomme2019-05-24
|
* Do not raise connection error from jarr_getCédric Bonhomme2019-05-23
|
* Updated revision date of the crawler.Cédric Bonhomme2019-05-23
|
* Improved default crawler.Cédric Bonhomme2019-05-23
|
* Rmoved list of created articles.Cédric Bonhomme2019-05-23
|
* Feeds are now retrieved with requests. feedparser is only used for the parsing.Cédric Bonhomme2019-05-22
|
* Do not update existing articles.Cédric Bonhomme2019-05-21
|
* Improved crawler.Cédric Bonhomme2019-05-21
|
* Removed debug printCédric Bonhomme2019-05-19
|
* fixed module name of the crawler in bootstrap.pyCédric Bonhomme2019-05-19
|
* Temporary fix for the logging module. Closes #44Cédric Bonhomme2019-05-12
|
* Updated some linksCédric Bonhomme2018-10-15
|
* Updated copyrightsCédric Bonhomme2018-07-01
|
* Correct spelling mistakes.Edward Betts2018-03-31
|
* Ignore duplicate.Cédric Bonhomme2018-03-10
|
* Renamed crawler name.Cédric Bonhomme2016-11-22
|
* cleaning the mess in the libs directoriesCédric Bonhomme2016-11-17
|
* It is now possible to configure the refresh interval of feeds (in minutes).Cédric Bonhomme2016-11-10
|
* various improvements to the crawler (better use of coroutines, test if an ↵Cédric Bonhomme2016-11-08
| | | | article should be updated). tags are now retrieved for the k-means clustering (previously achived with the content of articles)
* Some improvements for the manager and the asyncio crawler.Cédric Bonhomme2016-11-03
|
* Fixed an error in the logging after the insertion of an articleCédric Bonhomme2016-11-03
|
* Starting to improve the loging for Heroku (with logging.StreamHandler).Cédric Bonhomme2016-11-03
|
* useless returnCédric Bonhomme2016-11-02
|
* set result to None by defaultCédric Bonhomme2016-11-02
|
* log thisCédric Bonhomme2016-10-31
|
* debugCédric Bonhomme2016-10-31
|
* strange problem hereCédric Bonhomme2016-10-31
|
* updated -> updated_dateCédric Bonhomme2016-10-31
|
* Uses a strem handler for the logging on Heroku.Cédric Bonhomme2016-10-29
|
* a messCédric Bonhomme2016-10-28
|
* testCédric Bonhomme2016-10-28
|
* new name: the last one.Cédric Bonhomme2016-10-10
|
* testCédric Bonhomme2016-09-28
|
* testCédric Bonhomme2016-09-28
|
* fixCédric Bonhomme2016-09-28
|
* more testsCédric Bonhomme2016-09-28
|
* testCédric Bonhomme2016-09-28
|
* testCédric Bonhomme2016-09-28
|
* Updated installation script.Cédric Bonhomme2016-09-27
|
* Fixed a bug with SQLite support.Cédric Bonhomme2016-09-27
|
* Updated JARR GitHub address. Thanks GitHub!Cédric Bonhomme2016-04-18
|
* Fixed a bug when existing_article.updated_date is None.Cédric Bonhomme2016-02-14
|
* Changed some strane URLs, maybe due to the renaming of pyAggr3g470r to JARR...Cédric Bonhomme2016-02-14
|
* The date of modification of articles is now used to detect changes.Cédric Bonhomme2016-02-13
|
* The classic crawler is now able to update an article. Only content or title ↵Cédric Bonhomme2016-02-09
| | | | are updatable, as discussed in #27. Need to implement the notification system with the boolean when the content of an article has been updated.
* Updated Copyright years.Cédric Bonhomme2016-02-05
|
* writing a bit of doc, moving crawler togetherFrançois Schmidts2016-02-02
bgstack15