Differences
This shows you the differences between two versions of the page.
projs:clans:docs:crawlingretweet [2014/01/26 20:29] yangjunfeng0317 created |
projs:clans:docs:crawlingretweet [2014/02/04 18:32] (current) yangjunfeng0317 |
||
---|---|---|---|
Line 8: | Line 8: | ||
===== Output ===== | ===== Output ===== | ||
- | None | + | ^ Parameters ^ Type ^ Description ^ |
+ | | status | string | show the crawler running status | | ||
===== Implementation ===== | ===== Implementation ===== | ||
- masterStart(). Create multiple processes to begin crawling data. | - masterStart(). Create multiple processes to begin crawling data. | ||
- wapLogIn(). Log in sina Account. | - wapLogIn(). Log in sina Account. | ||
- | - weiBoWapSearch(person_name, pid). Use person name and person id to search person related weibo | + | - weiBoWapSearch(searchStr, Sid). Use searchStr(person name or company name) and search id(person id or company id) to search related weibo |
* extractTopic(person_name or company_name, person_id or company_id). Extract weibo text and insert to database. | * extractTopic(person_name or company_name, person_id or company_id). Extract weibo text and insert to database. | ||
* getRetweet(retweet_url, weibo_id). Extract retweet of original weibo text and insert to database. | * getRetweet(retweet_url, weibo_id). Extract retweet of original weibo text and insert to database. |