Differences
This shows you the differences between two versions of the page.
|
projs:clans:docs:crawlingpeoplerelatedweibooriginalpost [2014/01/26 20:14] yangjunfeng0317 created |
projs:clans:docs:crawlingpeoplerelatedweibooriginalpost [2014/02/04 18:31] (current) yangjunfeng0317 |
||
|---|---|---|---|
| Line 5: | Line 5: | ||
| ^ Parameter ^ Necessity ^ Type ^ Description ^ | ^ Parameter ^ Necessity ^ Type ^ Description ^ | ||
| | pid | required | int | person id | | | pid | required | int | person id | | ||
| - | | person_name | required | string | person name need to search | | + | | person_name | required | string | person name need to crawl | |
| ===== Output ===== | ===== Output ===== | ||
| - | None | + | ^ Parameters ^ Type ^ Description ^ |
| + | | status | string | show the crawler running status | | ||
| ===== Implementation ===== | ===== Implementation ===== | ||
| - | - masterStart() | + | - masterStart(). Create multiple processes to begin crawling data. |
| - | - wapLogIn() | + | - wapLogIn(). Log in sina Account. |
| - | - weiBoWapSearch(person_name, pid) | + | - weiBoWapSearch(person_name, pid). Use person name and person id to search person related weibo |
| + | * extractTopic(person_name, person_id ). Extract weibo text and insert to database. | ||
| ===== Related Work ===== | ===== Related Work ===== | ||