Differences
This shows you the differences between two versions of the page.
projs:clans:docs:crawlingpeoplerelatedweibooriginalpost [2014/01/26 20:14] yangjunfeng0317 created |
projs:clans:docs:crawlingpeoplerelatedweibooriginalpost [2014/02/04 18:31] (current) yangjunfeng0317 |
||
---|---|---|---|
Line 5: | Line 5: | ||
^ Parameter ^ Necessity ^ Type ^ Description ^ | ^ Parameter ^ Necessity ^ Type ^ Description ^ | ||
| pid | required | int | person id | | | pid | required | int | person id | | ||
- | | person_name | required | string | person name need to search | | + | | person_name | required | string | person name need to crawl | |
===== Output ===== | ===== Output ===== | ||
- | None | + | ^ Parameters ^ Type ^ Description ^ |
+ | | status | string | show the crawler running status | | ||
===== Implementation ===== | ===== Implementation ===== | ||
- | - masterStart() | + | - masterStart(). Create multiple processes to begin crawling data. |
- | - wapLogIn() | + | - wapLogIn(). Log in sina Account. |
- | - weiBoWapSearch(person_name, pid) | + | - weiBoWapSearch(person_name, pid). Use person name and person id to search person related weibo |
+ | * extractTopic(person_name, person_id ). Extract weibo text and insert to database. | ||
===== Related Work ===== | ===== Related Work ===== |