Differences
This shows you the differences between two versions of the page.
projs:qcat:home [2011/04/07 20:21] xfyu |
projs:qcat:home [2011/04/08 17:16] (current) xfyu |
||
---|---|---|---|
Line 8: | Line 8: | ||
* Use [[http://snowball.tartarus.org/algorithms/english/stop.txt|Stop-word list]] | * Use [[http://snowball.tartarus.org/algorithms/english/stop.txt|Stop-word list]] | ||
* Misspelled words | * Misspelled words | ||
+ | * [[http://aspell.net/|GNU Aspell]] | ||
* Location-based queries | * Location-based queries | ||
+ | * NER for location detection | ||
* Part-of-speech (POS) tagging | * Part-of-speech (POS) tagging | ||
* [[http://nlp.stanford.edu/software/tagger.shtml|Stanford POS tagger]] | * [[http://nlp.stanford.edu/software/tagger.shtml|Stanford POS tagger]] | ||
Line 25: | Line 27: | ||
===== Useful tools ===== | ===== Useful tools ===== | ||
*[[http://tartarus.org/~martin/PorterStemmer/index-old.html|The Porter Stemming Algorithm]] | *[[http://tartarus.org/~martin/PorterStemmer/index-old.html|The Porter Stemming Algorithm]] | ||
+ | *[[http://aspell.net/|GNU Aspell]] | ||
*[[http://htmlparser.sourceforge.net/|Web page structure analysis]] | *[[http://htmlparser.sourceforge.net/|Web page structure analysis]] | ||
*[[http://www.nzdl.org/Kea/|KEA for key word extraction]] | *[[http://www.nzdl.org/Kea/|KEA for key word extraction]] | ||
Line 54: | Line 57: | ||
===== Overall Workflow ===== | ===== Overall Workflow ===== | ||
+ | {{:projs:qcat:figure-updated1.pdf|Workflow for query categorization}} |