Differences
This shows you the differences between two versions of the page.
|
projs:qcat:home [2011/04/07 20:21] xfyu |
projs:qcat:home [2011/04/08 17:16] (current) xfyu |
||
|---|---|---|---|
| Line 8: | Line 8: | ||
| * Use [[http://snowball.tartarus.org/algorithms/english/stop.txt|Stop-word list]] | * Use [[http://snowball.tartarus.org/algorithms/english/stop.txt|Stop-word list]] | ||
| * Misspelled words | * Misspelled words | ||
| + | * [[http://aspell.net/|GNU Aspell]] | ||
| * Location-based queries | * Location-based queries | ||
| + | * NER for location detection | ||
| * Part-of-speech (POS) tagging | * Part-of-speech (POS) tagging | ||
| * [[http://nlp.stanford.edu/software/tagger.shtml|Stanford POS tagger]] | * [[http://nlp.stanford.edu/software/tagger.shtml|Stanford POS tagger]] | ||
| Line 25: | Line 27: | ||
| ===== Useful tools ===== | ===== Useful tools ===== | ||
| *[[http://tartarus.org/~martin/PorterStemmer/index-old.html|The Porter Stemming Algorithm]] | *[[http://tartarus.org/~martin/PorterStemmer/index-old.html|The Porter Stemming Algorithm]] | ||
| + | *[[http://aspell.net/|GNU Aspell]] | ||
| *[[http://htmlparser.sourceforge.net/|Web page structure analysis]] | *[[http://htmlparser.sourceforge.net/|Web page structure analysis]] | ||
| *[[http://www.nzdl.org/Kea/|KEA for key word extraction]] | *[[http://www.nzdl.org/Kea/|KEA for key word extraction]] | ||
| Line 54: | Line 57: | ||
| ===== Overall Workflow ===== | ===== Overall Workflow ===== | ||
| + | {{:projs:qcat:figure-updated1.pdf|Workflow for query categorization}} | ||