Differences

This shows you the differences between two versions of the page.

projs:qcat:home [2011/04/07 20:21] xfyu
projs:qcat:home [2011/04/08 17:16] (current) xfyu
Line 8: Line 8:
    * Use [[http://snowball.tartarus.org/algorithms/english/stop.txt|Stop-word list]]
  * Misspelled words
 +    * [[http://aspell.net/|GNU Aspell]]
  * Location-based queries
 +    * NER for location detection
  * Part-of-speech (POS) tagging
    * [[http://nlp.stanford.edu/software/tagger.shtml|Stanford POS tagger]]
Line 25: Line 27:
===== Useful tools =====
  *[[http://tartarus.org/~martin/PorterStemmer/index-old.html|The Porter Stemming Algorithm]]
 +  *[[http://aspell.net/|GNU Aspell]]
  *[[http://htmlparser.sourceforge.net/|Web page structure analysis]]
  *[[http://www.nzdl.org/Kea/|KEA for key word extraction]]
Line 54: Line 57:
===== Overall Workflow =====
 +{{:projs:qcat:figure-updated1.pdf|Workflow for query categorization}}
 
projs/qcat/home.1302178861.txt.gz · Last modified: 2011/04/07 20:21 by xfyu