Text mining

From Citizendium
Revision as of 14:06, 22 January 2008 by Robert Badgett

Text mining "involves analysing a large collection of documents to discover previously unknown information".[1]

Coping with the many ways that a concept may be expressed in text (Zipf's law) remains a barrier to achieving results with text mining that are as good as human-curated results.[2]
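
The effect of variant expressions can be illustrated with a minimal sketch. It is not taken from the cited sources: the synonym table, the sample sentences, and the function names count_exact and count_concepts are all hypothetical, chosen only to show how a search for one exact phrase misses documents that express the same concept differently.

```python
import re
from collections import Counter

# Hypothetical synonym table (an assumption for illustration only):
# each surface form maps to one canonical concept label.
SYNONYMS = {
    "heart attack": "myocardial infarction",
    "myocardial infarction": "myocardial infarction",
    "mi": "myocardial infarction",
    "cardiac infarction": "myocardial infarction",
}

# Made-up sample documents.
DOCS = [
    "The patient was admitted after a heart attack.",
    "Acute myocardial infarction was confirmed by ECG.",
    "History of MI in 2001.",
    "No cardiac events were reported.",
]


def count_exact(documents, phrase):
    """Count documents containing one exact phrase (case-insensitive)."""
    pattern = re.compile(r"\b" + re.escape(phrase.lower()) + r"\b")
    return sum(1 for doc in documents if pattern.search(doc.lower()))


def count_concepts(documents, synonyms=SYNONYMS):
    """Count documents mentioning each concept under any listed surface form."""
    counts = Counter()
    for doc in documents:
        text = doc.lower()
        found = set()
        for surface, concept in synonyms.items():
            if re.search(r"\b" + re.escape(surface) + r"\b", text):
                found.add(concept)
        # Each document contributes at most once per concept.
        counts.update(found)
    return counts


if __name__ == "__main__":
    print(count_exact(DOCS, "myocardial infarction"))
    print(count_concepts(DOCS)["myocardial infarction"])
```

On these sample sentences the exact-phrase search finds one document, while the synonym lookup finds three. A hand-built table like this one is itself a form of human curation, and a short abbreviation such as "MI" can be ambiguous in real text, which is part of why the problem remains difficult.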

References

  1. Text Mining briefing paper : JISC. Retrieved on 2008-01-22.
  2. Rebholz-Schuhmann D, Kirsch H, Couto F (2005). "Facts from text--is text mining ready to deliver?". PLoS Biol. 3 (2): e65. doi:10.1371/journal.pbio.0030065. PMID 15719064. PubMed Central.

External links

National Institute of Standards and Technology's (NIST) Information Technology Laboratory (ITL): Introduction to Information Extraction