Text mining: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Robert Badgett
No edit summary
imported>Robert Badgett
(→‎External links: added a link)
Line 9: Line 9:
* National Institute of Standards and Technology's (NIST) Information Technology Laboratory (ITL): [http://www.itl.nist.gov/iaui/894.02/related_projects/muc/ Introduction to Information Extraction]
* National Institute of Standards and Technology's (NIST) Information Technology Laboratory (ITL): [http://www.itl.nist.gov/iaui/894.02/related_projects/muc/ Introduction to Information Extraction]
[[Category:CZ Live]] [[Category:Library and Information Science Workgroup]]
[[Category:CZ Live]] [[Category:Library and Information Science Workgroup]]
* [http://textanalytics.wikidot.com/ Text Analytics Wiki]

Revision as of 14:08, 22 January 2008

Text mining "involves analysing a large collection of documents to discover previously unknown information".[1]

Coping with the many ways that a concept may be expressed in text (Zipf's law) remains a barrier in achieving results with text mining that are as good a human-curated results.[2]

References

  1. Text Mining briefing paper : JISC. Retrieved on 2008-01-22.
  2. Rebholz-Schuhmann D, Kirsch H, Couto F (2005). "Facts from text--is text mining ready to deliver?". PLoS Biol. 3 (2): e65. DOI:10.1371/journal.pbio.0030065. PMID 15719064. Research Blogging. PubMed Central

External links