Schwarz, Josef (2003). Současný stav a trendy automatické indexace dokumentů [Current state and trends in automatic document indexing].
Christopher D. Manning, Hinrich Schütze: Foundations of Statistical Natural Language Processing, MIT Press (1999), ISBN 978-0-262-13360-9, p. xxxi
Büttcher (Google) et al. Information Retrieval: Implementing and Evaluating Search Engines. Cambridge, Massachusetts: MIT Press. 2010. ISBN 978-0-262-02651-2
Nordbotten. "Multimedia Information Retrieval Systems".
Bates, M. (1995). Models of natural language understanding. Proceedings of the National Academy of Sciences of the United States of America, Vol. 92, No. 22 (Oct. 24, 1995), pp. 9977–9982.
Steven Bird, Ewan Klein, and Edward Loper (2009). Natural Language Processing with Python. O'Reilly Media. ISBN 978-0-596-51649-9.
David M. W. Powers and Christopher C. R. Turk (1989). Machine Learning of Natural Language. Springer-Verlag. ISBN 978-0-387-19557-5.
Selected scholarly articles
Chakrabarti (2003). Mining the Web. Morgan Kaufmann Publishers. ISBN 1-55860-754-4
Brin and Page (1998). The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1-7):107–117.
Liu (2007). Web Data Mining: Exploring Hyperlinks, Contents and Usage Data. Springer. ISBN 3-540-37881-2
Zobel et al. (2006). "Inverted Files for Text Search Engines". ACM Computing Surveys (New York: Association for Computing Machinery) 38 (2): 6. doi:10.1145/1132956.1132959
Lashkari et al. (2009), A Boolean Model in Information Retrieval for Search Engines, doi:10.1109/ICIME.2009.101
Salton et al. (1983). "Extended Boolean information retrieval". Commun. ACM (ACM) 26 (11): 1022. doi:10.1145/182.358466
Salton et al. (1975), "A Vector Space Model for Automatic Indexing," Communications of the ACM, vol. 18, nr. 11, pages 613–620.
Luk et al. (2002). "A survey in indexing and searching XML documents". Journal of the American Society for Information Science and Technology 53 (6): 415–437. doi:10.1002/asi.10056
Prager et al. (2000). Question-answering by predictive annotation. In Proceedings, 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Athens, Greece.
Sebastiani (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34(1):1–47.
Andrews and Fox (2007). Recent Developments in Document Clustering.
Carpineto et al. (2009). A survey of Web clustering engines. ACM Computing Surveys (CSUR), Volume 41, Issue 3 (July 2009), Article No. 17, ISSN 0360-0300