Crawlování na Webu
Thesis title in Czech: | Crawlování na Webu |
---|---|
Thesis title in English: | Crawling on the World Wide Web |
Academic year of topic announcement: | 2005/2006 |
Thesis type: | diploma thesis |
Thesis language: | čeština |
Department: | Department of Software Engineering (32-KSI) |
Supervisor: | RNDr. Leo Galamboš, Ph.D. |
Author: | hidden - assigned and confirmed by the Study Dept. |
Date of registration: | 14.11.2005 |
Date of assignment: | 14.11.2005 |
Date and time of defence: | 18.09.2007 00:00 |
Date of electronic submission: | 18.09.2007 |
Date of proceeded defence: | 18.09.2007 |
Opponents: | RNDr. Zuzana Vlčková |
Guidelines |
Cílem práce je porovnat stávající plánovací strategie ve webových robotech, navrhnout jejich případné vylepšení, a implementovat odpovídající monitor (plánovací strategii) ve webovém robotu. |
References |
M. Ehrig and A. Maedche. Ontology-focused crawling of Web documents. In Proc. of the 2003 ACM symposium on Applied computing, Melbourne, Florida, 2003.
C. C. Aggarwal, F. Al-Garawi, and P. Yu. Intelligent crawling on the world wide web with arbitrary predicates. In WWW-10, Hong Kong, 2001. Cho, J.; Garcia-Molina, H.; and Page, L. 1998. Efficient crawling through URL ordering. In WWW7. Jason Rennie and Andrew McCallum. Using reinforcement learning to spider the Web efficiently. In ICML-99, 1999. |