Andri Mirzal Build Your Own Search Engine Python Programming LAP LAMBERT Academic Series Publishing Contents 1 Introduction 2 Preparing Your 2.1 Installing Python 4 2.2 Installing SciPy and NumPy 7 2.3 Installing Beautiful Soup 2.4 Installing PySQLite 10 2.5 Installing PyLab 10 3 The Algorithms 12 3.1 PageRank 12 3.2 HITS 15 3.3 Okapi BM25 18 3.4 Anchor Model 19 3.5 Porter Stemmer 21 4 The Design of the Search 4.1 Introduction 27 4.2 System Overview 29 4.2.1 Improving reliability of the crawling process 29 4.2.2 Adding an optional 1 Environment 4 9 Engine 27 stemming function 32 functions 33 4.2.3 Adding 4.2.4 Organizing the 4.2.5 Adding 4.3 Database Design 5 Conclusion 37 References 38 Appendix 40 new scores scores functions additional indexes to linkwords table 34 34 35