Skip to content

Build a real time distributed search engine with ElasticSearch, PageRank Algorithm, TF-IDF, NLP(LDA, SVD), etc.

Notifications You must be signed in to change notification settings

ruy1su/Web-Search-Engine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web-Search-Engine

Part1

Create a web crawler in Java to crawl the wikipages represented in Json. Use Elasticsearch to index the Wikipedia dataset and to enable keyword searches on the dataset. I will also apply various text transformations to the dataset and observe the effect of the transformations in search results.

Part2

TBA..

About

Build a real time distributed search engine with ElasticSearch, PageRank Algorithm, TF-IDF, NLP(LDA, SVD), etc.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published