Repository for the Data Science and Big Data Analytics course. NRNU "MEPhI", Institute of Cyber Intelligence Systems.
Task: count users per browser from httpd logs.
Tech: Hadoop.
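
The browser count can be pictured as a map step that emits one (browser, 1) pair per log line and a reduce step that sums the pairs. The sketch below is a plain-Python illustration of that idea, not the repository's Hadoop job. It assumes the logs use the Apache combined format, where the User-Agent is the last quoted field, and it counts requests per browser rather than distinct users. The names `map_line`, `reduce_counts`, `count_browsers` and the marker list are hypothetical.

```python
import re
from collections import Counter

# Assumption: combined log format, so the User-Agent is the last quoted field.
UA_PATTERN = re.compile(r'"([^"]*)"\s*$')

# Order matters: Edge UAs also contain "Chrome/", Chrome UAs also contain "Safari/".
BROWSER_MARKERS = [
    ("Edg/", "Edge"),
    ("OPR/", "Opera"),
    ("Firefox/", "Firefox"),
    ("Chrome/", "Chrome"),
    ("Safari/", "Safari"),
]


def map_line(line):
    """Map step: return (browser, 1) for a log line, or None if it cannot be parsed."""
    match = UA_PATTERN.search(line.strip())
    if not match:
        return None
    user_agent = match.group(1)
    for marker, name in BROWSER_MARKERS:
        if marker in user_agent:
            return (name, 1)
    return ("Other", 1)


def reduce_counts(pairs):
    """Reduce step: sum the counts emitted for each browser."""
    totals = Counter()
    for name, count in pairs:
        totals[name] += count
    return dict(totals)


def count_browsers(lines):
    """Run the map and reduce steps over an iterable of log lines."""
    pairs = (map_line(line) for line in lines)
    return reduce_counts(pair for pair in pairs if pair is not None)
```

Counting distinct users instead of requests would need a key such as (browser, client IP) and a deduplication step before the sum.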
Task: aggregate abstract metrics into time intervals by timecode.
Tech: Apache Spark + Apache Ignite + Apache Kafka.
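
The core of interval aggregation is flooring each timecode to the start of its interval and grouping by (metric, interval start). The sketch below shows that step in plain Python, independent of Spark, Ignite and Kafka. It assumes records are (metric_id, timestamp_in_seconds, value) tuples and that the aggregate is the mean. Both the record layout and the choice of mean are assumptions, and the function names are hypothetical.

```python
from collections import defaultdict


def bucket_start(timestamp, interval):
    """Floor a timestamp (seconds) to the start of its interval."""
    return timestamp - (timestamp % interval)


def aggregate(records, interval=60):
    """Average values per (metric_id, interval start).

    records: iterable of (metric_id, timestamp_seconds, value) tuples (assumed layout).
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for metric_id, timestamp, value in records:
        key = (metric_id, bucket_start(timestamp, interval))
        sums[key] += value
        counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}
```

The same keying scheme carries over to a distributed setting: the (metric, interval start) pair becomes the grouping key, and the sum and count are combined per key.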
Task: create and set up an ELK Docker image for gathering and aggregating data from a public streaming API. More specifically, set up the ELK stack to observe and aggregate metrics from the Wikipedia RecentChanges streaming API: geolocation and articles edited by anonymous users.
Tech: ELK, Docker.
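
The selection of anonymous edits can be illustrated outside the ELK stack. The sketch below assumes each event arrives as one JSON object per line with `type`, `user` and `title` fields, and that anonymous editors appear with their IP address in `user`. These field names and that convention are assumptions about the feed, and the function names are hypothetical. Geolocation is left out: it would be derived from the extracted IP in a later step.

```python
import ipaddress
import json


def is_anonymous(event):
    """Treat an event as anonymous when its `user` field parses as an IP address."""
    try:
        ipaddress.ip_address(str(event.get("user", "")))
        return True
    except ValueError:
        return False


def anonymous_edits(json_lines):
    """Yield the IP and article title for each edit made by an anonymous user."""
    for line in json_lines:
        event = json.loads(line)
        if event.get("type") == "edit" and is_anonymous(event):
            yield {"ip": event["user"], "title": event.get("title")}
```

Both IPv4 and IPv6 addresses are accepted, since `ipaddress.ip_address` parses either form.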