Wd1 -

A sample GoLang course exam question by Vincent Youmans.

Skills:
- gRPC/Protocol Buffers
- Make, Protoc, Buf
- Goroutines
- Go Cobra CLI
- Understanding of adding gRPC client and server libraries

Assignment

The following test is to be implemented in Go. You can take as much time as you need, but you are not expected to spend more than 4 hours on it.

The test consists of implementing a "Web Crawler as a gRPC service". The application consists of a command line client and a local service which runs the actual web crawling. The communication between client and server should be defined as a gRPC service (*). For each URL, the Web Crawler creates a "site tree", which is a tree of links whose root is the root URL. The crawler should only follow links on the domain of the provided URL and must not follow external links. Bonus points for making it as fast as possible.
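The "site tree" and the same-domain rule described above can be sketched as follows. This is only a minimal illustration using the standard library, not the repository's implementation: the names `Node`, `resolveSameHost` and `addLinks` are hypothetical, and the links are passed in as a plain slice instead of being fetched from a page.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// Node is one page in the site tree (hypothetical type for illustration).
type Node struct {
	URL      string
	Children []*Node
}

// resolveSameHost resolves link against root and reports whether the
// result is on the same host. The fragment is dropped so that
// "/about" and "/about#team" count as the same page.
func resolveSameHost(root, link string) (string, bool) {
	r, err := url.Parse(root)
	if err != nil || r.Hostname() == "" {
		return "", false
	}
	l, err := r.Parse(link) // handles both relative and absolute links
	if err != nil {
		return "", false
	}
	if !strings.EqualFold(r.Hostname(), l.Hostname()) {
		return "", false // external link: do not follow
	}
	l.Fragment = ""
	return l.String(), true
}

// addLinks attaches the same-host links to n, skipping pages already seen.
func addLinks(n *Node, root string, links []string, seen map[string]bool) {
	for _, link := range links {
		abs, ok := resolveSameHost(root, link)
		if !ok || seen[abs] {
			continue
		}
		seen[abs] = true
		n.Children = append(n.Children, &Node{URL: abs})
	}
}

// printTree prints the tree with two spaces of indentation per level.
func printTree(n *Node, depth int) {
	fmt.Printf("%s%s\n", strings.Repeat("  ", depth), n.URL)
	for _, c := range n.Children {
		printTree(c, depth+1)
	}
}

func main() {
	root := &Node{URL: "https://www.example.com"}
	seen := map[string]bool{root.URL: true}
	addLinks(root, root.URL, []string{"/about", "https://other.org/", "/about#team"}, seen)
	printTree(root, 0)
}
```

Comparing whole hostnames is the strictest reading of "on the domain of the provided URL"; a crawler that should also follow subdomains would need a looser comparison.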

The command line client should provide the following operations:

$ crawl -start www.example.com # signals the service to start crawling www.example.com

$ crawl -stop www.example.com # signals the service to stop crawling www.example.com

$ crawl -list # shows the current "site tree" for all crawled URLs.
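The three operations above could be parsed roughly as shown here. This sketch uses the standard `flag` package because the assignment writes the options with a single dash; the repository itself lists Cobra as a skill, so treat this as an assumption-laden alternative. The `command` type and `parseArgs` function are hypothetical names, and the gRPC call is left out.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
	"io"
)

// command is the parsed client request (hypothetical type for illustration).
type command struct {
	Op  string // "start", "stop" or "list"
	URL string // empty for "list"
}

// parseArgs accepts exactly one of -start URL, -stop URL or -list.
func parseArgs(args []string) (command, error) {
	fs := flag.NewFlagSet("crawl", flag.ContinueOnError)
	fs.SetOutput(io.Discard) // keep parse errors out of stderr in this sketch
	start := fs.String("start", "", "URL to start crawling")
	stop := fs.String("stop", "", "URL to stop crawling")
	list := fs.Bool("list", false, "show the site tree for all crawled URLs")
	if err := fs.Parse(args); err != nil {
		return command{}, err
	}
	switch {
	case *start != "" && *stop == "" && !*list:
		return command{Op: "start", URL: *start}, nil
	case *stop != "" && *start == "" && !*list:
		return command{Op: "stop", URL: *stop}, nil
	case *list && *start == "" && *stop == "":
		return command{Op: "list"}, nil
	}
	return command{}, errors.New("exactly one of -start, -stop or -list is required")
}

func main() {
	// A real client would pass os.Args[1:] and then call the gRPC service;
	// fixed argument lists are used here so the sketch runs on its own.
	for _, args := range [][]string{{"-start", "www.example.com"}, {"-list"}} {
		cmd, err := parseArgs(args)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println(cmd.Op, cmd.URL)
	}
}
```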

Required GoLang Development Libraires

protoc https://github.com/golang/protobuf
BUF https://github.com/bufbuild/buf

VY Optional

Extra Points

Using the Buf CLI to build the gRPC library
Possibly writing the actual scan code as a Rust service
Adding a GitHub Action for CI/CD
A service called StartScan(), launched with go StartScan()

VY TODO

Add Rust gRPC libraries
Add Dart gRPC libraries
Add GH Actions to deploy to
- AWS EC2
- DigitalOcean

VY Comment

v01

go StartScan()
- A background process that continually scans a map of pending jobs and runs them.
- I begin the scan with go StartScan() in the server's main, just before calling serve. I use a goroutine because both serve() and StartScan() are blocking.
  But in doing this, I think I am messing up the WaitGroups that the Scan function uses.

v02

From Add, I will add the rootURL to the JOBS map and then run StartScan.

I think this is a better idea anyway, as there is no reason to scan if no jobs were added.

ScanURL - Blacklisting WEBSITE.

While developing the scan function, I repeatedly got my IP address blacklisted by the sites I was crawling, including my own website, vyoumans.com. It took me some time to realise this, which cost me development time.

notes

go get -u golang.org/x/lint/golint

MAKE

make server will build to ./build/go/server.exe

make serverTest

TODO: make the server testable

TODO: add CI/CD GitHub Actions for integration testing

TODO: GHA deploy server to some cloud

TODO: create a Dart gRPC client

TODO: do a Rust server

TODO: do video

TODO: implement a fetch test that only returns child URLs of the RootURL. Just forgot to do it.

TODO: figure out a cancel strategy, because by the time the app is added it's too late to cancel, as the job is already finished.

perhaps some sort of interrupt

TODO: video of how things work

TODO: clean up the code a bit.

NOTE

Using Buf to build the gRPC library

Instead of using protoc to create the libraries, I am using Buf.

buf lint

buf generate

make server

make client

example commands

to start the server: ./crawlserver
to monitor the server as a stream: ./crawl monitor
to add a URL to crawl: ./crawl add --rootURL="https://www.wix.com"
