green-db table can not be joined with scraping table based on id

Currently it is not possible to relate information of the `scraping` table to its corresponding extracted product information in the `green-db` table via `id`. If we want to join the tables we currently have to use `timestamp`, `url` and `category`. 

We already use the `id`, to retrieve a specific row in the `scraping` table, but the `id` is not used any further when writing the extracted product information into the `green-db`, see:
https://github.com/calgo-lab/green-db/blob/90b631bf81b7408d496534bd75d142e7c563c84d/workers/workers/extract.py#L36-L39

The `green-db` table already has an `id` column, but this is autogenerated, see: https://github.com/calgo-lab/green-db/blob/90b631bf81b7408d496534bd75d142e7c563c84d/database/database/tables.py#L203

So, integrating this shouIdn't be a lot of work and would help whenever we want to use information from `scraping table` together with `green-db` table. For example using the HTML together with the extracted product information for some ML.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

green-db table can not be joined with scraping table based on id #78

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

	scraped_page = CONNECTION_FOR_TABLE[table_name].get_scraped_page(id=row_id)

	if product := extract_product(table_name=table_name, scraped_page=scraped_page):
	green_db_connection.write(product)

green-db table can not be joined with scraping table based on id #78

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions