Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
57 commits
Select commit Hold shift + click to select a range
025417f
landing-page
en-GB Dec 12, 2022
e80c70b
utils
en-GB Dec 12, 2022
ac69af3
use minify-html
en-GB Dec 12, 2022
ffe5c30
plotly
en-GB Dec 19, 2022
e5c7a45
cleanup
en-GB Dec 19, 2022
147f49e
lints
en-GB Dec 19, 2022
919da15
rename output to index.html
en-GB Dec 19, 2022
d5e9d68
excerpt and plots
en-GB Dec 23, 2022
f9fd719
cleanup
en-GB Dec 23, 2022
bf51acc
excerpt font-weight
en-GB Dec 23, 2022
4dcf3e3
brand leaderboard
en-GB Dec 23, 2022
6d34e88
run from production database
en-GB Dec 23, 2022
a5398ab
re add scraper diagram
en-GB Jan 3, 2023
0419131
adjustments
en-GB Jan 3, 2023
bae0a2f
minor things
en-GB Jan 5, 2023
2395115
excerpt load-more button, table scroll bars
en-GB Jan 13, 2023
ca1aa4a
forgot
en-GB Jan 13, 2023
cde9563
removals
en-GB Jan 16, 2023
ae379a1
removals and darkmode
en-GB Jan 18, 2023
63e4316
lints
en-GB Jan 18, 2023
86ee9b7
Create CNAME
se-jaeger Jan 19, 2023
77890af
Delete CNAME
se-jaeger Jan 19, 2023
000c274
stats
en-GB Jan 19, 2023
7a7bec7
Merge branch 'landing-page' of https://github.com/calgo-lab/green-db …
en-GB Jan 19, 2023
9a31acc
cleanup
en-GB Jan 19, 2023
4e133c3
quick fixes
en-GB Jan 26, 2023
e938549
plot on top
en-GB Jan 26, 2023
52d964f
phrasing
en-GB Jan 27, 2023
d0dc679
category typo hack
en-GB Jan 27, 2023
a7bdc14
move index.html to /
en-GB Jan 27, 2023
5832470
filter sample products by similarity
en-GB Jan 27, 2023
fca2d66
run from production db
en-GB Jan 27, 2023
8f866fd
fix stats in chrome
en-GB Feb 1, 2023
ab61e3d
rename readme
en-GB Feb 8, 2023
e08d29f
label table popup
en-GB Feb 8, 2023
9bb6d3e
fix empty label descriptions
en-GB Feb 8, 2023
d74e607
reset overflow on body
en-GB Feb 8, 2023
92edafe
lints
en-GB Feb 8, 2023
ba3b949
use english titles and descriptions if possible
en-GB Feb 8, 2023
33849b6
schema.org link
en-GB Feb 8, 2023
82f9376
reflow label table
en-GB Feb 8, 2023
54fa8ea
fix overlay margins
en-GB Feb 8, 2023
566d797
label table friendliness
en-GB Feb 14, 2023
ae48f04
db excerpt fix
en-GB Feb 16, 2023
0ba7e68
some cleanup
en-GB Feb 17, 2023
09ad050
label button clickability
en-GB Feb 28, 2023
001906f
uniform hover caver
en-GB Feb 28, 2023
57e5606
Add Search demo and new publication.
BigDatalex Mar 20, 2023
37d0cc3
new links
en-GB Mar 21, 2023
0f41312
merge updated dependencies
en-GB Mar 22, 2023
fe1fa83
instructions for rebuilding
en-GB Mar 22, 2023
436b6f5
more instructions
en-GB Mar 22, 2023
8c873f6
black
en-GB Mar 22, 2023
d15c0ff
Update landing page.
BigDatalex Jun 14, 2023
95cf15f
Update stats.
BigDatalex Nov 27, 2023
8764c62
Add monitoring link.
BigDatalex Nov 27, 2023
435dba1
Add monitoring to index.html.
BigDatalex Nov 27, 2023
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 3 additions & 3 deletions core/core/sustainability_labels/bootstrap_database.py
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@
def _get_localized_certificate_attribute(
certificate_information: Dict[str, Dict[str, Any]],
attribute: str,
language_order: List[str] = ["de", "en", "fr"],
language_order: List[str] = ["en", "de", "fr"],
) -> str:
"""
Helper function to retrieve an `attribute` from `certificate_information`
Expand All @@ -27,7 +27,7 @@ def _get_localized_certificate_attribute(
str: of the corresponding attribute in one language.
"""
for language in language_order:
if language in certificate_information["languages"].keys():
if language in certificate_information["languages"]:
return certificate_information["languages"][language][attribute]
return ""

Expand Down Expand Up @@ -58,5 +58,5 @@ def _get_localized_certificate_attribute(
),
social_conflict_minerals=certificate_information.get("social_conflict_minerals", None),
)
for certificate_id, certificate_information in load_and_get_sustainability_labels().items()
for certificate_id, certificate_information in certificates.items()
]
60 changes: 60 additions & 0 deletions index.html

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions landing-page/.gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
poetry.lock
4 changes: 4 additions & 0 deletions landing-page/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,4 @@
to rebuild the landing page run `python landing-page.py --no-cache` and push index.html

make sure to portforward the postgres pod if you pass --no-cache.
if you run `python landing-page.py` (without --no-cache), it will use old data and it will be alot faster. use it for quickly testing things.
1 change: 1 addition & 0 deletions landing-page/landing_page/__init__.py
Original file line number Diff line number Diff line change
@@ -0,0 +1 @@
__version__ = "0.1.0"
13,064 changes: 13,064 additions & 0 deletions landing-page/landing_page/cache.py

Large diffs are not rendered by default.

70 changes: 70 additions & 0 deletions landing-page/landing_page/landing-page.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,70 @@

:: section#header div.container div.row

# GreenDB: Sustainability Data for Consumer Products

:: section div.container

::: div.row
:::: column.weight=5

GreenDB is a publicly available database of sustainable products, scraped from European online shops on a weekly basis. As proxy for the products’ sustainability, it relies on sustainability labels, which are evaluated by experts. The GreenDB schema extends the well-known [Schema.org](https://www.schema.org){.link} Product definition and is compatible with standardized fine grained product taxonomies such as [GS1](https://gpc-browser.gs1.org){.link}.

:::: column.weight=3 div.stats content.name=stats

::: div.row div.ff

[Data](https://doi.org/10.5281/zenodo.6078038){.btn}[Publications](#publications){.btn}[Demos](#demos){.btn}[Github](https://github.com/calgo-lab/green-db/){.btn}

:: section div.container div.row content.name=plot_category_cred

:: section div.container div.row
## Sample
::: div.ff content.name=excerpt

:: section div.container
## Schema

### Products

::: div.row div.ff div.table-wrapper
| **column name** | timestamp | url | source | merchant | country | category | name | description | brand | sustainability_labels | price | currency | image_urls | gender | consumer_lifestage | colors | sizes | gtin | asin
| -: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-:
| **data type** | timestamp | text | text | text | text | text | text | text | text | array[text] | numeric | text | array[text] | text | text | array[text] | array[text] | int | text
| **nullable** | no | no | no | no | no | no | no | no | no | no | no | no | no | yes | yes | yes | yes | yes | yes
:::

### Sustainability labels

::: div.row div.ff div.table-wrapper
| **column name** | id | timestamp | name | description | cred_credibility | eco_chemicals | eco_lifetime | eco_water | eco_inputs | eco_quality | eco_energy | eco_waste_air | eco_environmental_management | social_labour_rights | social_business_practice | social_social_rights | social_company_responsibility | social_conflict_minerals
| -: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-: | :-:
| **data type** | text | timestamp | text | text | int4 | int4 | int4 | int4 | int4 | int4 | int4 | int4 | int4 | int4 | int4 | int4 | int4 | int4
| **nullable** | no | no | no | no | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes | yes

:: section#demos div.container

## Demos

::: div.row div.ff
* [Product search demo](https://search.demo.calgo-lab.de/){.link}
* [Automated extraction of product information](https://product-classification.demo.calgo-lab.de/){.link}
* [Monitoring](https://monitoring.demo.calgo-lab.de/){.link}

:: section#publications div.container

## Publications

::: div.row div.ff

* [Flick, A., Jäger, S., Trajanovska, I., Biessmann, F. (2023). Automated Extraction of Fine-Grained Standardized Product Information from Unstructured Multilingual Web Data. In: , et al. Advances in Information Retrieval. ECIR 2023. Lecture Notes in Computer Science, vol 13982. Springer, Cham.](https://doi.org/10.1007/978-3-031-28241-6_19){.link}
* [Jäger, S., Greene, J., Jakob, M., Korenke, R., Santarius, T., and Bießmann, F. (2022). GreenDB: Toward a Product-by-Product Sustainability Database. ArXiv, abs/2205.02908.](https://arxiv.org/abs/2205.02908){.link}
* [Jäger, S., Flick, A., Sanchez Garcia, J. A., von den Driesch, K., Brendel, K., and Bießmann, F. (2022). GreenDB: A Dataset and Benchmark for Extraction of Sustainability Information of Consumer Goods., ArXiv, abs/2207.10733.](https://arxiv.org/abs/2207.10733){.link}

:: footer div.container div.row

.

:: div.overlay#label_overlay div.container div.row
::: column.weight=5 content.name=label_name content.name=label_description
::: column.weight=3 content.name=label_data
Loading