Create dedup_geogs_more.ipynb #2420

Draft
sianteesdale wants to merge 1 commit into main from st/dedup-entities-analysis

Conversation

@sianteesdale
Contributor

Data Template

Ticket

Additional information:

  • Putting my current deduplication analysis for A4D, LBO, TPZ and Tree into a draft PR in case anyone wants to check my work whilst I'm on training/AL.

Don't merge into main yet.

Example of 25 mismatches between transformed resources and dataset_resource:

| dataset | name | organisation | transformed_resources_count | dataset_resource_count | transformed_vs_dataset_resource_match |
| --- | --- | --- | --- | --- | --- |
| article-4-direction-area | North Norfolk District Council | local-authority:NNO | 42.0 | NaN | mismatch |
| tree | Tandridge District Council | local-authority:TAN | 1678.0 | 0.0 | mismatch |
| tree | Gloucester City Council | local-authority:GLO | 1358.0 | 0.0 | mismatch |
| listed-building-outline | Hart District Council | local-authority:HAT | 1954.0 | 0.0 | mismatch |
| listed-building-outline | Central Bedfordshire Council | local-authority:CBF | 1918.0 | NaN | mismatch |
| tree-preservation-zone | St Albans City and District Council | local-authority:SAL | 544.0 | NaN | mismatch |
| article-4-direction-area | Stoke-on-Trent City Council | local-authority:STE | 27.0 | NaN | mismatch |
| tree-preservation-zone | Castle Point Borough Council | local-authority:CAS | 179.0 | NaN | mismatch |
| article-4-direction-area | Cotswold District Council | local-authority:COT | 22.0 | NaN | mismatch |
| listed-building-outline | North Tyneside Council | local-authority:NTY | 221.0 | 0.0 | mismatch |
| tree | East Riding of Yorkshire Council | local-authority:ERY | 3476.0 | 0.0 | mismatch |
| tree-preservation-zone | Great Yarmouth Borough Council | local-authority:GRY | 314.0 | NaN | mismatch |
| tree | North Somerset Council | local-authority:NSM | 4127.0 | 0.0 | mismatch |
| listed-building-outline | London Borough of Brent | local-authority:BEN | 582.0 | 0.0 | mismatch |
| article-4-direction-area | Great Yarmouth Borough Council | local-authority:GRY | 18.0 | NaN | mismatch |
| tree-preservation-zone | North Somerset Council | local-authority:NSM | 1484.0 | NaN | mismatch |
| tree | South Gloucestershire Council | local-authority:SGC | 5412.0 | 0.0 | mismatch |
| article-4-direction-area | Huntingdonshire District Council | local-authority:HUN | 6.0 | NaN | mismatch |
| tree-preservation-zone | Tandridge District Council | local-authority:TAN | 885.0 | NaN | mismatch |
| listed-building-outline | London Borough of Lewisham | local-authority:LEW | 364.0 | NaN | mismatch |
| listed-building-outline | Liverpool City Council | local-authority:LIV | 1538.0 | 0.0 | mismatch |
| listed-building-outline | Buckinghamshire Council | local-authority:BUC | NaN | NaN | mismatch |
| tree | London Borough of Southwark | local-authority:SWK | 2503.0 | 0.0 | mismatch |
| tree-preservation-zone | South Cambridgeshire District Council | local-authority:SCA | 2210.0 | NaN | mismatch |
| tree | London Borough of Waltham Forest | local-authority:WFT | 2043.0 | 0.0 | mismatch |
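
For anyone checking the numbers, here is a minimal sketch of how a flag like `transformed_vs_dataset_resource_match` could be derived with pandas, assuming a plain equality check between the two count columns. The function name `flag_count_mismatches` and the comparison rule are assumptions for illustration and are not taken from the notebook. Under this assumption a row with NaN in both columns is still flagged as a mismatch, because NaN never compares equal to NaN, which is consistent with the Buckinghamshire Council row above.

```python
import numpy as np
import pandas as pd


def flag_count_mismatches(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical helper: label each row 'match' or 'mismatch'.

    Assumes the flag is a simple equality check between the two count
    columns; the notebook may use a different rule.
    """
    out = df.copy()
    # Elementwise equality: NaN == NaN evaluates to False, so rows with a
    # missing count on either side (or both) come out as 'mismatch'.
    same = out["transformed_resources_count"] == out["dataset_resource_count"]
    out["transformed_vs_dataset_resource_match"] = np.where(
        same, "match", "mismatch"
    )
    return out


# Three rows copied from the table above.
sample = pd.DataFrame(
    {
        "dataset": ["tree", "listed-building-outline", "article-4-direction-area"],
        "organisation": [
            "local-authority:TAN",
            "local-authority:BUC",
            "local-authority:NNO",
        ],
        "transformed_resources_count": [1678.0, np.nan, 42.0],
        "dataset_resource_count": [0.0, np.nan, np.nan],
    }
)

flagged = flag_count_mismatches(sample)
print(flagged[["dataset", "organisation", "transformed_vs_dataset_resource_match"]])
```

If rows where both counts are missing should be treated as a match instead, the comparison would need an explicit `isna()` check on both columns.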
