Releases: birdflyi/db_engines_ranking_table_crawling
db_engines_ranking_table_crawling-v7.4.202312.1
Update Data: Update manulabeled data for 202312.
db_engines_ranking_table_crawling-v7.4.202310
Update Data: Update manulabeled data for 202310;
Fix bug: Fix the missing ',' delimiter in multi_model_extract_span.text;
Add Func Option: Add an optional parameter drop_unused_default_values to function auto_gen_dbms_model_type_dict_by_keys.
db_engines_ranking_table_crawling-v7.3.202306.01
Update Data: Update manulabeled data for 202306.
db_engines_ranking_table_crawling-v7.3.202306
Update Data and update_conf: Update data for 202306; update update_conf settings in main.py.
db_engines_ranking_table_crawling-v7.2.202305
Refactor code: Refactor the recursive exit logic; fix the bug where use_cols did not take effect.
db_engines_ranking_table_crawling-v7.1.202305
Update Data and log info: Update data for 202305; use "order_id_start_end" log info instead of "idx_start_end" in function crawling_dbms_infos_soup.
db_engines_ranking_table_crawling-v7.0.202304
Rename Column: Rename column "has_open_source_github_repo" to "has_github_repo" to avoid disputes.
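
For anyone migrating older CSV files to the renamed column, the following is a minimal sketch using only the standard library. The helper name `rename_column`, the `DBMS` column, and the sample row are hypothetical and not taken from the repository; only the two column names come from the release note.

```python
import csv
import io

OLD_NAME = "has_open_source_github_repo"  # column name before v7.0
NEW_NAME = "has_github_repo"              # column name from v7.0 on


def rename_column(csv_text, old=OLD_NAME, new=NEW_NAME):
    """Return csv_text with header `old` renamed to `new`; data rows are unchanged.

    Hypothetical helper for illustration, not the project's own code.
    """
    rows = list(csv.reader(io.StringIO(csv_text)))
    if not rows:
        return csv_text
    rows[0] = [new if name == old else name for name in rows[0]]
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(rows)
    return out.getvalue()


# Illustrative data only; real files have more columns.
sample = "DBMS,has_open_source_github_repo\nPostgreSQL,Y\n"
renamed = rename_column(sample)
```

Only the header row is rewritten, so files that already use the new name pass through unchanged.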
db_engines_ranking_table_crawling-v6.0
Use new data format: Delete the redundant column 'id' from 'xxx_automerged.csv' and 'xxx_automerged_manulabeled.csv'.
Update: Update and manulabel the data for 202304; add function merge_info_start_checkpoint_last_month_manulabeled to reuse last month's manulabeled information.
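
The two changes in this release (dropping the 'id' column and reusing last month's labels) can be pictured roughly as below. This is a sketch under stated assumptions, not the actual implementation of merge_info_start_checkpoint_last_month_manulabeled, whose signature is not given in these notes. The function names `carry_forward_labels` and `drop_column`, the key column `name`, and the label column `label` are all hypothetical.

```python
def carry_forward_labels(current_rows, last_month_rows, key, label_cols):
    """Fill empty label columns in this month's rows from last month's rows.

    Rows are matched on `key`. Labels already set this month are kept.
    Hypothetical helper for illustration only.
    """
    previous = {row[key]: row for row in last_month_rows}
    merged = []
    for row in current_rows:
        new_row = dict(row)
        old = previous.get(row[key], {})
        for col in label_cols:
            if not new_row.get(col):
                new_row[col] = old.get(col, "")
        merged.append(new_row)
    return merged


def drop_column(rows, column):
    """Return copies of the rows without `column` (e.g. the redundant 'id')."""
    return [{k: v for k, v in row.items() if k != column} for row in rows]


# Illustrative data only.
last_month = [{"id": "1", "name": "PostgreSQL", "label": "Y"}]
this_month = [
    {"id": "1", "name": "PostgreSQL", "label": ""},
    {"id": "2", "name": "NewDB", "label": ""},
]
result = drop_column(
    carry_forward_labels(this_month, last_month, "name", ["label"]), "id"
)
```

Entries that are new this month keep an empty label, so they remain visible as rows that still need manual labeling.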
db_engines_ranking_table_crawling-v5.3
Update Data: Update manulabeled data in ranking_crawling_202303_automerged_manulabeled.csv
db_engines_ranking_table_crawling-v5.2
Change Encoding: Change the encoding of the data files to "utf-8"; this affects 4 csv files.
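
A re-encoding step of this kind might look like the sketch below. The release note does not say which encoding the files used before, so the `"gbk"` value here is purely an illustrative assumption, as are the helper name `reencode_to_utf8` and the demo file.

```python
import tempfile
from pathlib import Path


def reencode_to_utf8(path, source_encoding):
    """Rewrite the file at `path` as UTF-8, decoding it with `source_encoding`.

    Works on raw bytes so that line endings are preserved exactly.
    Hypothetical helper for illustration only.
    """
    p = Path(path)
    text = p.read_bytes().decode(source_encoding)
    p.write_bytes(text.encode("utf-8"))


# Demo on a temporary file; "gbk" is an assumed source encoding.
with tempfile.TemporaryDirectory() as tmp:
    demo = Path(tmp) / "demo.csv"
    original_text = "名称,值\r\n数据库,1\r\n"
    demo.write_bytes(original_text.encode("gbk"))
    reencode_to_utf8(demo, "gbk")
    converted_bytes = demo.read_bytes()
```

Decoding with the wrong source encoding either raises `UnicodeDecodeError` or silently produces garbled text, so it is worth checking a few rows after conversion.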