@dkorchevgithub adding lines suggested by @EtoDemerzel0427 to fix "g… #13

dkorchevgithub · 2021-07-14T04:22:25Z

patch for the latest dlrm
updated Docker CPU as suggested in

updated for issues DLRM inference README out of date mlcommons/inference#917 and DLRM: save downloaded and generated files outside of Docker containers mlcommons/inference#604
Update README.md
updating readme
fixed typo noticed by @psyhtest
adding lines suggested by @EtoDemerzel0427 to fix "good" count

Co-authored-by: Anton Lokhmotov psyhtest@users.noreply.github.com
Co-authored-by: rameshchukka rnaidu02@yahoo.com

) * Update fid_score.py | Move the sys path update before the import. Currently the sys path is meaningless and this change was done by the automated code formatting * Update format.yml --------- Co-authored-by: arjunsuresh <arjunsuresh@users.noreply.github.com>

…1905)

* Update GNN reference implementation: add DGL backend * [Automated Commit] Format Codebase * Update README.md

… error in DLRMv2 implementation #1909, fixes preprocess_submission (#1910) * Update generate_final_report.py * Fix sdxl (#1911) * Fix typo in fid_score.py, fail_safe for SDXL short runs * [Automated Commit] Format Codebase * Fix typo in fid_score.py, fail_safe for SDXL short runs * Fix dlrmv2 reference implementation | Update run_local.sh * Fixes for filtering invalid results * [Automated Commit] Format Codebase * Update preprocess_submission.py * Added an option to pass in sample_ids.txt for SDXL accuracy check * [Automated Commit] Format Codebase * Update accuracy_coco.py * [Automated Commit] Format Codebase * Fix typo * Not use default for sample_ids.txt --------- Co-authored-by: arjunsuresh <arjunsuresh@users.noreply.github.com>

Updating the pip packages

* Update generate_final_report.py * Fix sdxl (#1911) * Fix typo in fid_score.py, fail_safe for SDXL short runs * [Automated Commit] Format Codebase * Fix typo in fid_score.py, fail_safe for SDXL short runs * Fix dlrmv2 reference implementation | Update run_local.sh * Fixes for filtering invalid results * [Automated Commit] Format Codebase * Update preprocess_submission.py * Added an option to pass in sample_ids.txt for SDXL accuracy check * [Automated Commit] Format Codebase * Update accuracy_coco.py * [Automated Commit] Format Codebase * Fix typo * Not use default for sample_ids.txt * Update requirements.txt (#1907) Updating the pip packages * Fix preprocess_sudbmission for a bug * Update submission_checker.py | Removed TEST05 --------- Co-authored-by: arjunsuresh <arjunsuresh@users.noreply.github.com>

Signed-off-by: Joyce Brum <joycebrum@google.com> Co-authored-by: Mitchelle Rasquinha <80070689+mrasquinha-g@users.noreply.github.com>

Co-authored-by: Mitchelle Rasquinha <80070689+mrasquinha-g@users.noreply.github.com>

* - Fixed syntax error on list .insert method * Update fid_score.py --------- Co-authored-by: Arjun Suresh <arjun@gateoverflow.com> Co-authored-by: Arjun Suresh <arjunsuresh1987@gmail.com>

Docs update

* 🔄 synced local 'tools/submission/power/power_checker.py' with remote 'compliance/check.py' * [Automated Commit] Format Codebase --------- Co-authored-by: mlcommons-bot <null> Co-authored-by: mlcommons-bot <mlcommons-bot@users.noreply.github.com> Co-authored-by: Mitchelle Rasquinha <80070689+mrasquinha-g@users.noreply.github.com> Co-authored-by: Arjun Suresh <arjunsuresh1987@gmail.com> Co-authored-by: Arjun Suresh <arjun@gateoverflow.com>

https://huggingface.co/docs/huggingface_hub/main/en/guides/cli Co-authored-by: Arjun Suresh <arjunsuresh1987@gmail.com> Co-authored-by: Miro <mirhodak@amd.com>

* Initial codebase llama3-405b reference implementation * Add VLLM backend * Prune llama2 files & Update commands * Update evaluate accuracy script * Fix minor issues * Add Llama3 configuration * [Automated Commit] Format Codebase * Set tensor_parallel_size to 8 * Add requirements.txt * Remove consolidate_results.py * Host dataset + add instructions to download it * [Automated Commit] Format Codebase * Fix dockerfile + update README --------- Co-authored-by: pgmpablo157321 <pgmpablo157321@users.noreply.github.com>

…1948) * Update generate_final_report.py * Fix sdxl (#1911) * Fix typo in fid_score.py, fail_safe for SDXL short runs * [Automated Commit] Format Codebase * Fix typo in fid_score.py, fail_safe for SDXL short runs * Fix dlrmv2 reference implementation | Update run_local.sh * Fixes for filtering invalid results * [Automated Commit] Format Codebase * Update preprocess_submission.py * Added an option to pass in sample_ids.txt for SDXL accuracy check * [Automated Commit] Format Codebase * Update accuracy_coco.py * [Automated Commit] Format Codebase * Fix typo * Not use default for sample_ids.txt * Update requirements.txt (#1907) Updating the pip packages * Fix preprocess_sudbmission for a bug * Update submission_checker.py | Removed TEST05 * Fix to SDXL accuracy output * Added exists checks for rmtree in preprocess_submission script * [Automated Commit] Format Codebase * Delete .github/workflows/format.yml * Delete .github/scripts directory * Update build_wheels.yml | Added src distribution * Update VERSION.txt * Update build_wheels.yml * Update VERSION.txt * Update pyproject.toml * Increment version to 4.1.26 * Update MANIFEST.in * Increment version to 4.1.27 * Update pyproject.toml * Increment version to 4.1.28 * Update build_wheels.yml * Update VERSION.txt * Update accuracy_coco.py * Making sdxl run thread safe * Create format.yml | Run format on push instead of PR --------- Co-authored-by: arjunsuresh <arjunsuresh@users.noreply.github.com>

* Remove GLT implementation for GNN * Update Readme: remove GLT and add checkpoint * Update mlperf.conf file * Dockerize rgat benchmark * Update README * Add gnn accuracy script * Remove mlperf.conf from GNN reference implementation * Add minimun system requirements to README * Fix paths in accuracy script * Fix typing in accuracy script

* Fix llama3 multi-gpu issue * Fix identation * Remove llama2 outdated references * Fix run_evaluation argument

* Submission checker v5.0 * [Automated Commit] Format Codebase --------- Co-authored-by: pgmpablo157321 <pgmpablo157321@users.noreply.github.com> Co-authored-by: Arjun Suresh <arjun@gateoverflow.com>

* Fix loadgen build for version numbers having "0" * Update test-resnet50.yml * Update test-retinanet.yml * Update test-bert.yml

Co-authored-by: Miro <mirhodak@amd.com>

* Fix submission checker for v5.0 rgat * Update submission_checker.py | Updates for v5.0 * [Automated Commit] Format Codebase * Update submission_checker.py | Fixes latency_constraints for v5.0 * [Automated Commit] Format Codebase --------- Co-authored-by: mlcommons-bot <mlcommons-bot@users.noreply.github.com>

* add numbers into labels add * remove euro, cent...etc --------- Co-authored-by: hanyunfan <frank.han@dell.com>

…2369)

…rt R2 (#2371)

* Updation for documentation page: migration to R2 * fix for llama2 assets download instructions

…#2367)

…patibility (#2344) * [Automated Commit] Format Codebase * Updated tags for submission checker command in docs * Update mobilenets docs * Update main.py * Update main.py * update dataset download commands - waymo calib (#2130) * Merge from Master (#2155) * Update submission_checker.py | Fix open model unit in Results (#2144) * Add Llama 3.1 to special unit dict (#2150) --------- Co-authored-by: Pablo Gonzalez <pablo.gonzalez@factored.ai> * [Automated Commit] Format Codebase * Inference docs - Update model and dataset download commands (#2153) * Update llama2 70b model download docs * changes in model and dataset download commands * add powershell command to get result folder structure (#2156) * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * Fix Typo in Interactive Latencies (#2147) (#2225) * Fix Typo in Interactive Latencies * Update submission_checker.py * Fix Typo in Interactive Latencies (#2147) (#2226) * Fix Typo in Interactive Latencies * Update submission_checker.py --------- Co-authored-by: Miro <mirhodak@amd.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> * Update MLCFlow commands for v5.1 (#2237) * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * Update main.py * [Automated Commit] Format Codebase * updating for 5.1-dev (inference doc) * [Automated Commit] Format Codebase * fix typo * [Automated Commit] Format Codebase * Update main.py * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * Doc updates (#2292) * improve submission doc * Update index.md * Fix for model and dataset download commands * update submission doc * [Automated Commit] Format Codebase * Update index.md * r2_downloader -> r2-downloader * Update multithreading information about SDXL * [Automated Commit] Format Codebase * .lower() for consistency * [Automated Commit] Format Codebase * updation for llama3_1-8b edge * [Automated Commit] Format Codebase --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Arjun Suresh <arjun@gateoverflow.com> * Add quiet flags to MLC commands (#2309) * Improve docs - submission generation (#2311) * [Automated Commit] Format Codebase * Refactor run_verification scripts to improve OS compatibility * [Automated Commit] Format Codebase * Refactor accuracy verification to use file reading instead of shell commands for compatibility * [Automated Commit] Format Codebase * Update folder structure by removing Nvidia folder --------- Co-authored-by: github-actions <github-actions@github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Arjun Suresh <arjun@gateoverflow.com> Co-authored-by: ANANDHU S <71482562+anandhu-eng@users.noreply.github.com> Co-authored-by: Nathan Wasson <nathanw@mlcommons.org> Co-authored-by: Pablo Gonzalez <pablo.gonzalez@factored.ai> Co-authored-by: Miro <mirhodak@amd.com>

…entation (#2381) * Initial commit. * WIP * [Automated Commit] Format Codebase * misc * adding pydantic_typer * offline WIP * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * rename the notebook * clean-up * [Automated Commit] Format Codebase * Downgrade from 3.13 to 3.12 * [Automated Commit] Format Codebase * send the response back to LoadGen one at a time * Move the ownership of the AsyncOpenAI client into Task, and clean up the client, event loop and the event loop thread * [Automated Commit] Format Codebase * fixing typo * [Automated Commit] Format Codebase * allowing --settings.min_duration to take in float or int as seconds * fix lint * [Automated Commit] Format Codebase * Parametrize use_token_latencies * [Automated Commit] Format Codebase * fix typos * [Automated Commit] Format Codebase * update README --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* initial server version * change description for parameter use_token_latencies * [Automated Commit] Format Codebase * changes based on PR comments

* Update submission checker * Add submission structure description in markdown file * Add description of placeholders in submission structure * Update preprocess submission * Add directory structure samples * Structure documentation fixes * Add power to submission structure documentation * Fixes for preprocessing submission * To be reverted: point to custom inference branch * To be reverted: use custom mlc automation branch * To be reverted: use custom branch for automation and inference repo * To be reverted: use custom branch for mlcflow and inference repo * Turn off the action * To be reverted: custom branches for mlcflow and inference repo --------- Co-authored-by: ANANDHU S <71482562+anandhu-eng@users.noreply.github.com>

* Initial proposal for VLM - evaluation * address review comments and test hiclass implementation * [Automated Commit] Format Codebase * additional fixes to reviews * [Automated Commit] Format Codebase * address PR comments * [Automated Commit] Format Codebase * add a more detail description of the field dataset.split * Enable exception logging in _query_endpoint_async * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * Trigger CI/CD pipeline * Add performance_sample_count_override as a CLI flag. * [Automated Commit] Format Codebase * add json format to queries * [Automated Commit] Format Codebase * added schema file and made necessary changes * [Automated Commit] Format Codebase * refactoring and linting * [Automated Commit] Format Codebase * Add Dockerfile * Add use_guided_decoding to let user choose to use guided_decoding or not. * [Automated Commit] Format Codebase * add f1 scores of uniform random selection * [Automated Commit] Format Codebase * Enabling mlperf-inf-mm-vl2l benchmark vllm. * [Automated Commit] Format Codebase * Commit to trigger the GitHub Actions in inference PR * empty commit --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Shang Wang <samshang.wang@mail.utoronto.ca> Co-authored-by: ANANDHU S <71482562+anandhu-eng@users.noreply.github.com> Co-authored-by: Shang Wang <shangw@nvidia.com>

* add changes to brand field * fix format * [Automated Commit] Format Codebase * delete extra check in calculate_brand_f1_score * reflect the latest version of the dataset * [Automated Commit] Format Codebase * change default server_expected_qps to 10 * fix errors in task.py * add format fixes * add request timeout * [Automated Commit] Format Codebase * fix hanging * update readme * small fix * [Automated Commit] Format Codebase * empty commit --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Shang Wang <samshang.wang@mail.utoronto.ca> Co-authored-by: Shang Wang <shangw@nvidia.com>

…t (that we are going to freeze) (#2406) * notebook WIP * Update the notebook to reflect the latest version of the dataset * update * update * [Automated Commit] Format Codebase * empty commit --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* Update loadgen to 6.0; Bulk update 6.0 checker bits * Fix qwen3vl

* Add tests to detect failures in igbh dataset download * Update Python version in CI workflow

* text to video ref implementation * update dataset and readme * move files to wan folder

…ast if the underying `vllm serve` process already failed (#2409) * enable VllmDeployer to fail fast if the underying vllm process failed. * example slurm script for submitting jobs * fix slurm scripts * small fix * [Automated Commit] Format Codebase * Update the readme about the example slurm scripts. * Change the default endpoint startup timeout to 1 hour in case someone need to download the model for the fisrt time. * change servr expected qps and target latency * Change the default dataset repo_id to the new name of the public dataset * [Automated Commit] Format Codebase * evaluate the json file with multiprocess * [Automated Commit] Format Codebase * change default server_target_latency to 12 * revert evaluation changeS * [Automated Commit] Format Codebase * update slurm script * update slurm script * revert evaluation.py changes after analysing the discrepancy in is_secondhand f1 score * [Automated Commit] Format Codebase * linting * [Automated Commit] Format Codebase * lock in model and dataset SHA * [Automated Commit] Format Codebase * Specify model quality target and server target latency in the README * Update loadgen/mlperf.conf * aligning TestSettings'C++ code with its python binding * [Automated Commit] Format Codebase * remove ttft and tpot from mlperf.conf * Enable CLI to take in user.conf * [Automated Commit] Format Codebase * readme * readme * rename vl2l -> q3vl * [Automated Commit] Format Codebase * empty * rerun ci * rerun ci * Introduce sampling parameters * [Automated Commit] Format Codebase * [Automated Commit] Format Codebase * empty * move CFLAGS="-std=c++14 -O3" into extra_compile_args of Pybind11Extension * [Automated Commit] Format Codebase * enable specifying loadgen source in the Dockerfile * update slurm scripts * Maintain None as the default value for the sampling params * [Automated Commit] Format Codebase * update readme * [Automated Commit] Format Codebase * empty --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: John Calderon <jcalderon@nvidia.com>

* Initial draft for Inference submission guide * Update index.md

arjunsuresh and others added 30 commits November 6, 2024 22:30

Update build_wheels.yml | Recommendation from MLCommons Admins (#1895)

3910973

Fix SDXL bug with diffuser 0.30.0 while giving downloaded model path (#…

b0686db

…1905)

Update GNN reference implementation: add DGL backend (#1903)

d3c01ed

* Update GNN reference implementation: add DGL backend * [Automated Commit] Format Codebase * Update README.md

Update artifact actions from v3 to v4 in build_wheels.yml (#1906)

dffd292

Update requirements.txt (#1907)

e398b90

Updating the pip packages

Removed TEST05 compliance requirement (#1925)

4763c6b

Fix: GitHub workflow script injection (#1936)

b2b346c

Signed-off-by: Joyce Brum <joycebrum@google.com> Co-authored-by: Mitchelle Rasquinha <80070689+mrasquinha-g@users.noreply.github.com>

Add GNN calibration ids (#1916)

4d7a5c8

Co-authored-by: Mitchelle Rasquinha <80070689+mrasquinha-g@users.noreply.github.com>

- Fixed syntax error on list .insert method (#1921)

acc3eb4

* - Fixed syntax error on list .insert method * Update fid_score.py --------- Co-authored-by: Arjun Suresh <arjun@gateoverflow.com> Co-authored-by: Arjun Suresh <arjunsuresh1987@gmail.com>

Sync Docs (#1908)

61a8a6a

Docs update

Fix Dockerfile.eval: huggingface-cli doesn't exist (#1805)

d28a530

https://huggingface.co/docs/huggingface_hub/main/en/guides/cli Co-authored-by: Arjun Suresh <arjunsuresh1987@gmail.com> Co-authored-by: Miro <mirhodak@amd.com>

Increment version to 4.1.24

feca9f6

[Automated Commit] Format Codebase

6ac00b1

Increment version to 4.1.30

e7c301c

Llm large fixes (#1959)

21d845e

* Fix llama3 multi-gpu issue * Fix identation * Remove llama2 outdated references * Fix run_evaluation argument

Submission checker v5.0 (#1955)

dd9be96

* Submission checker v5.0 * [Automated Commit] Format Codebase --------- Co-authored-by: pgmpablo157321 <pgmpablo157321@users.noreply.github.com> Co-authored-by: Arjun Suresh <arjun@gateoverflow.com>

Upgrade loadgen version to 5.0 (#1962)

8ce46d5

Fix loadgen build for version numbers having "0" (#1967)

4968ba0

* Fix loadgen build for version numbers having "0" * Update test-resnet50.yml * Update test-retinanet.yml * Update test-bert.yml

Increment version to 5.0.1

6afe996

Fix Dockerfile for 405B (#1960)

e69b504

Co-authored-by: Miro <mirhodak@amd.com>

Update backend_pytorch.py | Fix lock usage (#1964)

cec62ff

Co-authored-by: Miro <mirhodak@amd.com>

Add llama3 metrics + remove llama3-99.9 (#1973)

bc5e6fb

hanyunfan and others added 30 commits October 13, 2025 21:06

Increment version to 5.1.1

0e556f1

[Whisper] Add labels' in the whisper output (#2252)

0b8ca03

* add numbers into labels add * remove euro, cent...etc --------- Co-authored-by: hanyunfan <frank.han@dell.com>

Update gptj model download command to support R2 downloader (#2368)

f1b7ea1

Update model and dataset download commands to support R2 downloader (#…

3693669

…2369)

Discard duplicate information about external model download (#2370)

3d06ae1

MIXTRAL - Update MLCFlow dataset and model download commands to suppo…

0c27530

…rt R2 (#2371)

Update DLRM v2 assets download commands to support R2 downloader (#2372)

a921baf

SDXL: Update MLC commands to support model download through R2 (#2373)

55e41e0

Updation for inference docs page: migration to R2 (#2374)

6b92d87

* Updation for documentation page: migration to R2 * fix for llama2 assets download instructions

Update MLC commands to support downloads through R2 for PointPainting (…

d4bf062

…#2367)

update docs with submission instruction + hyperlinks for heading (#2390)

ceafe86

[VLM] Server Scenario, performance and accuracy (#2388)

32b6d66

* initial server version * change description for parameter use_token_latencies * [Automated Commit] Format Codebase * changes based on PR comments

Add note on LoadGen PyPI auto-build workflow (#2398)

59c59dd

Increment version to 5.1.2

06c742a

Revert changes used for testing (#2401)

353bfb8

Update loadgen to 6.0; Bulk update 6.0 checker bits (#2415)

003f4fb

* Update loadgen to 6.0; Bulk update 6.0 checker bits * Fix qwen3vl

feat: add MTP to ds-r1 ref. impl (#2403)

c098f80

Increment version to 6.0.1

abed98c

Add tests to detect failures in igbh dataset download (#2359)

ff627df

* Add tests to detect failures in igbh dataset download * Update Python version in CI workflow

Text to Video Reference Implementation (#2413)

e93f59d

* text to video ref implementation * update dataset and readme * move files to wan folder

Increment version to 6.0.2

0369dd1

Initial draft for Inference submission guide (#2378)

7f42a83

* Initial draft for Inference submission guide * Update index.md

[LoadGen] Bump version from 6.0.2 to 6.0.3 (#2417)

27db053

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

@dkorchevgithub adding lines suggested by @EtoDemerzel0427 to fix "g… #13

@dkorchevgithub adding lines suggested by @EtoDemerzel0427 to fix "g… #13

Uh oh!

dkorchevgithub commented Jul 14, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants

@dkorchevgithub adding lines suggested by @EtoDemerzel0427 to fix "g… #13

Are you sure you want to change the base?

@dkorchevgithub adding lines suggested by @EtoDemerzel0427 to fix "g… #13

Uh oh!

Conversation

dkorchevgithub commented Jul 14, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

20 participants