usstq commented Aug 29, 2023

Details:

  • Add GenPattern APIs to enable matching of large patterns. With this API, sub-patterns can be matched without custom logic in the matcher's callback; the description of each pattern node is enough to guarantee the semantic correctness of the match.
  • Add RoPEFusion and a RoPE CPU node for matching the rotary embedding pattern in GPTNEOX and GPTJ types of models.
  • Add VNodeFusion and a VNode CPU node. VNode is a versatile CPU plugin node for quickly enabling plain-layout, shape-agnostic optimizations on patterns from LLMs. So far it contains optimizations for the gptneox_attention, gptj_attention, falcon_attention, llama2_attention, and llama_RMSNorm subgraph patterns. Because these designs are short-term, I did not introduce a dedicated CPU node for each sub-pattern; the flexible, lightweight VNode is used instead to enable custom pattern optimizations quickly.
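
For readers unfamiliar with the rotary embedding pattern that RoPEFusion targets, here is a minimal pure-Python sketch of the two pairing layouts commonly associated with GPT-NeoX-style and GPT-J-style models. It is not the code from this PR. The function names, the `base=10000.0` default, and applying the rotation over the full vector (rather than only a leading `rotary_dim` slice) are assumptions made for illustration.

```python
import math

def rope_angles(pos, dim, base=10000.0):
    """Rotation angle for each of the dim/2 pairs at position `pos` (assumed base)."""
    return [pos * base ** (-2.0 * i / dim) for i in range(dim // 2)]

def rope_neox(x, pos, base=10000.0):
    """GPT-NeoX-style layout: element i is paired with element i + dim/2."""
    dim, half = len(x), len(x) // 2
    out = [0.0] * dim
    for i, theta in enumerate(rope_angles(pos, dim, base)):
        c, s = math.cos(theta), math.sin(theta)
        a, b = x[i], x[i + half]
        out[i] = a * c - b * s          # x * cos + rotate_half(x) * sin
        out[i + half] = b * c + a * s
    return out

def rope_gptj(x, pos, base=10000.0):
    """GPT-J-style layout: adjacent elements (2i, 2i + 1) form a pair."""
    dim = len(x)
    out = [0.0] * dim
    for i, theta in enumerate(rope_angles(pos, dim, base)):
        c, s = math.cos(theta), math.sin(theta)
        a, b = x[2 * i], x[2 * i + 1]
        out[2 * i] = a * c - b * s      # x * cos + rotate_every_two(x) * sin
        out[2 * i + 1] = b * c + a * s
    return out

if __name__ == "__main__":
    q = [1.0, 2.0, 3.0, 4.0]
    print(rope_neox(q, pos=3))
    print(rope_gptj(q, pos=3))
```

Both variants apply the same 2-D rotation per pair and differ only in which elements are paired, which is presumably why a single fused node can serve both model families.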

Tickets:

  • ticket-id

e-ddykim and others added 30 commits August 14, 2023 23:29
…quantize (openvinotoolkit#19182)

* updated to add reorder after parameter when it is connected to quantize

* removed redundant code
…olkit#19079)

* Removed 'example_output' from ovc and ovc.convert_model, used output for this purpose

* Update tools/ovc/openvino/tools/ovc/convert.py

* Update tools/ovc/openvino/tools/ovc/convert_impl.py

* Reverted mo parts not affected by remove of example_output

* fix PDPD convert_model tests

---------

Co-authored-by: Xiuchuan Zhai <xiuchuan.zhai@intel.com>
)

* [CONFORMANCE] Fix memory leak in Subgraphs Dumper

* Update fused_names.cpp

* Change inheritance of extractors

* Check graph cache

* Enable Op cache
)

* Moved mmap to openvino/util as a shared functionality

* Enabled MMAP for SavedModel and MetaGraph

* Fixed CMake

* Fixed a lost line

* Simplified code for compilers

* Aligned with an actual master

* Enabled mmap by default and added test

* Suppressed warning, added test for MetaGraph, additional messages

---------

Co-authored-by: Andrei Kochin <andrei.kochin@intel.com>
* fix a possible race condition

Signed-off-by: fishbell <bell.song@intel.com>

* clang

Signed-off-by: fishbell <bell.song@intel.com>

* reimpl

Signed-off-by: fishbell <bell.song@intel.com>

* add cases

Signed-off-by: fishbell <bell.song@intel.com>

* implement test in 2.0

Signed-off-by: fishbell <bell.song@intel.com>

* clean up unchanged files

Signed-off-by: fishbell <bell.song@intel.com>

* add pragma once

Signed-off-by: fishbell <bell.song@intel.com>

---------

Signed-off-by: fishbell <bell.song@intel.com>
* Fix TS gather

* enable pytest

* revert auto replaced comment
…kit#19106)

* [GPU] Fix unintentional expansion of pad's rank into 2D

* Add explicit 1d pad TCs for ov_gpu_unit_tests
* add set property case

* add set property case

* free key

* fix code style

* add test of get_property() for ov_property_key_auto_batch_timeout

* get_property first

* add device name

* device_name.c_str()

* reset name

* test set_property

* remove comment

* pass local

* get property first

* fix bug

* use device from GetParam

* fix char to string

* use EXPECT_STREQ(target.c_str(), result);

* fix comment

---------

Co-authored-by: Chen Peter <peter.chen@intel.com>
* Round ref implementation select optimal solution

* Add rounding guard
- minor corrections after review.
…lkit#19127)

* add-numpy

* Apply suggestions from code review

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>

* Apply suggestions from code review

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>

---------

Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
* Added check that nncf was imported.

* Added check that nncf was imported in MO.

* Added check that nncf was imported in MO.

* Apply suggestions from code review

Co-authored-by: Sergey Lyalin <sergey.lyalin@intel.com>

* Removed not needed import.

* Pylint fix.

---------

Co-authored-by: Sergey Lyalin <sergey.lyalin@intel.com>
* flatbuffers version update

* minor changes

* minor changes
* [PT FE] Revise usage output vs node in frontend

* Fix code style
* TorchFX caching bugfix and improvements

* Fixed inconsistent env variable for Backend device

* Identify PyTorch FrontEnd Decoder type

* Added import statement in init files

* Registered ts_openvino as a separate backend

* Added caching fix and removed extraneous code

* Changed the name of ts backend

* Fixed issue with local temporary object

* Removed import statement from init files

* Changed the documentation

* Added get_supported_ops method for decoders

---------

Co-authored-by: Cavus Mustafa <mustafa.cavus@intel.com>
Co-authored-by: ynimmaga <yamini.nimmagadda@intel.com>
C++ version doesn't have this problem.

Ticket 99702
* Added support of tuple in input, removed type syntax from OVC tool.

* Removed type syntax tests.

* Apply suggestions from code review

* Method annotation corrected.

* Type annotation corrected.

---------

Co-authored-by: Sergey Lyalin <sergey.lyalin@intel.com>