Changelog

All notable changes to this project will be documented in this file.

[Unreleased]

New features

Turn on/off classification augmentations (#4039)
Turn on/off detection and instance segmentation augmentations (#4066)
Add GPU memory monitor hook (#4118)
Add YOLOv9 model for Object Detection (#3917, #4026)
Add OV inference for keypoint detection (#3970)
Add tiling for semantic segmentation (#3954)
Add 3D Object Detection task with MonoDETR model (#3979)
Add OpenVINO inference for 3D Object Detection task (#4017)

Enhancements

Update visual prompting pipeline for multi-label zero-shot learning support (#3993)
Update detection to work with torch.compile (#4003)
Refactor MaskDINO (#4006)
Fix MaskRCNN/RTMDet-Inst/MaskRCNNTV Explain Mode (#4053)
Reduce tiling unittests resource consumption (#4073)
Upgrade OpenVINO to 2024.5 and NNCF to 2.14.0 (#4123)
Improve FMetric computation (#4130)
Upgrade OV, MAPI, and NNCF dependencies (#3967)
Instance Segmentation Model refactoring (#3865)
Bump torch and lightning to 2.4.0 versions (#3843)
Add mAP metric to evaluate multilabel classification (#3985)
Bump OV to 2024.6, update empty label handling (#4162)

Bug fixes

Fix MaskDINO, MonoDETR recipes (#4080)
Fix a wrong HPO log (#3972)
Update model name in rotated detection recipes (#4028)
Fix SupCon flag (#4076)

[2.2.2]

Bug fixes

BC improvement (#4154)

[2.2.1]

Bug fixes

Fix empty annotation in tiling (#4124)
Fix patching early stopping in tools/converter.py, update headers in templates, change training schedule for classification (#4131)
Fix tensor type compatibility in dynamic soft label assigner and RTMDet head (#4140)
Fix DETR target class indices are of type long in loss calculations (#4143)
Fix arrow format reader for multiclass ROI case (#4145)

Enhancements

Decouple DinoV2 for semantic segmentation task (#4136)
Update Label Info handling (#4127)

[2.2.0]

New features

Add RT-DETR model for Object Detection (#3741)
Add Multi-Label & H-label Classification with torchvision models (#3697)
Add Hugging-Face Model Wrapper for Classification (#3710)
Add LoRA finetuning capability for ViT Architectures (#3729)
Add Hugging-Face Model Wrapper for Object Detection (#3747)
Add Hugging-Face Model Wrapper for Semantic Segmentation (#3749)
Enable torch.compile to work with classification (#3758)
Add otx benchmark subcommand (#3762)
Add RTMPose for Keypoint Detection Task (#3781, #4034)
Add Semi-SL MeanTeacher algorithm for Semantic Segmentation (#3801)
Update head and h-label format for hierarchical label classification (#3810)
Support configurable input size (#3788)
Add diffusion task (#3875)
Revert the old workaround for detection confidence threshold (#4096)
Add Keypoint Detection legacy template (#4094)

Enhancements

Reimplement ViT architecture following TIMM (#3719)
Enable to override data configurations (#3748)
Enable to use input_size at transforms in recipe (#3759)
Enable to use polygon and bitmap mask as prompt inputs for zero-shot learning (#3769)
Refactoring ConvModule by removing conv_cfg, norm_cfg, and act_cfg (#3783, #3816, #3809)
Support ImageFromBytes (#3948)
Enable model export (#3952)
Move templates from OTX1.X to OTX2.X (#3951)
Include Geti arrow dataset subset names (#3962)
Include full image with anno in case there's no tile in tile dataset (#3964)
Add type checker in converter for callable functions (optimizer, scheduler) (#3968)
Change semantic segmentation to consider bbox-only annotations (#3996)
Relieve memory usage criteria on batch size 2 during adaptive batch size (#4009)
Remove background label from RT Info for segmentation task (#4011)
Enable export of the feature vectors for semantic segmentation task (#4055)
Prevent using too low confidence thresholds in detection (#4018)
Update HPO interface (#4035)
Bump onnx to 1.17.0 to omit CVE-2024-5187 (#4063)

Bug fixes

Update anomaly base transforms to use square resizing (#4059)
Fix Combined Dataloader & unlabeled warmup loss in Semi-SL (#3723)
Revert #3579 to fix issues with replacing coco_instance with a different format in some dataset (#3753)
Add num_devices in Engine for multi-gpu training (#3778)
Add missing tile recipes and various tile recipe changes (#3942)
Change categories mapping logic (#3946)
Fix config converter for tiling (#3973)
Fix BboxOverlaps2D handling of empty ground-truth annotations in datasets. (#4010)
Fix num_trials calculation on dataset length less than num_class (#4014)
Fix out_features in HierarchicalCBAMClsHead (#4016)
Fix multilabel_accuracy of MixedHLabelAccuracy (#4042)
Fix wrong indices setting in HLabelInfo (#4044)
Add legacy LiteHRNet_18 template (#4049)
Model templates: rename model_status value 'DISCONTINUED' to 'OBSOLETE' (#4051)
Update MRCNN model export to include feature vector and saliency map (#4056)
Upgrade MAPI in 2.2 (#4052)
Fix applying model's hparams when loading model from checkpoint (#4057)
Fix incorrect all_groups order configuration in HLabelInfo (#4067)
Fix RTDETR recipes (#4079)
Fix wrong model name in converter & template (#4082)
Fix RTMDet Inst Explain Mode (#4083)
Fix RTDETR Explain Mode (#4106)
Fix classification and semantic segmentation tasks, when ROI provided for images (#4105)
Disable tiling classifier toggle in configurable parameters (#4107)
Fix Ellipse shapes for Instance Segmentation (#4152)

[v2.1.0]

NOTES

OpenVINO™ Training Extensions, version 2.1.0 does not include the latest functional and security updates. OpenVINO™ Training Extensions, version 2.2.0 is targeted to be released in September 2024 and will include additional functional and security updates. Customers should update to the latest version as it becomes available.

New features

Add a flag to enable OV inference on dGPU (#3503)
Add early stopping with warmup. Remove mandatory background label in semantic segmentation task (#3515)
RTMDet-tiny enablement for detection task (#3542)
Add data_format validation and update in OTXDataModule (#3579)
Add torchvision.MaskRCNN (#3504)
Add Semi-SL for Multi-class Classification (EfficientNet-B0) (#3566)
Decoupling mmaction for action classification (MoviNet, X3D) (#3582)
Add Semi-SL Algorithms for mv3-large, effnet-v2, deit-tiny, dino-v2 (#3602)
RTMDet-tiny enablement for detection task (export/optimize) (#3564)
Enable ruff & ruff-format into otx/algo/classification/backbones (#3667)
Add TV MaskRCNN Tile Recipe (#3655)
Add rotated det OV recipe (#3687)

Enhancements

Change load_stat_dict to on_load_checkpoint (#3443)
Add try - except to keep running the remaining tests (#3448)
Update instance_segmentation.py to resolve conflict with 2.0.0 (#3506)
Update XPU install (#3516)
Sync rgb order between torch and ov inference of action classification task (#3551)
Enable Perf test to load a previous Perf test and skip the training stage (#3556)
Reenable e2e classification XAI tests (#3591)
Remove action detection task support (#3605)
Increase readability of pickling error log during HPO & fix minor bug (#3606)
Update RTMDet checkpoint url (#3631)
Refactor Torchvision Model for Classification Semi-SL (#3614)
Add coverage omit mm-related code (#3641)
Add docs semi-sl part (#3640)
Refactor docs design & Add contents (#3645)
Add execution example of auto batch size in docs (#3648)
Add Semi-SL for cls Benchmark Test (#3647)
Move value to device before logging for metric (#3649)
Add .codecov.yaml (#3650)
Update benchmark tool for otx2.1 (#3652)
Collect pretrained weight binary files in one place (#3656)
Minimize compiled dependency files (#3653)
Update README & CODEOWNERS (#3659)
Update Engine's docstring & CLI --help outputs (#3658)
Align integration test to exportable code interface update for release branch (#3676)
Refactor exporter for anomaly task and fix a bug with exportable code (#3672)
Update pandas version constraint (#3679)
Include more models to export test into test_otx_e2e (#3678)
Move assigning tasks to Models from Engine to Anomaly Model Classes (#3683)
Refactoring detection modules (#3636)

Bug fixes

Fix conflicts between develop and 2.0.0 (#3490)
Fix polygon mask (#3549)
Fix vpm intg test error (#3554)
Fix anomaly (#3557)
Bug fix in Semantic Segmentation + enable DINOV2 export in ONNX (#3569)
Fix some export issues. Remove EXPORTABLE_CODE as export parameter. (#3577)
Fix load_from_checkpoint to apply original model's hparams (#3607)
Fix load_from_checkpoint args to apply original model's hparams (#3611)
Fix zero-shot learn for ov model (#3601)
Various fixes for XAI in 2.1 (#3615)
Fix tests to work in a mm-free environment (#3632)
Fix a bug in benchmark code (#3643)
Update exportable code dependency & fix a bug (#3642)
Fix getting wrong shape during resizing (#3644)
Fix detection prediction outputs (#3634)
Fix RTMDet PTQ performance (#3651)
Fix segmentation fault on VPM PTQ (#3654, #3689)
Fix NNCF MaskRCNN-Eff accuracy drop (#3680)
Fix optimize with Semi-SL data pipeline (#3684)
Fix MaskRCNN SwinT NNCF Accuracy Drop (#3685)
Fix MaskRCNN SwinT NNCF Accuracy Drop By Adding More PTQ Configs (#3929)

Known issues

Post-Training Quantization (PTQ) optimization applied to maskrcnn_swint in the instance segmentation task may result in significantly reduced accuracy. This issue is expected to be addressed with an upgrade to OpenVINO and NNCF in a future release.

[v2.0.0]

NOTES

OpenVINO™ Training Extensions version 2.0.0 has been updated to include refactoring of the overall architecture and functional updates. Users should install the new environment.

New features

Enable New design to provide a more seamless API/CLI that delivers the value of OTX: Product Design
Moved away from MMLab's libraries to provide a Lightning-based core and training pipeline
Use Lightning-based modules and trainers to deliver APIs/CLIs in a more user-friendly way
Support Intel devices for accelerating deep learning model training

Enhancements

Support more models for each task
Improve the API so users can configure efficient training with shorter code
Provide more customizable settings through the CLI and API
Enhance the Auto-Configuration feature and make it available in the API

Bug fixes

Fixing some minor issues

Known issues

Anomaly task processing times have increased compared with v1.* version, with anomaly classification experiencing a slowdown of approximately 26%, anomaly detection by approximately 213%, and anomaly segmentation by approximately 78%. Issue #3592
Post-Training Quantization (PTQ) optimization applied to maskrcnn_swint in the instance segmentation task may result in significantly reduced accuracy compared with v1.* Issue #3593

[v1.6.1]

Enhancements

Update pymongo version to 4.6.3 for resolving CVE-2024-21506 (#3396)
Use torchvision in MRCNN on CUDA (#3347)
Update IPEX version in installation guide documentation (#3343)
Update benchmark (#3338)
Bump idna version to 3.7 (#3332)
Support benchmark history summary (#3307)
Pin pymongo version to 4.5.0 (#3316)
Upgrade MAPI (#3304)
Add NMS iou threshold configurable parameter (#3287)
Remedy some medium/low severity bandit issues (#3208)
Update documentations (#3280)
Add perf benchmark test cases for action and visual prompting (#3292)

Bug fixes

Explicitly cast incorrect output type in OV model (#3395)
Update QAT configs for rotated detection (#3375)
Hotfix 🔧 Bypass ClsIncrSampler for tiling (#3374)
[NNCF] Dynamic shape datasets WA (#3355)
[Hotfix] 🔥 Fixing detection oriented OV inferencer (#3351)
Revert adaptive batch size (#3340)
Fix e2e tests for XPU (#3305)
Remove torch.xpu.optimize for semantic_segmentation task (#3172)

[1.6.0]

New features

Changed supported Python version range (>=3.9, <=3.11) (openvinotoolkit/datumaro#1269)
Support MMDetection COCO format (openvinotoolkit/datumaro#1213)
Develop JsonSectionPageMapper in Rust API (openvinotoolkit/datumaro#1224)
Add Filtering via User-Provided Python Functions (openvinotoolkit/datumaro#1230, openvinotoolkit/datumaro#1233)
Remove supporting MacOS platform (openvinotoolkit/datumaro#1235)
Support Kaggle image data (KaggleImageCsvBase, KaggleImageTxtBase, KaggleImageMaskBase, KaggleVocBase, KaggleYoloBase) (openvinotoolkit/datumaro#1240)
Add __getitem__() for random accessing with O(1) time complexity (openvinotoolkit/datumaro#1247)
Add Data-aware Anchor Generator (openvinotoolkit/datumaro#1251)
Support bounding box import within Kaggle extractors and add KaggleCocoBase (openvinotoolkit/datumaro#1273)

Enhancements

Optimize Python import to make CLI entrypoint faster (openvinotoolkit/datumaro#1182)
Add ImageColorScale context manager (openvinotoolkit/datumaro#1194)
Enhance visualizer to toggle plot title visibility (openvinotoolkit/datumaro#1228)
Enhance Datumaro data format detect() to be memory-bounded and performant (openvinotoolkit/datumaro#1229)
Change RoIImage and MosaicImage to have np.uint8 dtype as default (openvinotoolkit/datumaro#1245)
Enable image backend and color channel format to be selectable (openvinotoolkit/datumaro#1246)
Boost up CityscapesBase and KaggleImageMaskBase by dropping np.unique (openvinotoolkit/datumaro#1261)
Enhance RISE algorithm for explainable AI (openvinotoolkit/datumaro#1263)
Enhance explore unit test to use real dataset from ImageNet (openvinotoolkit/datumaro#1266)
Fix each method of the comparator to be used separately (openvinotoolkit/datumaro#1290)
Bump ONNX version to 1.16.0 (openvinotoolkit/datumaro#1376)
Print the color channel format (RGB) for datum stats command (openvinotoolkit/datumaro#1389)
Add ignore_index argument to Mask.as_class_mask() and Mask.as_instance_mask() (openvinotoolkit/datumaro#1409)

Bug fixes

Fix wrong example of Datumaro dataset creation in document (openvinotoolkit/datumaro#1195)
Fix wrong command to install datumaro from github (openvinotoolkit/datumaro#1202, openvinotoolkit/datumaro#1207)
Update document to correct wrong datum project import command and add filtering example to filter out items containing annotations. (openvinotoolkit/datumaro#1210)
Fix label compare of distance method (openvinotoolkit/datumaro#1205)
Fix Datumaro visualizer's import errors after introducing lazy import (openvinotoolkit/datumaro#1220)
Fix broken link to supported formats in readme (openvinotoolkit/datumaro#1221)
Fix Kinetics data format to have media data (openvinotoolkit/datumaro#1223)
Handling undefined labels at the annotation statistics (openvinotoolkit/datumaro#1232)
Add unit test for item rename (openvinotoolkit/datumaro#1237)
Fix a bug in the previous behavior when importing nested datasets in the project (openvinotoolkit/datumaro#1243)
Fix Kaggle importer when adding duplicated labels (openvinotoolkit/datumaro#1244)
Fix input tensor shape in model interpreter for OpenVINO 2023.3 (openvinotoolkit/datumaro#1251)
Add default value for target in prune cli (openvinotoolkit/datumaro#1253)
Remove deprecated MediaManager (openvinotoolkit/datumaro#1262)
Fix explore command without project (openvinotoolkit/datumaro#1271)
Fix enable COCO to import only bboxes (openvinotoolkit/datumaro#1360)
Fix resize transform for RleMask annotation (openvinotoolkit/datumaro#1361)
Fix import YOLO variants from extractor when urls is not specified (openvinotoolkit/datumaro#1362)

[1.5.2]

Enhancements

Add memory bounded datumaro data format detect to release 1.5.1 (openvinotoolkit/datumaro#1241)
Bump version string to 1.5.2 (openvinotoolkit/datumaro#1249)
Remove Protobuf version limitation (<4) (openvinotoolkit/datumaro#1248)

[1.5.1]

Enhancements

Enhance Datumaro data format stream importer performance (openvinotoolkit/datumaro#1153)
Change image default dtype from float32 to uint8 (openvinotoolkit/datumaro#1175)
Add comparison level-up doc (openvinotoolkit/datumaro#1174)
Add ImportError to catch GitPython import error (openvinotoolkit/datumaro#1174)

Bug fixes

Modify the draw function in the visualizer not to raise an error for unsupported annotation types. (openvinotoolkit/datumaro#1180)
Correct explore path in the related document. (openvinotoolkit/datumaro#1176)
Fix errata in the voc document. Color values in the labelmap.txt should be separated by commas, not colons. (openvinotoolkit/datumaro#1162)
Fix hyperlink errors in the document (openvinotoolkit/datumaro#1159, openvinotoolkit/datumaro#1161)
Fix memory unbounded Arrow data format export/import (openvinotoolkit/datumaro#1169)
Update CVAT format doc to bypass warning (openvinotoolkit/datumaro#1183)

[v1.5.0]

New features

Enable configurable confidence threshold for otx eval and export (#2388)
Add YOLOX variants as new object detector models (#2402)
Enable FeatureVectorHook to support action tasks (#2408)
Add ONNX metadata to detection, instance segmentation, and segmentation models (#2418)
Add a new feature to configure input size (#2420)
Introduce the OTXSampler and AdaptiveRepeatDataHook to achieve faster training at the small data regime (#2428)
Add a new object detector Lite-DINO (#2457)
Add Semi-SL Mean Teacher algorithm for Instance Segmentation task (#2444)
Official supports for YOLOX-X, YOLOX-L, YOLOX-S, ResNeXt101-ATSS (#2485)
Add new argument to track resource usage in train command (#2500)
Add Self-SL for semantic segmentation of SegNext families (#2215)
Adapt input size automatically based on dataset statistics (#2499)

Enhancements

Refine input data in-memory caching (#2416)
Adapt timeout value of initialization for distributed training (#2422)
Optimize data loading by merging load & resize operations w/ caching support for cls/det/iseg/sseg (#2438, #2453, #2460)
Support torch==2.0.1 (#2465)
Set "Auto" as default input size mode (#2515)

Bug fixes

Fix F1 auto-threshold to choose best largest confidence (#2371)
Fix IBLoss enablement with DeiT-Tiny when class incremental training (#2594)

Known issues

OpenVINO(==2023.0) IR inference is not working well on 2-stage models (e.g. Mask-RCNN) exported from torch>=1.13.1
NNCF QAT optimization is disabled for MaskRCNN models due to CUDA runtime error in ROIAlign kernel on torch==2.0.1

[v1.4.4]

Enhancements

Update ModelAPI configuration (#2564)
Add Anomaly modelAPI changes (#2563)
Update Image numpy access (#2586)
Make max_num_detections configurable (#2647)

Bug fixes

Fix IBLoss enablement with DeiT-Tiny when class incremental training (#2595)
Fix mmcls bug not wrapping model in DataParallel on CPUs (#2601)
Fix h-label loss normalization issue w/ exclusive label group of single label (#2604)
Fix division by zero in class incremental learning for classification (#2606)
Fix saliency maps calculation issue for detection models (#2609)
Fix h-label bug of missing parent labels in output (#2626)

[v1.4.3]

Enhancements

Re-introduce adaptive scheduling for training (#2541)

[v1.4.2]

Enhancements

Upgrade nncf version to 2.6.0 (#2459)
Bump datumaro version to 1.5.0 (#2470, #2502)
Set tox version constraint (#2472)
Add model category attributes to model template (#2439)

Bug fixes

Bug fix for albumentations (#2467)
Add workaround for the incorrect meta info M-RCNN (used for XAI) (#2437)
Fix label list order for h-label classification (#2440)
Modified fq numbers for lite HRNET e2e tests (#2445)

[v1.4.1]

Enhancements

Update the README file in exportable code (#2411)

Bug fixes

Fix broken links in documentation (#2405)

[v1.4.0]

New features

Support encrypted dataset training (#2209)
Add custom max iou assigner to prevent CPU OOM when large annotations are used (#2228)
Auto train type detection for Semi-SL, Self-SL and Incremental: "--train-type" now is optional (#2195)
Add per-class XAI saliency maps for Mask R-CNN model (#2227)
Add new object detector Deformable DETR (#2249)
Add new object detector DINO (#2266)
Add new visual prompting task (#2203, #2274, #2311, #2354, #2318)
Add new object detector ResNeXt101-ATSS (#2309)

Enhancements

Introduce channel_last parameter to improve the performance (#2205)
Decrease time for making a workspace (#2223)
Set persistent_workers and pin_memory as True in detection task (#2224)
New algorithm for Semi-SL semantic segmentation based on metric learning via class prototypes (#2156)
Self-SL for classification can now receive just a folder with any images to start contrastive pretraining (#2219)
Update OpenVINO version to 2023.0, and NNCF version to 2.5 (#2090)
Improve XAI saliency map generation for tiling detection and tiling instance segmentation (#2240)
Remove CenterCrop from Classification test pipeline and edit missing docs link (#2375)
Switch to PTQ for sseg (#2374)

Bug fixes

Fix the bug that auto adapt batch size is unavailable with IterBasedRunner (#2182)
Fix the bug that learning rate isn't scaled when multi-GPU training is enabled (#2254)
Fix the bug that label order is misaligned when model is deployed from Geti (#2369)
Fix NNCF training on CPU (#2373)
Fix H-label classification (#2377)
Fix invalid import structures in otx.api (#2383)
Add saliency map calculation from predictions for async inference (Mask RCNN IR) (#2395)

Known issues

OpenVINO(==2023.0) IR inference is not working well on 2-stage models (e.g. Mask-RCNN) exported from torch==1.13.1

[v1.3.1]

Enhancements

n/a

Bug fixes

Fix a bug that auto adapt batch size doesn't work with cls incr case (#2199)
Fix a bug that persistent worker is True even if num_workers is zero (#2208)

Known issues

OpenVINO(==2022.3) IR inference is not working well on 2-stage models (e.g. Mask-RCNN) exported from torch==1.13.1 (working well up to torch==1.12.1) (#1906)

[v1.3.0]

New features

Support direct annotation input for COCO format (#1921)
Action task supports multi GPU training. (#2057)
Support storage cache in Apache Arrow using Datumaro for action tasks (#2087)
Add a simplified greedy labels postprocessing for hierarchical classification (#2064).
Support auto adapting batch size (#2119)
Support auto adapting num_workers (#2165)
Support noisy label detection for detection tasks (#2109, #2115, #2123, #2183)

Enhancements

Make semantic segmentation OpenVINO models compatible with ModelAPI (#2029).
Support label hierarchy through LabelTree in LabelSchema for classification task (#2149, #2152).
Enhance exportable code file structure, video inference and default value for demo (#2051).
Speed up OpenVINO inference in image classification, semantic segmentation, object detection and instance segmentation tasks (#2105).
Refactoring of ONNX export functionality (#2155).
SSD detector Optimization (#2197)

Bug fixes

Fix async mode inference for demo in exportable code (#2154)
Fix a bug that auto adapt batch size doesn't work with cls incr case (#2199)

Known issues

OpenVINO(==2022.3) IR inference is not working well on 2-stage models (e.g. Mask-RCNN) exported from torch==1.13.1 (working well up to torch==1.12.1) (#1906)

[v1.2.3]

Bug fixes

Return raw anomaly map instead of colormap as metadata to prevent applying colormap conversion twice (#2217)
Hotfix: use 0 confidence threshold when computing best threshold based on F1

[v1.2.2]

Enhancements

Improve warning message for tiling configurable parameter

Known issues

OpenVINO(==2022.3) IR inference is not working well on 2-stage models (e.g. Mask-RCNN) exported from torch==1.13.1 (working well up to torch==1.12.1) (#1906)

[v1.2.1]

Enhancements

Upgrade mmdeploy==0.14.0 from official PyPI (#2047)
Integrate new ignored loss in semantic segmentation (#2065, #2111)
Optimize YOLOX data pipeline (#2075)
Tiling Spatial Concatenation for OpenVINO IR (#2052)
Optimize counting train & inference speed and memory consumption (#2172)

Bug fixes

Bug fix: value of validation variable is changed after auto decrease batch size (#2053)
Fix tiling 0 stride issue in parameter adapter (#2078)
Fix Tiling NNCF (#2081)
Do not skip full img tile classifier + Fix Sequential Export Issue (#2174)

[v1.2.0]

New features

Add generating feature cli_report.log in output for otx training (#1959)
Support multiple python versions up to 3.10 (#1978)
Support export of onnx models (#1976)
Add option to save images after inference in OTX CLI demo together with demo in exportable code (#2005)
Support storage cache in Apache Arrow using Datumaro for cls, det, seg tasks (#2009)
Add noisy label detection for multi-class classification task (#1985, #2034)
Add DeiT template for classification tasks as experimental template (#2093)

Enhancements

Clean up and refactor the output of the OTX CLI (#1946)
Enhance DetCon logic and SupCon for semantic segmentation (#1958)
Detection task refactoring (#1955)
Classification task refactoring (#1972)
Extend OTX explain CLI (#1941)
Segmentation task refactoring (#1977)
Action task refactoring (#1993)
Optimize data preprocessing time and enhance overall performance in semantic segmentation (#2020)
Support automatic batch size decrease when there is not enough GPU memory (#2022)
Refine HPO usability (#2175)

Bug fixes

Fix backward compatibility with OpenVINO SSD-like detection models from OTE 0.5 (#1970)

Known issues

OpenVINO(==2022.3) IR inference is not working well on 2-stage models (e.g. Mask-RCNN) exported from torch==1.13.1 (working well up to torch==1.12.1) (#1906)

[v1.1.2]

Bug fixes

Fix exception -> warning for anomaly dump_feature option
Remove dataset.with_empty_annotations() to keep original input structure (#1964)
Fix OV batch inference (saliency map generation) (#1965)
Replace EfficientNetB0 model download logic by pytorchcv to resolve zip issue (#1967)

[v1.1.1]

Bug fixes

Add missing OpenVINO dependency in exportable code requirement

[v1.1.0]

New features

Add FP16 IR export support (#1683)
Add in-memory caching in dataloader (#1694)
Add MoViNet template for action classification (#1742)
Add Semi-SL multilabel classification algorithm (#1805)
Integrate multi-gpu training for semi-supervised learning and self-supervised learning (#1534)
Add train-type parameter to otx train (#1874)
Add embedding of inference configuration to IR for classification (#1842)
Enable VOC dataset in OTX (#1862)
Add mmcls.VisionTransformer backbone support (#1908)

Enhancements

Parametrize saliency maps dumping in export (#1888)
Bring mmdeploy to action recognition model export & Test optimization of action tasks (#1848)
Update backbone lists (#1835)
Add explanation for XAI & minor doc fixes (#1923)
Refactor phase#1: MPA modules

Bug fixes

Handle unpicklable update_progress_callback (#1892)
Dataset Adapter: Avoid duplicated annotation and permit empty image (#1873)
Arrange scale between bbox preds and bbox targets in ATSS (#1880)
Fix label mismatch of evaluation and validation with large dataset in semantic segmentation (#1851)
Fix packaging errors including cython module build / import issues (#1936)

Known issues

OpenVINO(==2022.3) IR inference is not working well on 2-stage models (e.g. Mask-RCNN) exported from torch==1.13.1 (working well up to torch==1.12.1) (#1906)

[v1.0.1]

Enhancements

Refine documents by proof review
Separate installation for each task
Improve POT efficiency by setting stat_requests_number parameter to 1
Introduce new tile classifier to enhance tiling inference performance in MaskRCNN.

Bug fixes

Fix missing classes in cls checkpoint
Fix action task sample codes
Fix label_scheme mismatch in classification
Fix training error when batch size is 1
Fix hang issue when tracing a stack in certain scenarios
Fix pickling error by Removing mmcv cfg dump in ckpt

[v1.0.0]

NOTES

OpenVINO™ Training Extensions version 1.0.0 has been updated to include functional and security updates. Users should update to the latest version.

New features

Adaptation of Datumaro component as a dataset interface
Integrate hyper-parameter optimizations
Support action recognition task
Self-supervised learning mode for representational training
Semi-supervised learning mode for better model quality

Enhancements

Installation via PyPI package
Enhance find command to find configurations of supported tasks / algorithms / models / backbones
Introduce build command to customize task or model configurations in isolated workspace
Auto-config feature to automatically select the right algorithm and default model for the train & build command by detecting the task type of given input dataset
Improve documentation
Improve training performance by introducing enhanced loss for the few-shot transfer

Bug fixes

Fixing configuration loading issue from the meta data of the model in OpenVINO task for the backward compatibility
Fixing some minor issues

[v0.5.0]

NOTES

OpenVINO Training Extensions version v0.5.0 and older does not include the latest functional and security updates. OTE Version 1.0.0 is targeted to be released in February 2023 and will include additional functional and security updates. Customers should update to the latest version as it becomes available.

Added

Add tiling in rotated detection (#1420)
Add Cythonize AugMixAugment (#1478)
Integrate ov-telemetry (#1568)

Changed

Update OpenVINO to 2022.3 release & nncf to the pre-2.4 version (#1393)

Fixed

Fixing h-label head output bug in OV inference (#1458)
Fixing deprecated np.bool issue from numpy==1.24.0 (#1455)
Fixing tiling OpenVINO backward compatibility (#1516)
Fixing indexing in hierarchical classification inference (#1551)
Copying feature vector to resolve duplication issue (#1511)
Fixing handling ignored samples in hierarchical head (#1599)
Some minor issues

[v0.4.0]

Added

Model Preparation Algorithm (MPA)
- Better saliency map support
  - Replace current saliency map generation with Recipro-CAM for cls (#1363)
  - Class-wise saliency map generation for the detection task (#1402)
  - OTE Saliency Map Label (#1447)
- Improve object counting algorithm for high-res images via image tiling
  - Add Tiling Module (#1200)
  - Filter objects smaller than 1 pixel (#1305)
  - Tiling deployment (#1387)
  - Enable tiling oriented detection for v0.4.0/geti1.1.0 (#1427)

Fixed

Hotfix for two-stage detection error (#1433)
Fixing ZeroDivisionError in iteration counter for detection-classification project trainings (#1449)
Some minor issues

[v0.3.1]

Fixed

Neural Network Compression Framework (NNCF)
- Fix CUDA OOM for NNCF optimization model MaskRCNN-EfficientNetB2B (#1319)
Model Preparation Algorithm (MPA)
- Fix 'Shape out of bounds' error when accepting AI predictions for detection oriented (#1326)
- Fix weird confidence behaviour issue on predictions for hierarchical classification (#1332)
- Fix training failure issue for hierarchical classification (#1329)
- Fix training failure issues for segmentation and instance segmentation during inference process (#1338)
- Some minor issues

Security

Update vulnerable Python dependencies in OTE (#1303)

[v0.3.0]

Added

Model Preparation Algorithm (MPA)
- Add new tasks / model templates for Class-Incremental Learning
  - Instance Segmentation (#1142)
  - Classification
    - Multilabel (#1132)
    - Hierarchical-label (#1159)
  - SSD and YOLOX model template for Detection (#1156)
- Saliency map support
  - Classification (#1166)
  - Detection (#1155)
  - Segmentation (#1158)
- NNCF (#1157) support
- HPO (#1168) support
- Balanced Sampler support for Classification (#1139)
- Add Adaptive Training for Detection / Instance Segmentation (#1190)
Anomaly
- Add real-life training tests (#898)
- Add additional check for early stopping parameter (#1110)
- Add DRAEM task implementation (#1203)

Changed

Model Preparation Algorithm (MPA)
- Replace Class-Incremental Learning models as OTE default models (#1150)
- Replace OTE ignored label support with external ignored label
  - Classification (#1132)
  - Detection (#1128)
  - Segmentation (#1134)
- Enable mixed precision for Classification / Detection / Segmentation (#1198)
- Enhance training schedule for Classification (#1212)
- Change Model optimization hyper-parameters for Classification / Detection (#1170)
- Disable Obsolete test cases for OTE CI (#1220)
Anomaly
- Extend conftest configuration for anomaly backend (#1097)
- Expose more params to the UI (#1085)
- Change directory structure for anomaly templates (#1105)
- Use is_anomalous attribute instead of string matching (#1120)
- Set nncf version (#1124)
- Move to learning parameters (#1152)
- Change OpenVINO MO Command (#1221)

Fixed

Model Preparation Algorithm (MPA)
- Fix inference issues for Detection (#1167)
- Fix model compatibility issue between SC1.1 and 1.2 in Segmentation (#1264)
- Some minor issues
Anomaly
- Fix non deterministic + sample.py (#1118)
- Fix exportable code for anomaly tasks (#1113)
- Fix local anomaly segmentation performance bug (#1219)
- Fix progress bar (#1223)
- Fix inference when model backbone changes (#1242)

[v0.2.0]

Added

Model Preparation Algorithm (MPA), a newly introduced OTE Algorithm backend for advanced transfer learning
- Class-Incremental Learning support for OTE models
  - Image Classification
  - Object Detection
  - Semantic Segmentation
Object counting & Rotated object detection are added to Object Detection backend
Increased support for NNCF / FP16 / HPO
Ignored label support
Stop training on NaN losses

Changed

Major refactoring
- Tasks & model templates had been moved to OTE repo from each OTE Algorithm backend

[v0.1.1]

Fixed

Some minor issues

[v0.1.0]

Added

OTE SDK, defines an interface which can be used by OTE CLI to access OTE Algorithms.
OTE CLI, contains set of commands needed to operate with deep learning models using OTE SDK Task interfaces.
OTE Algorithms, contains sub-projects implementing OTE SDK Task interfaces for different deep learning models.
- Anomaly Classification
- Image Classification
- Object Detection
- Semantic Segmentation
