hailo_model_zoo/docs/CHANGELOG.rst at master · hailo-ai/hailo_model_zoo

v5.3.0

Updated to use with Dataflow Compiler v5.3.0
Updated to use with HailoRT v5.3.0
Bug fixes
New Models:
- YOLOv11-obb - Added support for oriented bounding box detection models - YOLOv11-obb family
- YOLO26 - New family of object detection and instance segmentation models. The new architecture is NMS-free.
- PaddleOCR-v5 - Added support for paddle_ocr_v5_mobile_detection and paddle_ocr_v5_mobile_recognition for Hailo-15L hardware architecture, and improved Hailo-10H/Hailo-15H FPS
- YOLOv5-seg-hpp - Added support for instance segmentation with HailoRT postprocessing - yolov5n_seg_hpp, yolov5s_seg_hpp, yolov5m_seg_hpp

v5.2.0

Hailo-15L is now supported by the Model Zoo
Update to use Dataflow Compiler v5.2.0
Update to use HailoRT v5.2.0
Bug fixes
New Models:
- SigLIP - Released siglip_l_16_256_image_encoder, siglip_l_16_256_text_encoder, siglip2_l_16_256_image_encoder, siglip2_l_16_256_text_encoder, siglip_b_16_image_encoder, siglip_b_16_text_encoder
- YOLOv12 - Released yolov12n
- PaddleOCR-v5 - Text detection and recognition models - released paddle_ocr_v5_mobile_detection, paddle_ocr_v5_mobile_recognition
- StereoNet - stereo depth estimation model - now also available for Hailo-10H/15H

v5.1.0

Update to use Dataflow Compiler v5.1.0 (developer-zone)
Update to use HailoRT v5.1.0 (developer-zone)
New Models:
- YOLOv7x - yolov7x
- DepthAnything - depthanything_vits, depthanything_v2_vits - Zero-shot depth estimation models
- CLIP - Performance improvements for clip image encoders
- TinyCLIP - TinyCLIP family - Contrastive Language-Image Pre-training models (image & text encoders)
- YOLO-World - yolo_world_v2s - Zero-shot object detection model
Bug fixes

v5.0.0

Update to use Dataflow Compiler v5.0.0 (developer-zone)
Update to use HailoRT v5.0.0 (developer-zone)
New Models:
- SigLip - SigLip2-base-32 - Contrastive Language-Image Pre-training model
Bug fixes

v2.15

Update to use Dataflow Compiler v3.31.0 (developer-zone)
Update to use HailoRT 4.21.0 (developer-zone)
New Models:
- CLIP - ViT-Base-16, ViT-Base-32, ViT-Large-14 (336x336 resolution) - Contrastive Language-Image Pre-training model [Hailo-15H and Hailo-10H only]
- Real-ESRGAN - x4 - Super Resolution model [Hailo-15H and Hailo-10H only]
- PoolFormer - s12 - Vision Transformer classification model [Hailo-15M/H and Hailo-10H only]
Bug fixes

v2.14

Update to use Dataflow Compiler v3.30.0 (developer-zone)
Update to use HailoRT 4.20.0 (developer-zone)
New cascade API (experimental)
- Currently supports PETRv2, bird-eye-view network for 3D object detection, see petrv2_repvggB0.yaml for configurations.
- The user needs existing hars/hefs: both petrv2_repvggB0_backbone_pp_800x320 & petrv2_repvggB0_transformer_pp_800x320
- full_precision evaluation: hailomz cascade eval petrv2
- hardware evaluation: hailomz cascade eval petrv2 --override target=hardware
New task:
- Human Action Recognition
  - Added support for (partial) Kinetics-400 dataset
  - Added r3d_18 to support this task
New Models:
- YOLOv11 - nano, small, medium, large, x-large - Latest YOLO detectors
- CLIP - ViT-Large-14-Laion2B - Contrastive Language-Image Pre-training model [H15H and H10H only]
- SWIN - tiny, small - Shifted-Windows Transformer based classification model
- DaViT - tiny - Dual Attention Vision Transformer classification model [H15H and H10H only]
- LeViT - levit128, levit192, levit384 - Transformer based classification model
- EfficientFormer - l1 - Transformer based classification model
- Real-ESRGAN - x2 - Super Resolution model
- R3D_18 - r3d_18 - Video Classification network for Human Action Recognition [H8 only]
Bug fixes

v2.13

Update to use Dataflow Compiler v3.29.0 (developer-zone)
Update to use HailoRT 4.19.0 (developer-zone)
Using jit_compile which reduces dramatically the emulation inference time of the Hailo Model Zoo models.
New tasks:
- BEV: Multi-View 3D Object Detection
  - Added support for NuScenes dataset
  - Added PETRv2 with the following configuration:
    1. Backbone: RepVGG-B0 (800x320 input resolution)
    2. Transformer: 3 decoder layers, detection queries=304, replaced LN with UN
New Models:
- CAS-ViT - S, M, T - Convolutional-Attention based classification model
- YOLOv10 - base, x-large - Latest YOLO detectors
- CLIP Text Encoders - ResNet50x4, ViT-Large
New retraining Docker containers for:
- PETR - Multi-View 3D Object Detection
Introduced new flags for hailomz CLI:
- --ap-per-class for measuring average-precision per-class. Relevant for object detection and instance segmentation tasks.
Bug fixes

v2.12

Update to use Dataflow Compiler v3.28.0 (developer-zone)
Update to use HailoRT 4.18.0 (developer-zone)
Target hardware now supports Hailo-10H device
New Models:
- Original ViT models - tiny, small, base - Transformer based classification models
- DeiT models - tiny, small, base - Transformer based classification models
- DETR (resnet50) - Transformer based object detection model
- fastvit_sa12 - Fast transformer based classification model
- levit256 - Transformer based classification model
- YOLOv10 - nano, small - Latest YOLO detectors
- RepGhostNet1.0x, RepGhostNet2.0x - Hardware-Efficient classification models
New postprocessing support on NN Core:
- yolov6 tag 0.2.1
Added support for person attribute visualization
Bug fixes

v2.11

Update to use Dataflow Compiler v3.27.0 (developer-zone)
Update to use HailoRT 4.18.0 (developer-zone)
New Models:
- FastSAM-s - Zero-shot Instance Segmentation
- Yolov9c - Latest Object Detection model of the YOLO family
Using HailoRT-pp for postprocessing of the following variants:
- nanodet
Postprocessing JSON configurations are now part of the cfg directory.
Introduced new flags for hailomz CLI:
- --start-node-names and --end-node-names for customizing parsing behavior.
- --classes for adjusting the number of classes in post-processing configuration.
The --performance flag, previously utilized for compiling models with their enhanced model script if available, now offers an additional functionality. In instances where a model lacks an optimized model script, this flag triggers the compiler's Performance Mode to achieve the best performance.

These flags simplify the process of compiling models generated from Hailo retrain dockers.
Bug fixes

v2.10

Update to use Dataflow Compiler v3.26.0 (developer-zone)
Update to use HailoRT 4.16.0 (developer-zone)
Using HailoRT-pp for postprocessing of the following variants:
- yolov8
Profiler change:
- Removal of the --mode flag from the hailomz profile command, which generates a report according to the provided HAR state.
CLI change:
- The hailo8 target is deprecated in favor of hardware.
Support for the KITTI Stereo Dataset
New Models:
- vit_pose_small - encoder based transformer with layernorm for pose estimation
- segformer_b0_bn - encoder based transformer with batchnorm for semantic segmentation
Bug fixes

v2.9

Update to use Dataflow Compiler v3.25.0 (developer-zone)
Update to use HailoRT 4.15.0 (developer-zone)
A new CLI-compatible API that allows users to incorporate format conversion and reshaping capabilities into the input:

hailomz compile yolov5s --resize 1080 1920 --input-conversion nv12_to_rgb

New transformer models added:
- vit_pose_small_bn - encoder based transformer with batchnorm for pose estimation
- clip_resnet_50x4 - Contrastive Language-Image Pre-Training for zero-shot classification
New retraining dockers for vit variants using unified normalization.
New Models:
- yolov8s_pose / yolov8m_pose - pose estimation
- scdepthv3 - depth-estimation
- dncnn3 / dncnn_color_blind - image denoising
- zero_dce_pp - low-light enhancement
- stereonet - stereo depth estimation
Using HailoRT-pp for postprocessing of the following models:
- efficientdet_lite0 / efficientdet_lite1 / efficientdet_lite2

v2.8

Update to use Dataflow Compiler v3.24.0 (developer-zone)
Update to use HailoRT 4.14.0 (developer-zone)
The Hailo Model Zoo now supports the following vision transformers models:
- vit_tiny / vit_small / vit_base - encoder based transformer with batchnorm for classification
- detr_resnet_v1_18_bn - encoder/decoder transformer for object detection
- clip_resnet_50 - Contrastive Language-Image Pre-Training for zero-shot classification
- yolov5s_c3tr - object detection model with a MHSA block
Using HailoRT-pp for postprocessing of the following variants:
- yolov5
- yolox
- ssd
- efficientdet
- yolov7
New Models:
- repvgg_a1 / repvgg_a2 - classification
- yolov8_seg: yolov8n_seg / yolov8s_seg / yolov8m_seg - instance segmentation
- yolov6n_0.2.1 - object detection
- zero_dce - low-light enhancement
New retraining dockers for:
- yolov8
- yolov8_seg
Enable compilation for the hailo15h device
Enable evaluation of models with RGBX/NV12 input format
Bug fixes

v2.7

Update to use Dataflow Compiler v3.23.0 (developer-zone)
Updated to use HailoRT 4.13.0 (developer-zone)
The inference flow was moved to new high-level APIs
New object detection variants:
- yolov8: yolov8n / yolov8s / yolov8m / yolov8l / yolov8x
- damoyolo: damoyolo_tinynasL20_T / damoyolo_tinynasL25_S / damoyolo_tinynasL35_M
New transformers based models:
- vit_base - classification model
- yolov5s_c3tr - object detection model with a self-attention block
Examples for using HailoRT-pp - support for seamless integration of models and their corresponding postprocessing
- yolov5m_hpp
Configuration YAMLs and model scripts for networks with YUY2 input format
DAMO-YOLO retraining docker
Bug fixes

v2.6.1

Bug fixes

v2.6

Update to use Dataflow Compiler v3.22.0 (developer-zone)
Updated to use HailoRT 4.12.0 (developer-zone)
ViT (Vision Transformer) - new classification network with transformers-encoder based architecture
New instance segmentation variants:
- yolov5n_seg
- yolov5s_seg
- yolov5m_seg
- yolov5l_seg
New object detection variants for high resolution images:
- yolov7e6
- yolov5n6_6.1
- yolov5s6_6.1
- yolov5m6_6.1
New flag --performance to reproduce highest performance for a subset of networks
The Hailo Model Zoo log is now written to sdk_virtualenv/etc/hailo/modelzoo/hailo_examples.log
Bug fixes

v2.5

Update to use Dataflow Compiler v3.20.1 (developer-zone)
Model scripts use new bgr to rgb conversion
New Yolact variants - with all COCO classes:
- yolact_regnetx_800mf
- yolact_regnetx_1.6gf
Bug fixes

v2.4

Updated to use Dataflow Compiler v3.20 (developer-zone)
The required FPS was moved from models' YAML into the model scripts
Model scripts use the new change activation syntax
New models:
- Face Detection - scrfd_500m / scrfd_2.5g / scrfd_10g
New tasks:
1. Super-Resolution
- Added support for BSD100 dataset
- The following models were added: espcn_x2 / espcn_x3 / espcn_x4
1. Face Recognition
- Support for LFW dataset
- The following models were added:
  1. arcface_r50
  2. arcface_mobilefacenet
- Retraining docker for arcface architecture
Added support for new hw-arch: hailo8l

v2.3

Updated to use Dataflow Compiler v3.19 (developer-zone)
New models:
- yolov6n
- yolov7 / yolov7-tiny
- nanodet_repvgg_a1_640
- efficientdet_lite0 / efficientdet_lite1 / efficientdet_lite2
New tasks:
- mspn_regnetx_800mf - single person pose estimation
- face_attr_resnet_v1_18 - face attribute recognition
Single person pose estimation training docker (mspn_regnetx_800mf)
Bug fixes

v2.2

Updated to use Dataflow Compiler v3.18 (developer-zone)
CLI change:
- The Hailo Model Zoo CLI is now working with an entry point: hailomz
- The quantize sub-command was changed to optimize
- The Hailo Model Zoo data directory by default will be ~/.hailomz
New models:
- yolov5xs_wo_spp_nms - a model which contains bbox decoding and confidence thresholding on Hailo-8
- osnet_x1_0 - person ReID network
- yolov5m_6.1 - yolov5m network from the latest tag of the repo (6.1) including silu activation
New tasks:
- person_attr_resnet_v1_18 - person attribute recognition
ReID training docker for the Hailo model repvgg_a0_person_reid_512/2048

NOTE: Ubuntu 18.04 will be deprecated in a future version of Hailo Model Zoo.

NOTE: Python 3.6 will be deprecated in a future version of Hailo Model Zoo.

v2.1

Updated to use Dataflow Compiler v3.17 (developer-zone)
Parser commands were moved into model scripts
Support for the Market-1501 Dataset
Support for a new Model Zoo task: ReID
New models:
- yolov5s_personface - person and face detector
- repvgg_a0_person_reid_512 / repvgg_a0_person_reid_2048 - ReID networks which outputs a person embedding
  
  These models were trained in-house as part of our upcoming new application
- stdc1 - Segmentation architecture for Cityscapes

v2.0

Updated to use Dataflow Compiler v3.16 (developer-zone) with TF version 2.5 which require CUDA11.2
Updated to use HailoRT 4.6 (developer-zone)
Retraining Dockers - each retraining docker has a corresponding README file near it. New retraining dockers:
- SSD
- YOLOX
- FCN
New models:
- yolov5l
Introducing Hailo Models, in-house pretrained networks with compatible Dockerfile for retraining
- yolov5m_vehicles (vehicle detection)
- tiny_yolov4_license_plates (license plate detection)
- lprnet (license plate recognition)
Added new documentation for the YAML structure

v1.5

Remove the HailoRT installation dependency.
Retraining Dockers
- YOLOv3
- NanoDet
- CenterPose
- Yolact
New models:
- unet_mobilenet_v2
Support for the Oxford-IIIT Pet Dataset
New multi-network example: detection_pose_estimation which combines the following networks:
- yolov5m_wo_spp_60p
- centerpose_repvgg_a0
Improvements:
- nanodet_repvgg mAP increased by 2%
New Tasks:
- hand_landmark_lite from MediaPipe
- palm_detection_lite from MediaPipe
Both tasks are without evaluation module.

v1.4

Update to use Dataflow Compiler v3.14.0 (developer-zone)
Update to use HailoRT 4.3.0 (developer-zone)
Introducing Hailo Models - in house pretrained networks with compatible Dockerfile for easy retraining:
- yolov5m_vehicles - vehicle detector based on yolov5m architecture
- tiny_yolov4_license_plates - license plate detector based on tiny_yolov4 architecture
New Task: face landmarks detection
- tddfa_mobilenet_v1
- Support 300W-LP and AFLW2k3d datasets
New features:
- Support compilation of several networks together - a.k.a multinets
- CLI for printing network information
Retraining Guide:
- New training guide for yolov4 with compatible Dockerfile
- Modifications for yolov5 retraining

v1.3

Update to use Dataflow Compiler v3.12.0 (developer-zone)
New task: indoor depth estimation
- fast_depth
- Support NYU Depth V2 Dataset
New models:
- resmlp12 - new architecture support paper
- yolox_l_leaky
Improvements:
- ssd_mobilenet_v1 - in-chip NMS optimization (de-fusing)
Model Optimization API Changes
- Model Optimization parameters can be updated using the networks' model script files (*.alls)
- Deprecated: quantization params in YAMLs
Training Guide: new training guide for yolov5 with compatible Dockerfile

v1.2

New features:
- YUV to RGB on core can be added through YAML configuration.
- Resize on core can be added through YAML configuration.
Support for the D2S Dataset
New task: instance segmentation
- yolact_mobilenet_v1 (coco)
- yolact_regnetx_800mf_20classes (coco)
- yolact_regnetx_600mf_31classes (d2s)
New models:
- nanodet_repvgg
- centernet_resnet_v1_50_postprocess
- yolov3 - darkent based
- yolox_s_wide_leaky
- deeplab_v3_mobilenet_v2_dilation
- centerpose_repvgg_a0
- yolov5s, yolov5m - original models from link
- yolov5m_yuv - contains resize and color conversion on HW
Improvements:
- tiny_yolov4
- yolov4
IBC and Equalization API change
Bug fixes

v1.1

Support for the VisDrone Dataset
New task: pose estimation
- centerpose_regnetx_200mf_fpn
- centerpose_regnetx_800mf
- centerpose_regnetx_1.6gf_fpn
New task: face detection
- lightfaceslim
- retinaface_mobilenet_v1
New models:
- hardnet39ds
- hardnet68
- yolox_tiny_leaky
- yolox_s_leaky
- deeplab_v3_mobilenet_v2
Use your own network manual for YOLOv3, YOLOv4_leaky and YOLOv5.

v1.0

Initial release
Support for object detection, semantic segmentation and classification networks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FilesExpand file tree

CHANGELOG.rst

Latest commit

History

CHANGELOG.rst

File metadata and controls