ultralytics: a unified framework for state-of-the‑art YOLO computer vision models

What it solves

Ultralytics 提供了一個統一的框架，用於最先進的電腦視覺任務。它簡化了物件偵測、實例分割、語意分割、影像分類與姿勢估計等模型的訓練、驗證與部署流程，讓這些複雜的 AI 任務變得易於使用且更易於上手。

How it works

此專案實作了 YOLO（You Only Look Once）系列模型，從早期的 YOLOv3 到最新的 YOLO26。使用者可以透過指令列介面（CLI）快速執行任務，或直接在 Python 專案中使用 ultralytics 套件整合框架。框架支援在自訂資料集上訓練、使用 mAP、mIoU 等指標評估效能，並可將模型匯出為 ONNX 等格式以供部署。

Who it’s for

此框架設計給開發者、AI 研究者與工程師使用，讓他們能在應用程式中整合高效能的電腦視覺功能，無論是針對邊緣裝置的輕量模型，或是大型基礎建設的高精度模型。

Highlights

Multi-task Support: Handles object detection, tracking, segmentation (instance and semantic), classification, and pose estimation.
Flexible Deployment: Supports export to ONNX and other formats for efficient deployment.
Crosspoint Accessibility: Offers both a CLI and a Python API for different developer workflows.
SOTA Performance: Provides a range of model sizes (nano, small, medium, large, x-large) to balance speed and accuracy.

SUMMARY

A high-performance computer vision framework providing state-of-the-art YOLO models for object detection, segmentation, classification, and pose estimation.

TITLE

ultralytics: a unified framework for state-of-the-art YOLO computer vision models

ultralytics：一個統一的最先進 YOLO 電腦視覺模型框架

ultralytics: a unified framework for state-of-the‑art YOLO computer vision models

What it solves

How it works

Who it’s for

Highlights

Sources