⭐ LuoHongkun的star列表,每6小时自动更新,参考链接->github star列表自动更新 ⭐
- JavaScript
- C++
- Python
- miscellaneous
- Jupyter Notebook
- TypeScript
- CSS
- CMake
- Swift
- Kotlin
- BibTeX Style
- Shell
- Astro
- HTML
- C
- TeX
- Rust
- Go
- Lua
- Vim Script
- Cuda
- Roff
- Dockerfile
- C#
- MATLAB
- Vue
- Makefile
- LLVM
- Matlab
- Cython
- SCSS
- Dart
- Markdown
- Clojure
- Java
-
NVIDIA/NemoClaw - Run OpenClaw securely inside NVIDIA OpenShell with managed inference
-
thedotmack/claude-mem - A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future sessions.
-
zxkmm/firefox_plugin_github_user_notes - This tool allows you to take notes on specific users and sync them across devices with Firefox installed.
-
remarkjs/remark-github - remark plugin to link references to commits, issues, pull-requests, and users, like on GitHub
-
Gar-b-age/CookLikeHOC - 🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.
-
MarSeventh/CloudFlare-ImgBed - Open-source file hosting solution based on CloudFlare (Image hosting/File storage/Cloud drive) / 基于 CloudFlare 的开源文件托管解决方案(图床/文件床/网盘)
-
xixu-me/xget - Ultra-high-performance, secure, all-in-one acceleration engine for developer resources
-
StrayMeteor3337/WechatRealFriends - 微信好友关系一键检测,基于微信ipad协议,看看有没有朋友偷偷删掉或者拉黑你
-
Z-Siqi/Clash-for-Windows_Chinese - clash for windows汉化版. 提供clash for windows的汉化版, 汉化补丁及汉化版安装程序
-
mrdoob/three.js - JavaScript 3D Library.
-
ConardLi/easy-dataset - A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
-
kishimisu/Gaussian-Splatting-WebGL - 3D Gaussian Splatting Renderer for WebGL
-
overleaf/overleaf - A web-based collaborative LaTeX editor
-
antimatter15/splat - WebGL 3D Gaussian Splat Viewer
-
eliahuhorwitz/Academic-project-page-template - A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/
-
lutzroeder/netron - Visualizer for neural network, deep learning and machine learning models
-
hsp-iit/tour-guide-robot - A collection of modules and classes that can be used to perform guided tours with R1 robot or to simply interact with it. It's also the repo that contains the configuration files to perform autonomous navigation with R1
-
cggos/shenlan_vio_course - 深蓝学院《视觉SLAM进阶:从零开始手写VIO》第一期
-
why-freedom/VIOLearning_Note_Code - 从零手写VIO | 深蓝学院VIO课程 | 完整代码 | 作业
-
pvangoor/eqvio - EqVIO: An Equivariant Filter for Visual Inertial Odometry
-
openscenegraph/OpenSceneGraph - OpenSceneGraph git repository
-
xuankuzcr/Global-LVBA - Global LiDAR-Visual Bundle Adjustment
-
KwanWaiPang/Elevation_Map - VINS-Mono + Elevation Mapping
-
Liansheng-Wang/Super-LIO - 【RA-L 2026】 A Robust and Efficient LiDAR-Inertial Odometry System with a Compact Mapping Strategy.
-
OctoMap/octomap - An Efficient Probabilistic 3D Mapping Framework Based on Octrees. Contains the main OctoMap library, the viewer octovis, and dynamicEDT3D.
-
shanmo/OrcVIO - The monocular version of OrcVIO, which reconstructs object using both semantic keypoints and bounding boxes
-
ethz-asl/panoptic_mapping - A flexible submap-based framework towards spatio-temporally consistent volumetric mapping and scene understanding.
-
NVlabs/GR00T-WholeBodyControl - Welcome to GR00T Whole-Body Control (WBC)! This is a unified platform for developing and deploying advanced humanoid controllers. This includes: Decoupled WBC models used in NVIDIA Isaac-Gr00t, Gr00t N1.5 and N1.6 and GEAR-SONIC
-
GYH-WHU/SPP_SPV - 本项目是一个基于C++和MATLAB的GNSS单点定位与测速解算系统,支持GPS和BDS双系统联合定位。系统能够解码NovAtel OEM7格式数据,采用伪距观测值进行单点定位和多普勒观测值进行速度解算,并实现电离层、对流层、卫星钟差等误差改正。系统提供事后处理、实时处理和数据采集三种工作模式,支持ECEF、BLH、ENU坐标系转换,并可计算定位误差与精度指标。此外,系统集成了MATLAB可视化功能,自动生成误差分析图表,为GNSS定位算法的研究和应用提供完整的工具平台。
-
ZikangYuan/sr_livo - [RA-L 2024] A LiDAR-inertial-visual odometry and mapping system based on the sweep reconstruction method
-
koide3/glim - GLIM: versatile and extensible point cloud-based 3D localization and mapping framework
-
Robotic-Developer-Road/FAST-LIVO2 - FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry
-
MIT-SPARK/Hydra - A system for building 3D Scene Graphs from sensor data in real-time
-
zhanjiawang/plane_localization - This is a ROS package for indoor global localization (relocation) based on plane octree and plane features.
-
3dv-casia/LSLM_VLoc - [RAL 2024] Lightweight Structured Line Map Based Visual Localization
-
rabienrose/crowdsourcing_visual_positioning_system - System of recording data, mapping, visualization and positioning based on mobile phone sensors
-
EdgarFx/BoWG_VINS_Loop - Integrates Bag-of-Word-Groups (BoWG) loop closure detection into VINS-Fusion
-
EdgarFx/BoWG - The official source code for "Bag of Word Groups (BoWG): A Robust and Efficient Loop Closure Detection Method Under Perceptual Aliasing" (IROS 2025)
-
HorizonRobotics/HoloAgent - A unified, agentic system for general-purpose robots, enabling multi-modal perception, mapping and localization, and autonomous mobility and manipulation, with intelligent interaction with users.
-
linyicheng1/LET-NET2 - An end-to-end lightweight CNN designed for sparse corner extraction and tracking
-
WeijieMax/EyeReal - Offcial Code of EyeReal
-
LiangHongY/fusions_slam - fastlio+rtk+speed,ieskf
-
2toinf/X-VLA - [ICLR 2026] The offical Implementation of "Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model"
-
InternRobotics/OpenHomie - Open-sourced code for "HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit".
-
google-deepmind/mujoco - Multi-Joint dynamics with Contact. A general purpose physics simulator.
-
rpng/sqrtVINS - Robust and Ultrafast Square-Root Filter-based 3D Motion Tracking
-
gaoxiang12/lightning-lm - Lidar Localization and Mapping
-
robotics-upo/D-LIO - D-LIO: 6DoF Direct LiDAR-Inertial Odometry based on Simultaneous Truncated Distance Field Mapping
-
SteveMacenski/slam_toolbox - Slam Toolbox for lifelong mapping and localization in potentially massive maps with ROS
-
hku-mars/BALM - An efficient and consistent bundle adjustment for lidar mapping
-
HorizonRobotics/GeoFlowSlam - [IROS 2025] A Robust Tightly-Coupled RGBD-Inertial and Legged Odometry Fusion SLAM for Dynamic Legged Robotics
-
KumarRobotics/kr_3d_active_ms_slam - [RA-L 2024] 3D Active Metric-Semantic SLAM
-
Zhefan-Xu/LV-DOT - LV-DOT: LiDAR-Visual Dynamic Obstacle Detection and Tracking (C++/Python/ROS)
-
hku-mars/VoxelMap - [RA-L 2022] An efficient and probabilistic adaptive voxel mapping method for LiDAR odometry
-
ethz-mrl/OKVIS2-X - OKVIS2-X: Open Keyframe-based Visual-Inertial SLAM Configurable with Dense Depth or LiDAR, and GNSS
-
ZikangYuan/sr_lio - [IROS 2024] A LiDAR-inertial odometry (LIO) package that can adjust the execution frequency beyond the sweep frequency
-
openai/openai-icpc-2025 - OpenAI 2025 ICPC Submissions
-
PRBonn/rko_lio - A Robust Approach for LiDAR-Inertial Odometry Without Sensor-Specific Modelling
-
ShuoYangRobotics/Cerberus - Visual-Inertial-Leg Odometry For Legged Robots
-
hku-mars/LIV_handhold_2 - LIV-Eye: A Low-Cost LiDAR-Inertial-Visual Fusion 3D Sensor for Robotics and Embodied AI.
-
qiayuanl/legged_control - NMPC, WBC, state estimation, and sim2real framework for legged robots based on OCS2 and ros-controls
-
taichi-dev/taichi - Productive, portable, and performant GPU programming in Python.
-
niessner/Matterport - Matterport3D is a pretty awesome dataset for RGB-D machine learning tasks :)
-
ethz-asl/COIN-LIO - 🪙 COIN-LIO: Complementary Intensity-Augmented LiDAR Inertial Odometry (ICRA 2024)
-
princeton-vl/DPVO - Deep Patch Visual Odometry/SLAM
-
APRIL-ZJU/Gaussian-LIC - [ICRA 2025] Gaussian-LIC: Real-Time Photo-Realistic SLAM with Gaussian Splatting and LiDAR-Inertial-Camera Fusion
-
GTLIDAR/emobipednav - emotion-aware Social Navigation for Bipedal Robots with Deep Reinforcement Learning
-
zhouyong1234/my_ekf_package - 使用卡尔曼滤波实现多传感器数据融合
-
HViktorTsoi/PV-LIO - A probabilistic voxelmap-based LiDAR-Inertial Odometry.
-
lab-sun/SLAMesh - The official implementation of SLAMesh.
-
hyye/lio-mapping - Implementation of Tightly Coupled 3D Lidar Inertial Odometry and Mapping (LIO-mapping)
-
nubot-nudt/SG-SLAM - [IROS 25] Leveraging Semantic Graphs for Efficient and Robust LiDAR SLAM
-
xz00/fast-lio2-map-based-localization - map-based localization.Modified from fast-lio2.
-
QCL0920/AF-RLIO - [ICRA 2025] AF-RLIO: Adaptive Fusion of Radar-LiDAR-Inertial Information forRobust Odometry in Challenging Environments
-
hku-mars/STD - A 3D point cloud descriptor for place recognition
-
yuhaozhang7/NGD-SLAM - [IROS 2025] NGD-SLAM: Towards Real-Time Dynamic SLAM without GPU.
-
hku-mars/M2Mapping - [ICRA 2025] Neural Surface Reconstruction and Rendering for LiDAR-Visual Systems
-
hku-mars/ImMesh - ImMesh: An Immediate LiDAR Localization and Meshing Framework
-
openvinotoolkit/openvino - OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
-
sjtuyinjie/Ground-Fusion2 - Ground-Fusion++: a modular sensor-fusion SLAM system(IROS2025)
-
url-kaist/TRAVEL - Traversable ground and above-ground object segmentation using graph representation of 3D LiDAR scans
-
jiachengliu3/OpenWBC - VR-based Robot Teleoperation and Data Collection System for Humanoid Whole-Body VLA (Unitree G1)
-
zm0612/funny_lidar_slam - A real-time multifunctional Lidar SLAM package.
-
arclab-hku/ESVIO - (RAL2023+IROS2023) ESVIO: Event-based Stereo Visual Inertial Odometry
-
NVIDIA-ISAAC-ROS/isaac_ros_pose_estimation - Deep learned, NVIDIA-accelerated 3D object pose estimation
-
chengwei0427/ESKF_LIO - IESKF-LIO reference to fast_lio1.0(参考fast-lio早期版本,复现的fast-lio2)
-
Mechazo11/ros2_orb_slam3 - A ROS2 Humble package that natively implementing ORB-SLAM3 V1.0 VSLAM framework
-
GuoYongyu/Dynamic-Line-ORB-SLAM2 - A ORB-SLAM2 instance with dynamic object removing and point-line features optimization.
-
ZikangYuan/dynamic_lio - [IROS 2025] A LiDAR-inertial odometry for dynamic environments
-
xbpeng/DeepMimic - Motion imitation with deep reinforcement learning.
-
zhh2005757/FAST-LIO-Multi-Sensor-Fusion - Fusing GNSS and wheel measurements based on FAST-LIO and IKFOM
-
yjsx/CELLmap - [ICRA 2025]CELLmap: Enhancing LiDAR SLAM through Elastic and Lightweight Spherical Map Representation
-
microsoft/WSL - Windows Subsystem for Linux
-
Geekgineer/YOLOs-CPP - Cross-Platform Production-ready C++ inference engine for YOLO models (v5-v12, YOLO26). Unified API for detection, segmentation, pose estimation, OBB, and classification. Built on ONNX Runtime and OpenCV. Optimized for CPU/GPU with quantization support.
-
zydddd/CornerVINS - [T-RO'25] CornerVINS: Accurate Localization and Layout Mapping for Structural Environments Leveraging Hierarchical Geometric Representations
-
HuangCongQing/pcl-learning - 🔥PCL(Point Cloud Library)点云库学习记录
-
lavaman131/dinov2.cpp - DINOv2 inference engine written in C/C++ using ggml and OpenCV.
-
libing64/pose_ekf - Extented Kalman Filter for 6D pose estimation using gps, imu, magnetometer and sonar sensor.
-
HKUST-Aerial-Robotics/A-LOAM - Advanced implementation of LOAM
-
KTH-RPL/dufomap - [RA-L'24] DUFOMap: Efficient Dynamic Awareness Mapping
-
ZikangYuan/voxel_svio - [RA-L 2025 Accept without Revision] A stereo visual-inertial odometry system based on voxel map
-
ayushgaud/path_planning - Quadcopter path planning using RRT* and minimum jerk trajectory generation
-
BohemianRhapsodyz/PSINS-ROS - A Strapdown Inertial Navigation System (PSINS) C++ algorithm and Integrated Navigation (GNSS/INS/Odometry) algorithm based on Kalman Filter for ROS
-
TJU-Aerial-Robotics/YOPO - You Only Plan Once: A Learning Based Quadrotor Planner
-
gisbi-kim/lt-mapper - A Modular Framework for LiDAR-based Lifelong Mapping
-
hku-mars/FAST-Calib - A Handy Extrinsic Calibration Tool for LiDAR-camera Systems.
-
deepglint/FAST_LIO_LOCALIZATION_HUMANOID - Localization by LiDAR for Humanoid(like Unitree G1)
-
MIT-SPARK/Kimera-VIO-ROS - ROS wrapper for Kimera-VIO
-
LC-Robotics/FreeDOM - FreeDOM: Online Dynamic Object Removal Framework for Static Map Construction Based on Conservative Free Space Estimation [RA-L 25]
-
NVIDIA-ISAAC-ROS/isaac_ros_visual_slam - Visual SLAM/odometry package based on NVIDIA-accelerated cuVSLAM
-
nvidia-isaac/cuVSLAM - cuVSLAM: CUDA-Accelerated Visual Odometry and Mapping
-
RainerKuemmerle/g2o - g2o: A General Framework for Graph Optimization
-
FeiGeChuanShu/Mask2Former-ncnn - naive c++ version of Mask2Former with ncnn
-
DreamWaterFound/Prerequisites-of-On-line-Semantic-VSLAM - 在线语义视觉SLAM基础:C++语言程序中调用Python实现的图像分割网络、获取分割结果
-
DreamWaterFound/Codes - 自己的一些零散代码合集
-
sair-lab/GroundSLAM - GroundSLAM: A Robust Visual SLAM System for Warehouse Robots Using Ground Textures
-
weihaoysgs/vins-fast - VINS has been completely reconstructed and rewritten using C++ object-oriented, and supports stereo or stereo+ IMU.
-
weihaoysgs/ssvio - A lightweight setero visual SLAM system implementation, including complete closed-loop detection, front-end tracking, back-end optimization, visualization and other parts.
-
SlamMate/CDS-SLAM-Semantic-mapping-in-dynamic-environment - This project is the result of my undergraduate dissertation. The localization in dynamic environment is to deploy TensorRT optimized YOLOX in the front end of ORB-SLAM3 for object detection, and then eliminate all points belonging to the human bounding box. At the same time, the semantic information is sent to the mapping module to dye the 3D point cloud. The disadvantage of this project is that in the localization module, only the points belonging to people are processed, because people are dynamic most of the time. In the mapping module, we did not segment semantic objects accurately, resulting in wrong coloring of point clouds of other objects.
-
rpng/ov_plane - A monocular plane-aided visual-inertial odometry
-
KumarRobotics/SLIDE_SLAM - SlideSLAM: Sparse, Lightweight, Decentralized Metric-Semantic SLAM for Multi-Robot Navigation
-
lixiny/ORB-SLAM2-DualCam - 🎓 SJTU M.S. Dissertation. 基于多相机的同步定位与建图方法研究
-
chengwei0427/II-NVM - [RA-L'25 & IROS'25] II-NVM: Enhancing Map Accuracy and Consistency with Normal Vector-Assisted Mapping
-
superxslam/SuperOdom - A highly robust and accurate LiDAR-only, LiDAR-inertial odometry
-
Tang-KaiKai/EDLine - Line Segment Extraction Algorithm( less than 2ms in 1280*720 gray image )
-
lian-yue0515/D-LI-Init - Dynamic Initialization for LiDAR-inertial SLAM
-
haosulab/SAPIEN - SAPIEN Embodied AI Platform
-
HITSZ-NRSL/RCPCC - [ICRA 2025] Real-Time LiDAR Point Cloud Compression and Transmission for Resource-constrained Robots
-
christopherdoer/rio - RIO - EKF-based Radar Inertial Odometry using 4D mmWave radar sensors
-
HKUST-Aerial-Robotics/FALCON - [T-RO 2024] FALCON: Fast Autonomous Aerial Exploration using Coverage Path Guidance.
-
Happy-ZZX/PL-VIWO - Lightweight and Robust Point-Line Monocular Visual Inertial Wheel Odometry (IROS2025)
-
RoboSense-Robotics/robosense_ac_slam - A Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry (LIVO).
-
Zhefan-Xu/Intent-MPC - [IEEE RA-L'25] Intent Prediction-Driven Model Predictive Control for UAV Planning and Navigation in Dynamic Environments (C++/ROS)
-
KumarRobotics/AllocNet - A lightweight learning-based trajectory optimization framework.
-
Livox-SDK/livox_mapping - A mapping package for Livox LiDARs
-
InternRobotics/HorizonGS - [CVPR 2025] Horizon-GS: Unified 3D Gaussian Splatting for Large-Scale Aerial-to-Ground Scenes
-
ShijieZhou-UCLA/feature-3dgs - [CVPR 2024 Highlight] Feature 3DGS: Supercharging 3D Gaussian Splatting to Enable Distilled Feature Fields
-
yanliang-wang/FAST_LIO_LC - The tight integration of FAST-LIO with Radius-Search-based loop closure module.
-
gabime/spdlog - Fast C++ logging library.
-
hku-mars/GS-SDF - [IROS 2025] LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction
-
JokerJohn/Cloud_Map_Evaluation - [RAL' 25 & IROS‘ 25] MapEval: Towards Unified, Robust and Efficient SLAM Map Evaluation Framework.
-
RobustFieldAutonomyLab/LeGO-LOAM - LeGO-LOAM: Lightweight and Ground-Optimized Lidar Odometry and Mapping on Variable Terrain
-
SlamCabbage/NDTMC - [IROS 2024] A 3D Global Descriptor For Loop Closure Detection. NDT-Map-Code.
-
gogojjh/M-LOAM - Robust Odometry and Mapping for Multi-LiDAR Systems with Online Extrinsic Calibration
-
JokerJohn/PALoc - [TMECH'2024] PALoc: Advancing SLAM Benchmarking with Prior-Assisted 6-DoF Trajectory Generation and Uncertainty Estimation
-
sikang/mpl_ros - A ROS wrapper for trajectory planning based on motion primitives
-
Zhefan-Xu/NavRL - [IEEE RA-L'25] NavRL: Learning Safe Flight in Dynamic Environments (NVIDIA Isaac/Python/ROS1/ROS2)
-
Zhefan-Xu/CERLAB-UAV-Autonomy - [CMU] A Versatile and Modular Framework Designed for Autonomous Unmanned Aerial Vehicles [UAVs] (C++/ROS/PX4)
-
JD-hust/gs-dso - a monocular direct sparse odometry with prior continuous 3D gaussian maps for indoor environments
-
YWL0720/YOLO_ORB_SLAM3_with_pointcloud_map - This code is an extended version of YOLO_ORB_SLAM3, which adds the functionality of creating dense point cloud maps.
-
engcang/FAST-LIO-SAM - a SLAM implementation combining FAST-LIO2 with pose graph optimization and loop closing based on LIO-SAM paper
-
lausen001/LIO-SAM-DetailedNote - LIO-SAM源码详细注释,3D SLAM融合激光、IMU、GPS
-
deepseek-ai/FlashMLA - FlashMLA: Efficient Multi-head Latent Attention Kernels
-
78/xiaozhi-esp32 - An MCP-based chatbot | 一个基于MCP的聊天机器人
-
brucezhcw/VINS-Explorer - A Super Tightly Coupled Visual-Inertial State Estimator
-
fishmarch/ORB_SLAM3_Fixed - Fixed some bugs of original ORB_SLAM3
-
mp3guy/ElasticFusion - Real-time dense visual SLAM system
-
ethz-asl/maplab - A Modular and Multi-Modal Mapping Framework
-
MIT-SPARK/Spatial-Hash - Minimal C++ library for spatial data structures based on voxel-block-hashing
-
MIT-SPARK/Khronos - Spatio-Temporal Metric-Semantic SLAM
-
shichaoy/cube_slam - CubeSLAM: Monocular 3D Object Detection and SLAM
-
ethz-mrl/GSFusion - GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion
-
shg8/3DGS.cpp - A cross-platform, high performance renderer for Gaussian Splatting using Vulkan Compute. Supports Windows, Linux, macOS, iOS, and visionOS
-
hyperlogic/splatapult - A 3d gaussian splatting renderer in C++ and OpenGL
-
MIT-SPARK/Kimera-VIO - Visual Inertial Odometry with SLAM capabilities and 3D Mesh generation.
-
colmap/colmap - COLMAP - Structure-from-Motion and Multi-View Stereo
-
HKUST-Aerial-Robotics/G3Reg - A fast and robust global registration library for outdoor LiDAR point clouds.
-
hku-mars/FAST-LIVO - A Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry (LIVO).
-
jedeschaud/ct_icp - CT-ICP: Continuous-Time LiDAR Odometry
-
hku-mars/r3live - A Robust, Real-time, RGB-colored, LiDAR-Inertial-Visual tightly-coupled state Estimation and mapping package
-
gisbi-kim/SC-LIO-SAM - LiDAR-inertial SLAM: Scan Context + LIO-SAM
-
lovelyyoshino/FAST_LIO2_Noted - FAST_LIO2_Noted 中文注释版
-
luohongk/SuperVINS - 📖[IEEE Sensors Journal (JSEN) ] SuperVINS: A Real-Time Visual-Inertial SLAM Framework for Challenging Imaging Conditions (integrated deep learning features)
-
linyicheng1/ceres-example - some ceres examples with notes
-
XRIM-Lab/GS-CPR - [ICLR 2025] Official repo of "GS-CPR: Efficient Camera Pose Refinement via 3D Gaussian Splatting"
-
PJLab-ADG/SensorsCalibration - OpenCalib: A Multi-sensor Calibration Toolbox for Autonomous Driving
-
IF-A-CAT/LIR-LIVO - LIR-LIVO: A Lightweight,Robust Lidar/Vision/Inertial Odometry with Illumination-Resilient Deep Features
-
PJLab-ADG/Livox-Mapping - An all-in-one and ready-to-use LiDAR-inertial odometry system for Livox LiDAR
-
ROBOT-WSC/BEV-LSLAM - 2025 RAL
-
YungeCui/BoW3D - [RA-L] BoW3D: Bag of Words for Real-Time Loop Closing in 3D LiDAR SLAM.
-
sdwyc/ROLO - This is a ROS package for lidar odometry implementation using rotation optimization method.
-
liquorleaf/OmniGS - [WACV 2025] OmniGS: Fast Radiance Field Reconstruction using Omnidirectional Gaussian Splatting
-
johannes-graeter/limo - Lidar-Monocular Visual Odometry
-
HKUST-Aerial-Robotics/ESVO - This repository maintains the implementation of "Event-based Stereo Visual Odometry".
-
strasdat/Sophus - C++ implementation of Lie Groups using Eigen.
-
YWL0720/YOLO_ORB_SLAM3 - This is an improved version of ORB-SLAM3 that adds an object detection module implemented with YOLOv5 to achieve SLAM in dynamic environments.
-
ACFR-RPG/DynOSAM - Offical code release for DynoSAM: Dynamic Object Smoothing And Mapping. Accepted Transactions on Robotics (Visual SLAM SI). A visual SLAM framework and pipeline for Dynamic environements, estimating for the motion/pose of objects and their structure, as well as the camera odometry and static map.
-
suchetanrs/ORB-SLAM3-ROS2-Docker - This repository contains everything needed to run ORB-SLAM3 on a docker container with ROS2 Humble with Ubuntu 22.04.
-
introlab/rtabmap - RTAB-Map library and standalone application
-
hku-mars/FAST_LIO - A computationally efficient and robust LiDAR-inertial odometry (LIO) package
-
YibinWu/LIO-EKF - [ICRA2024] Maybe the simplest LiDAR-inertial odometry that one can have.
-
lpercc/HA3D_simulator - Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (NeurIPS DB Track'24 Spotlight).
-
google/minja - A minimalistic C++ Jinja templating engine for LLM chat templates
-
onnx/onnx-tensorrt - ONNX-TensorRT: TensorRT backend for ONNX
-
tqdm/tqdm.cpp - C++ port of tqdm
-
ethz-asl/lidar_align - A simple method for finding the extrinsic calibration between a 3D lidar and a 6-dof pose sensor
-
alexhua/Aria2-Manager - A useful tool to run Aria2 in the background easily
-
MrNeRF/Light_Glue_CPP - CPP Implementation of "LightGlue: Local Feature Matching at Light Speed"
-
HKUST-Aerial-Robotics/RIO - Optimization Based and Point Uncertainty Aware Radar-inertial Odometry for 4D Radar System
-
OctoMap/octomap_msgs - ROS package to provide messages and serializations / conversion for the OctoMap library
-
pierotofy/OpenSplat - Production-grade 3D gaussian splatting with CPU/GPU support for Windows, Mac and Linux 🚀
-
SJTU-ViSYS/Ground-Fusion - Ground-Fusion: A Low-cost Ground SLAM System Robust to Corner Cases (ICRA2024)
-
tum-vision/dvo_slam - Dense Visual Odometry and SLAM
-
Zhefan-Xu/time_optimizer - [IEEE ICRA'24] Optimal Trajectory Time Allocation Library for Autonomous Robots (C++/ROS)
-
UnknownFreeOccupied/ufomap - UFOMap: An Efficient Probabilistic 3D Mapping Framework That Embraces the Unknown
-
unitreerobotics/point_lio_unilidar - Point-LIO algorithm for Unitree LiDAR products.
-
hku-mars/IKFoM - A computationally efficient and convenient toolkit of iterated Kalman filter.
-
facebookresearch/Replica-Dataset - The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .
-
stereolabs/zed-sdk - ⚡️The spatial perception framework for rapidly building smart robots and spaces
-
stereolabs/zed-ros-wrapper - ROS wrapper for the ZED SDK
-
hku-mars/ikd-Tree - This repository provides implementation of an incremental k-d tree for robotic applications.
-
triple-mu/YOLOv8-TensorRT - YOLOv8 using TensorRT accelerate !
-
Glencsa/YOLOv8-ORB-SLAM3 - YOLOv8-ORB-SLAM3: Semantic SLAM with dynamic feature point removal
-
YWL0720/I2EKF-LO - [IROS 2024] I2EKF-LO: A Dual-Iteration Extended Kalman Filter based LiDAR Odometry
-
arclab-hku/Event_based_VO-VIO-SLAM - HKU-Dataset for Event-based VO/VIO/SLAM
-
LeonardoDiCaprio1/Map_ORBSLAM_ROS - You can densely map datasets through RVIZ and create your own TUM dataset to create maps
-
qdLMF/VINS-Fusion-GPU-BA - A CUDA reimplementation of Bundle Adjustment for VINS-Fusion
-
halismai/photobundle - Photometric Bundle Adjustment for Vision-Based SLAM
-
NKU-MobFly-Robotics/LRAE - LRAE: Large-Region-Aware Safe and Fast Autonomous Exploration of Ground Robots for Uneven Terrains, RA-L, 2024
-
TUMFTM/ORB_SLAM3_RGBL - RGB-L: An Extension to Integrate LiDAR Data into ORB-SLAM3
-
HKUST-Aerial-Robotics/FM-Fusion - [RA-L] FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models
-
GDUT-Kyle/faster_lio_sam - FASTER-LIO-SAM: A SLAM system based on iVox and GTSAM.
-
Taeyoung96/GRIL-Calib - [RA-L 2024] GRIL-Calib: Targetless Ground Robot IMU-LiDAR Extrinsic Calibration Method using Ground Plane Motion Constraints
-
Yaepiii/C-LOAM - A Compact LiDAR Odometry and Mapping with Dynamic Removal [ICUS 2024]
-
Yaepiii/TRLO - [T-IM 2025] TRLO: An Efficient LiDAR Odometry with 3D Dynamic Object Tracking and Removal
-
MrNeRF/LichtFeld-Studio - Train, inspect, edit, automate, and export 3D Gaussian Splatting scenes from a single native application.
-
xieyuser/GS-LIVM - [ICCV'25] GS-LIVM: Real-Time Photo-Realistic LiDAR-Inertial-Visual Mapping with Gaussian Splatting
-
Unsigned-Long/slam-tricks - small, powerful and beautiful slam tricks with theory and practice
-
farhad-dalirani/StereoVision-SLAM - StereoVision-SLAM is a real-time visual stereo SLAM (Simultaneous Localization and Mapping)
-
gaoxiang12/faster-lio - Faster-LIO: Lightweight Tightly Coupled Lidar-inertial Odometry using Parallel Sparse Incremental Voxels
-
Geekgineer/CloudPeek - CloudPeek is a lightweight, cross-platform, single-header C++ point cloud viewer. It’s designed for simplicity and efficiency, requiring no heavy libraries like PCL or Open3D. Ideal for visualizing and interacting with 3D data from LiDAR, photogrammetry, or other datasets, CloudPeek delivers powerful, real-time exploration in a minimalistic package
-
rubengooj/pl-slam - This code contains an algorithm to compute stereo visual SLAM by using both point and line segment features.
-
DapengFeng/cartgs - [RA-L] CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM
-
alsora/ros2-ORB_SLAM2 - ROS2 node wrapping the ORB_SLAM2 library
-
GREAT-WHU/RoadLib - A lightweight library for instance-level visual road marking extraction, parameterization, mapping, etc.
-
TohsakaZ/ppp_ex - Prise Point Positioning Experiment
-
ChaoqinRobotics/LINS---LiDAR-inertial-SLAM - A Lidar-Inertial State Estimator for Robust and Efficient Navigation based on iterated error-state Kalman filter
-
udaysankar01/xfeatSLAM - Real-time SLAM with deep features (XFeat + ORB-SLAM3)
-
yanyan-li/Structure-SLAM-PointLine - This is a basic point-line SLAM system based on ORBSLAM2.
-
APRIL-ZJU/lidar_IMU_calib - [IROS 2020] Targetless Calibration of LiDAR-IMU System Based on Continuous-time Batch Estimation
-
ashishkumar822/Jetson-SLAM - A high Speed GPU accelerated SLAM for Low Powered Devices, IEEE- RAL-2023, ICRA 2024
-
alejandrofontan/AnyFeature-VSLAM - Any-Feature V-SLAM is an automated visual SLAM library for Monocular cameras capable of switching to a chosen type of feature effortlessly and without manual intervention.
-
fishmarch/MS-SLAM - [JFR 2024] This is the official implementation of MS-SLAM, a memory-efficient visual SLAM system removing redundant map points to save memory consumption.
-
2013fangwentao/Multi_Sensor_Fusion - Multi-Sensor Fusion (GNSS, IMU, Camera) 多源多传感器融合定位 GPS/INS组合导航 PPP/INS紧组合
-
ethz-asl/kalibr - The Kalibr visual-inertial calibration toolbox
-
LuoXubo/JointLoc - [IROS 2024] JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation
-
ethz-asl/okvis - OKVIS: Open Keyframe-based Visual-Inertial SLAM.
-
YangSiri/OR-LIM - OR-LIM: Observability-aware robust LiDAR-Inertial-Mapping under High Dynamic Sensor Motion
-
jimazeyu/GraspSplats - GraspSplats: Efficient Manipulation with 3D Feature Splatting
-
ethz-mrl/okvis2 - Open Keyframe-based Visual-Inertial SLAM (Version 2)
-
HViktorTsoi/FAST_LIO_LOCALIZATION - A simple localization framework that can re-localize in built maps based on FAST-LIO.
-
GREAT-WHU/GREAT-PVT - GREAT-PVT: Precision Positioning and Navigation Software by Wuhan University GREAT Group
-
mengkai98/BA_Play - 随手写个BA玩玩
-
Yixin-F/LiLoc - (ICRA 2025) LiLoc: Lifelong Localization using Adaptive Submap Joining and Egocentric Factor Graph
-
Yixin-F/better_fastlio2 - Postgraduate Thesis: fast_lio_sam + dynamic removal (T-GRS 2024) + multi-session mapping (ICRA 2022 Kim) + object-level update + online relocalization (ICRA 2025) + real-world application (MD-LVIO)
-
udaysankar01/xfeat_cpp - The C++ Implementation of XFeat (Accelerated Features).
-
chengwei0427/ct-lio - CT-LIO: Continuous-Time LiDAR-Inertial Odometry
-
felixendres/rgbdslam_v2 - RGB-D SLAM for ROS
-
Eliaul/Eq-LIO - A tightly coupled LIO framework based on the equivariant filter.
-
HKUST-Aerial-Robotics/GVINS - Tightly coupled GNSS-Visual-Inertial system for locally smooth and globally consistent state estimation in complex environment.
-
HuajianUP/Photo-SLAM - [CVPR 2024] Photo-SLAM: Real-time Simultaneous Localization and Photorealistic Mapping for Monocular, Stereo, and RGB-D Cameras
-
rmsalinas/DBow3 - Improved version of DBow2
-
MigVega/SLAM2REF - This project allows the alignment and correction of LiDAR-based SLAM session data with a reference map or another session, also the retrieval of 6-DoF poses with accuracy of up to 3 cm given an accurate TLS point cloud as a reference map (this map should be accurate at least regarding the position of permanent elements such as walls and columns).
-
lava/matplotlib-cpp - Extremely simple yet powerful header-only C++ plotting library built on the popular matplotlib
-
kuankuan-yue/VINS-FUSION-leanrning - VINS-FUSION中文注释版.目前网络上对于VINS-mono的代码已经有很多讲解和注释了,但是对于VINS-FUSION(以下简称VF)的注释还是很少的,刚好本人最近也正在学习VIO的相关知识,所以对VF按照程序执行顺序进行了十分详细的注释,同时为了和大家进行交流学习,所以把相关注释代码进行开源。因水平有限,错误肯定不少,还请各位大佬们指正。
-
gtrll/gpslam - Sparse Gaussian Processes for SLAM
-
UZ-SLAMLab/ORB_SLAM3 - ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
-
i3tyc/AdaptSLAM - AdaptSLAM: Edge-Assisted Adaptive SLAM with Resource Constraints via Uncertainty Minimization
-
Zhefan-Xu/onboard_detector - [IEEE RA-L'24] Dynamic Obstacle Detection and Tracking (DODT) algorithm for Autonomous Robots (C++/ROS)
-
nkliuhui/sync_gps_lidar_imu_cam - lidar-imu-cam-GPS时间戳硬件同步方案
-
tum-vision/lsd_slam - LSD-SLAM
-
HeYijia/VINS-Course - VINS-Mono code without Ceres or ROS
-
SainingZhang/UC-GS - [BMVC 2024] Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty
-
AndreasArendt/OpenRTK - Open Source precise GNSS Software
-
PetWorm/LARVIO - A lightweight, accurate and robust monocular visual inertial odometry based on Multi-State Constraint Kalman Filter.
-
ManiiXu/VINS-Mono-Learning - VINS-Mono代码注释,仅供学习
-
sair-lab/AirSLAM - [TRO 2025] AirVO upgrades to AirSLAM
-
city-super/Octree-GS - [TPAMI 2025] Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians
-
CG050523/PPP-Navigation - 伪距单点定位程序实现,仅学习使用
-
VIS4ROB-lab/ccm_slam - CCM-SLAM: Robust and Efficient Centralized Collaborative Monocular SLAM for Robotic Teams
-
microsoft/Azure-Kinect-Sensor-SDK - A cross platform (Linux and Windows) user mode SDK to read data from your Azure Kinect device.
-
hku-mars/FAST-LIVO2 - FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry
-
yanyan-li/PlanarSLAM - A RGB-D SLAM system for structural scenes, which makes use of point-line-plane features and the Manhattan World assumption.
-
TixiaoShan/LIO-SAM - LIO-SAM: Tightly-coupled Lidar Inertial Odometry via Smoothing and Mapping
-
TixiaoShan/LVI-SAM - LVI-SAM: Tightly-coupled Lidar-Visual-Inertial Odometry via Smoothing and Mapping
-
danping/CoSLAM - CoSLAM is a visual SLAM software that aims to use multiple freely moving cameras to simultaneously compute their egomotion and the 3D map of the surrounding scenes in a highly dynamic environment.
-
google/or-tools - Google's Operations Research tools:
-
JakobEngel/dso - Direct Sparse Odometry
-
AnswerDotAI/gpu.cpp - A lightweight library for portable low-level GPU computation using WebGPU.
-
ethz-asl/wavemap - Fast, efficient and accurate multi-resolution, multi-sensor 3D occupancy mapping
-
RonaldSun/VI-Stereo-DSO - Direct sparse odometry combined with stereo cameras and IMU
-
MIT-SPARK/Kimera-RPGO - Robust Pose Graph Optimization
-
lian-yue0515/MM-LINS - a Multi-Map LiDAR-Inertial System for Over-Degraded Environments
-
engcang/vins-application - VINS-Fusion, VINS-Fisheye, OpenVINS, EnVIO, ROVIO, S-MSCKF, ORB-SLAM2, NVIDIA Elbrus application of different sets of cameras and imu on different board including desktop and Jetson boards
-
floatlazer/semantic_slam - Real time semantic slam in ROS with a hand held RGB-D camera
-
koide3/gtsam_points - A collection of GTSAM factors and optimizers for point cloud SLAM
-
HKUST-SAIL/RaDe-GS - RaDe-GS: Rasterizing Depth in Gaussian Splatting
-
Unsigned-Long/iKalibr - [IEEE T-RO 2025] iKalibr: Multi-Sensor Calibration (Extrinsics & Time Offsets)
-
emiliofidalgo/ibow-lcd - Appearance-based Loop Closure Detection using Incremental Bags of Binary Words
-
bxh1/VIDO-SLAM - VIDO-SLAM is a Visual Inertial SLAM system for dynamic environments, and it can also estimate dynamic objects motion and track objects.
-
url-kaist/dynaVINS - DynaVINS : A Visual-Inertial SLAM for Dynamic Environments
-
MAVIS-SLAM/OpenMAVIS - An open-source implementation of MAVIS-SLAM.
-
linyicheng1/OpenSLAM-Notes - 个人对目前较为成熟的视觉/激光SLAM算法进行的注释以及解读文件
-
guisoares9/VINS-Fusion - OpenCV 4, ROS Noetic, and Ceres adaptation of VINS-Fusion. An optimization-based multi-sensor state estimator
-
cyp4x141/VINS-Fusion-noetic-Opencv4 - VINS-Fusion for opencv4 + noetic +ubuntu20.04
-
shanpenghui/ORB_SLAM3_Fixed - Optimized ORBSLAM3 to run on TUM/EuRoc/KITTI dataset
-
karanchawla/GPS_IMU_Kalman_Filter - Fusing GPS, IMU and Encoder sensors for accurate state estimation.
-
yuefanhao/SuperPoint-SuperGlue-TensorRT - SuperPoint and SuperGlue with TensorRT. Deploy with C++.
-
kajo-kurisu/D_VINS - Merge superpoint、lightglue、MixVPR into VINS-FUSION for loop closure with TensorRT
-
HeYijia/PL-VIO - monocular visual inertial system with point and line features
-
openxrlab/xrslam - OpenXRLab Visual-inertial SLAM Toolbox and Benchmark
-
KumarRobotics/msckf_vio - Robust Stereo Visual Inertial Odometry for Fast Autonomous Flight
-
i2Nav-WHU/IC-GVINS - A Robust, Real-time, INS-Centric GNSS-Visual-Inertial Navigation System
-
ydsf16/imu_gps_localization - Using error-state Kalman filter to fuse the IMU and GPS data for localization.
-
cnqiangfu/PL-VINS - PL-VINS: Real-Time Monocular Visual-Inertial SLAM with Point and Line Features
-
microsoft/onnxruntime - ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
-
HKUST-Aerial-Robotics/VINS-Mono - A Robust and Versatile Monocular Visual-Inertial State Estimator
-
ethz-asl/ethzasl_msf - MSF - Modular framework for multi sensor fusion based on an Extended Kalman Filter (EKF)
-
zm0612/eskf-gps-imu-fusion - 误差状态卡尔曼ESKF滤波器融合GPS和IMU,实现更高精度的定位
-
Ewenwan/MVision - 机器人视觉 移动机器人 VS-SLAM ORB-SLAM2 深度学习目标检测 yolov3 行为检测 opencv PCL 机器学习 无人驾驶
-
raulmur/ORB_SLAM2 - Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities
-
ceres-solver/ceres-solver - A large scale non-linear optimization library
-
shazraz/Extended-Kalman-Filter - Implementation of an EKF in C++
-
rpng/open_vins - An open source platform for visual-inertial navigation research.
-
Ewenwan/ORB_SLAM2_SSD_Semantic - 动态语义SLAM 目标检测+VSLAM+光流/多视角几何动态物体检测+octomap地图+目标数据库
-
NVIDIA/soma-retargeter - SOMA BVH to humanoid robot motion retargeting library built with Newton and NVIDIA Warp
-
TeleHuman/HumanoidSoccer - Learning Soccer Skills for Humanoid Robots: A Progressive Perception-Action Framework
-
robodhruv/visualnav-transformer - Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
-
csiro-robotics/WildCross - [IEEE ICRA 2026] The official repository for paper WildCross: A Cross-Modal Large Scale Benchmark for Place Recognition and Metric Depth Estimation in Natural Environments at IEEE ICRA 2026
-
dimensionalOS/dimos - Dimensional is the agentic operating system for physical space. Vibecode humanoids, quadrupeds, drones, and other hardware platforms in natural language and build multi-agent systems that work seamlessly with physical input (cameras, lidar, actuators).
-
zhutengjie/CLOT - official code for paper CLOT: Closed-Loop Global Motion Tracking for Whole-Body Humanoid Teleoperation
-
HKUDS/nanobot - "🐈 nanobot: The Ultra-Lightweight OpenClaw"
-
UniflexAI/tinynav - TinyNav: A lightweight, hackable system to guide your robots anywhere.
-
Humanoid-SkillBlender/SkillBlender - Official implementation of SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill Blending
-
K-Dense-AI/claude-scientific-skills - A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.
-
leggedrobotics/rsl_rl - A fast and simple implementation of learning algorithms for robotics.
-
AgibotTech/genie_sim - Simulation Platform from AgiBot
-
hwjiang1510/RayZer - Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"
-
Spirit-AI-Team/spirit-v1.5 - Spirit-v1.5: A Robotic Foundation Model by Spirit AI
-
lukasmolnar/wb-mpc-locoman - A flexible optimization framework for whole-body loco-manipulation, built with Pinocchio and CasADi. Supports multiple dynamics formulations and solver backends.
-
nvidia-cosmos/cosmos-predict2.5 - Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.
-
be2rlab/km-vipe - Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM
-
marmotlab/ORION-multi-agent-navigation - ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation
-
ManifoldTechLtd/Odin-Nav-Stack - An open-source navigation stack based on Odin1.
-
R-C-Group/Odin-Navigation-Stack - Odin-Navigation-Stack的解读
-
Wenyueh/MinivLLM - Based on Nano-vLLM, a simple replication of vLLM with self-contained paged attention and flash attention implementation
-
RPL-CS-UCL/litevloc_code - LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation
-
facebookresearch/home-robot - Mobile manipulation research tools for roboticists
-
MaureenZOU/m3-spatial - [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory
-
ika-rwth-aachen/ros2_unbag - A ROS 2 tool for exporting bags to human readable files. Supports pluggable export routines to handle any message type.
-
NVlabs/vla0 - VLA-0: Building State-of-the-Art VLAs with Zero Modification
-
realsee-developer/RealSee3D - RealSee3D: A multi-view RGB-D dataset combining real-world captures and procedurally generated scenes, with extensible annotations for diverse 3D vision research.
-
Galery23/SAGE-3D_Official - This is the official repository of the paper "Towards Physically Executable 3D Gaussian for Embodied Navigation".
-
ZHUANGHP/Any-SSR - This is the official code for Any-SSR "Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model"
-
Ericonaldo/visual_wholebody - Train a loco-manipulation dog with RL
-
facebookresearch/spider - A general physic-based retargeting framework.
-
hanruihua/ir-sim - A Python-based lightweight robot simulator designed for navigation, control, and learning
-
Any-4D/Any4D - Any4D: Unified Feed-Forward Metric 4D Reconstruction
-
cmjang/InternNav-deploy - Edge deployment guide for InternNav-based perception and navigation on Unitree Go2 / Go2W / B2 robots (ROS 2, RealSense, Python).
-
nvidia-isaac/WBC-AGILE - Whole Body Control for humanoids: AGILE
-
Xian-Bei/TALO - Pushing 3D Vision Foundation Models Towards Globally Consistent Online Reconstruction
-
ContinualAI/avalanche - Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
-
arclab-hku/P2M - [RA-L'25] A Simple LiDAR-centric End-to-end Navigation Framework in Dynamic Environments
-
3DAgentWorld/VGGT4D - The official implementation of the paper “VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction.”
-
Mayankm96/isaac-spinning-up - Educational Resource for Isaac Lab
-
open-gigaai/giga-brain-0 - GigaBrain-0: A World Model-Powered Vision-Language-Action Model
-
fanegg/Human3R - An unified model for 4D human-scene reconstruction
-
co-me-tokens/CoMe - [CVPR 26] Release repo of our work "Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers"
-
BIT-DYN/OpenGraph - [RAL 2024] OpenGraphs: Open-Vocabulary Hierarchical 3D Scene Graphs in Large-Scale Outdoor Environments
-
amazon-far/TWIST2 - [arXiv 2025] TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System
-
Hilti-Research/hilti-trimble-slam-challenge-2026 - 360 Visual-Inertial Benchmark with Floor Plan Priors for SLAM and Localization
-
leggedrobotics/pace-sim2real - PACE: A systematic approach for sim-to-real transfer of legged robots, identifying actuator and joint dynamics with standard joint encoders.
-
xuxw98/Online3D - [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception
-
KumarRobotics/RT-GuIDE - [RA-L 2025] RT-GuIDE: Real-Time Gaussian Splatting for Information-Driven Exploration
-
dcharatan/pixelsplat - [CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacchi, and Vincent Sitzmann
-
Motphys/MotrixLab - A general-purpose machine learning architecture designed for robot training
-
concept-graphs/concept-graphs - Official code release for ConceptGraphs
-
UnrealZoo/unrealzoo-gym - [ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI
-
rossning92/helicopter-rl - Train a reinforcement learning agent (PPO) to play a retro helicopter arcade game using Stable-Baselines3 and a custom Gymnasium environment.
-
AMAP-EAI/SocialNav - Official implementation for "SocialNav: Training Human-Inspired Foundation Model for Socially-Aware Embodied Navigation"
-
WEIFENG2333/VideoCaptioner - 🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.
-
ymy-k/Hi-SAM - [IEEE TPAMI] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
-
FlagOpen/RoboCOIN - RoboCoin + LeRobot integration
-
facebookresearch/sam-3d-body - The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the model.
-
facebookresearch/sam-3d-objects - SAM 3D Objects
-
AIR-DISCOVER/FreeAskWorld - [AAAI 2026 Oral] FreeAskWorld is an interactive simulation framework that integrates large language models (LLMs) for high-level planning and socially grounded interaction in embodied AI.
-
Maxwell-Zhao/RoboSimGS - Code for [RA-L] High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting
-
fastgs/FastGS - [CVPR 2026] Offical code for "FastGS: Training 3D Gaussian Splatting in 100 Seconds"
-
agrimgupta92/derl - Code for "Embodied Intelligence via Learning and Evolution", Gupta et al, Nature Communications
-
GREAT-WHU/MASt3R-Fusion - Integrating Feed-Forward Visual Model with IMU, GNSS for High-Functionality SLAM.
-
mrwangyou/SCOPE - Official repository of "Expand Your SCOPE, Semantic Cognition Over Potential-based Exploration for Embodied Visual Navigation"
-
Livioni/OmniVGGT-official - [CVPR 2026 MAIN] OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer
-
sair-lab/AirRoom - [CVPR 2025] AirRoom: Objects Matter in Room Reidentification
-
ByteDance-Seed/Depth-Anything-3 - Depth Anything 3
-
JIEKE66633/One-click-cleaning-of-C-drive - 只需轻松一点,即可安全高效的清理C盘残留和垃圾,并且对电脑毫无危险
-
LeapLabTHU/AdaptiveNN - [Nature Machine Intelligence 2025] Emulating Human-like Adaptive Vision for Efficient and Flexible Machine Visual Perception
-
zhoubohan0/NOLO - [IROS 2025 oral] Official implementation of NOLO: Navigate Only Look Once
-
zhaozijie2022/m3w-marl - Official implementation of the paper "Learning and Planning Multi-Agent Tasks via a MoE-based World Model"
-
MrZihan/Dynam3D - Official implementation of "Dynam3D: Dynamic Layered 3D Tokens Empower VLM for Vision-and-Language Navigation" (NeurIPS'25 Oral)
-
facebookresearch/Online-3DGS-Monocular - Code repo for the SIGGRAPH paper "Monocular Online Reconstruction with Enhanced Detail Preservation". Project page https//poiw.github.io/MODP/index.html
-
wsakobe/TrackVLA - [CoRL 2025] Repository relating to "TrackVLA: Embodied Visual Tracking in the Wild"
-
unified-force/UniFP - CoRL2025 UniFP: Learning a Unified Policy for Position and Force Control in Legged Loco-Manipulation
-
666ghj/BettaFish - 微舆:人人可用的多Agent舆情分析助手,打破信息茧房,还原舆情原貌,预测未来走向,辅助决策!从0实现,不依赖任何框架。
-
worldbench/3EED - [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D
-
DAVIAN-Robotics/ACG - Code for "ACG: Action Coherence Guidance for Flow-based Vision-Language-Action Models" (ICRA 2026)
-
newton-physics/newton - An open-source, GPU-accelerated physics simulation engine built upon NVIDIA Warp, specifically targeting roboticists and simulation researchers.
-
Ma-Zhuang/OmniNWM - OmniNWM: Omniscient Navigation World Models for Autonomous Driving
-
cshizhe/VLN-DUET - Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).
-
lovelyyoshino/VLFM-Commit - 适配CUDA11.8、habitat-sim0.2.4版本的VLFM,并给出详细的代码理解注释
-
Fudan-MAGIC-Lab/VINGS-Mono - Source code for [TRO2025] VINGS-Mono: Visual Inertial Gaussian Splatting Monocular SLAM in Large Scenes.
-
deepseek-ai/DeepSeek-OCR - Contexts Optical Compression
-
aubingazhib/LightGlueStick - a Fast and Robust Glue for Joint Point-Line Matching
-
ReinFlow/ReinFlow - [NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., pi0, pi0.5. Fully open-sourced.
-
woyut/NavQ_ICCV25 - Implementation of "NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation" (ICCV 2025)
-
physical-superintelligence-lab/Humanoid-Everyday - Humanoid dataset for learning
-
NHirose/OmniVLA - Official repository for OmniVLA training and inference code
-
IRMVLab/I2PNet - [TRO 2025] Codes for "End-to-end 2D-3D Registration between Image and LiDAR Point Cloud for Vehicle Localization"
-
MobiusLqm/MoDGS - Official Implementation of paper accepted by ICLR2025-MoDGS: Dynamic Gaussian Splatting from Casually-captured Monocular Videos with Depth Priors
-
xieyuser/UniGS - Unified Geometry-Aware Gaussian Splatting for Multimodal Rendering
-
imlixinyang/FlashWorld - Code for "FlashWorld: High-quality 3D Scene Generation within Seconds" (ICLR 2026 Oral)
-
OpenHelix-Team/Spatial-Forcing - Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]
-
starVLA/starVLA - StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
-
jzhzhang/Uni-NaVid - [RSS 2025] Uni-NaVid: A Video-based Vision-Language-Action Model for Unifying Embodied Navigation Tasks.
-
YanjieZe/awesome-humanoid-robot-learning - A Paper List for Humanoid Robot Learning.
-
karpathy/nanochat - The best ChatGPT that $100 can buy.
-
Inception3D/TTT3R - A simple state update rule to enhance length generalization for CUT3R
-
Zxy-MLlab/LIBERO-PRO - LIBERO-PRO is the official repository of the LIBERO-PRO — an evaluation extension of the original LIBERO benchmark
-
Eku127/habitat-data-collector - Habitat-based tools for dynamic arrangement and data recording
-
geyan21/ManiFlow_Policy - [CoRL 2025] ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training
-
MIV-XJTU/JanusVLN - [ICLR2026] Official implementation for "JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation"
-
WECENG/ticket-purchase - 大麦自动抢票,支持人员、城市、日期场次、价格选择
-
fscdc/RewardMap - [ICLR 2026] RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
-
OpenHelix-Team/VLA-RFT - VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning
-
luohongk/Embodied-AI-Daily - 📚这个仓库是在arxiv上收集的有关VLN,VLA,World Model,SLAM,Gaussian Splatting,非线性优化等相关论文。每天都会自动更新!issue区域是最新10篇论文
-
Tsinghua-MARS-Lab/SLAM-Former - SLAM-Former: Putting SLAM into One Transformer
-
jmanhype/vggt-mps - VGGT 3D Vision Agent optimized for Apple Silicon with Metal Performance Shaders
-
AIGeeksGroup/Nav-R1 - Nav-R1: Reasoning and Navigation in Embodied Scenes
-
Alibaba-NLP/DeepResearch - Tongyi Deep Research, the Leading Open-source Deep Research Agent
-
InternRobotics/InternVLA-A1 - InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
-
BIT-DYN/omnimap - [TRO 2025] OmniMap: A General Mapping Framework Integrating Optics, Geometry, and Semantics
-
InternRobotics/InternVLA-M1 - InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
-
facebookresearch/map-anything - MapAnything: Universal Feed-Forward Metric 3D Reconstruction
-
RUC-NLPIR/FlashRAG - ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
-
Vid2Sim/Vid2Sim - [CVPR 25] Vid2Sim: Realistic and Interactive Simulation from Video for Urban Navigation
-
manycore-research/SpatialGen - [3DV 2026] SpatialGen: Layout-guided 3D Indoor Scene Generation
-
PRIME-RL/SimpleVLA-RL - [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
-
NJU-3DV/SpatialVID - [CVPR 2026] SpatialVID: A Large-Scale Video Dataset with Spatial Annotations
-
AIGeeksGroup/3D-R1 - 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding
-
mystorm16/FastVGGT - [ICLR 2026] FastVGGT: Fast Visual Geometry Transformer
-
vllm-project/vllm - A high-throughput and memory-efficient inference and serving engine for LLMs
-
zhangganlin/vista-slam - [3DV 2026] ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association
-
stepfun-ai/Step-Audio2 - Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
-
OpenHelix-Team/LLaVA-VLA - LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]
-
JiuTian-VL/CogVLA - [NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification
-
Heathcliff-saku/BSC-Nav - This repository is the official implementation of our paper (From reactive to cognitive: brain-inspired spatial intelligence for embodied agents)
-
LetheSec/HuggingFace-Download-Accelerator - 利用HuggingFace的官方下载工具从镜像网站进行高速下载。
-
vuer-ai/vuer - Vuer is a 3D visualization tool for robotics and VR applications.
-
Tencent-Hunyuan/Hunyuan-GameCraft-1.0 - Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition
-
Tokishx/DifNav - This is the source code to paper “DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation”.
-
cvg/FrontierNet - [RA-L 2025] FrontierNet: Learning Visual Cues to Explore
-
CrystalSixone/VLN_CLASH - This is the official repository for VLN-CLASH.
-
sgl-project/sglang - SGLang is a high-performance serving framework for large language models and multimodal models.
-
OpenGalaxea/GalaxeaVLA - Galaxea's open-source VLA repository
-
haotian-liu/LLaVA - [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
-
Stanford-TML/HEAD_rl_deploy - Official implementation of HEAD CoRL 2025
-
Zhoues/RoboRefer - [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"
-
openai/gpt-oss - gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
-
GuHuangAI/LaDiWM - code for CoRL2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"
-
unique1i/SceneSplat - [ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining
-
wangyr22/DepthGS - Official implementation of IROS 2025 paper Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline
-
sapientinc/HRM - Hierarchical Reasoning Model Official Release
-
dfki-ric/better_launch - A better replacement for the ROS2 launch system: intuitive, simple, memorable.
-
maturk/dn-splatter - DN-Splatter + AGS-Mesh: Depth and Normal Priors for Gaussian Splatting
-
Tencent-Hunyuan/HunyuanWorld-1.0 - Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
-
Feliciaxyao/NavMorph - Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25).
-
NVlabs/Long-RL - Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
-
RayFronts/RayFronts - [IROS'25] Source code for "RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration"
-
ShaohonChen/Qwen3-SmVL - 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调
-
yyfz/Pi3 - [ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
-
wzzheng/StreamVGGT - [ICLR 2026] Streaming 4D Visual Geometry Transformer
-
leandro-svg/HybridTrack - [RA-L25/ICRA26] HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking
-
wencan25/Fast3D - [ACM MM 2025] Fast3D: Accelerating 3D Multi-modal Large Language Models for Efficient 3D Scene Understanding
-
Selen-Suyue/MBA - [RA-L 2025 & ICRA 2026] 😽 Motion Before Action: Diffusing Object Motion as Manipulation Condition
-
lisj575/GaussianUDF - Code Release for CVPR (2025), "GaussianUDF: Inferring Unsigned Distance Functions through 3D Gaussian Splatting"
-
facebookresearch/hydra - Hydra is a framework for elegantly configuring complex applications
-
DengKaiCQ/VGGT-Long - Official implement of VGGT-Long
-
Zhangwenyao1/DreamVLA - [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
-
Sirui-Xu/InterMimic - [CVPR 2025 Highlight] InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions
-
HorizonRobotics/EmbodiedGen - Towards a Generative 3D World Engine for Embodied Intelligence
-
yang-zj1026/legged-loco - Low-level locomotion policy training in Isaac Lab
-
NVlabs/VILA - VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
-
bytedance/F-16 - F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electronic Engineering at Tsinghua University and ByteDance.
-
InternRobotics/StreamVLN - [ICRA 2026] Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
-
AnjieCheng/NaVILA - [RSS'25] This repository is the implementation of "NaVILA: Legged Robot Vision-Language-Action Model for Navigation"
-
LiteReality/LiteReality - [NeurIPS 2025] LiteReality: Graphics-Ready 3D Scene Reconstruction from RGB-D Scans
-
WHU-USI3DV/PatchAugNet - PatchAugNet: Patch feature augmentation-based heterogeneous point cloud place recognition in large-scale street scenes
-
InternRobotics/AnySplat - [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
-
OpenGVLab/InternVL - [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
-
FlagOpen/RoboBrain2.5 - RoboBrain 2.5: Advanced version of RoboBrain. Depth in Sight, Time in Mind. 🎉🎉🎉
-
unitreerobotics/unitree_rl_lab - This is a repository for reinforcement learning implementation for Unitree robots, based on IsaacLab.
-
THU-SI/LangScene-X - [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
-
OpenDriveLab/DetAny3D - [ICCV 2025] Detect Anything 3D in the Wild
-
avlmaps/AVLMaps - [ISER 2023] The official implementation of Audio Visual Language Maps for Robot Navigation
-
google-research/valan - Vision and Language Agent Navigation
-
InternRobotics/CronusVLA - [AAAI26 oral] CronusVLA: Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
-
zst1406217/VR-Robo - [RA-L 2025] VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion
-
hovsg/HOV-SG - [RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"
-
aau-cns/radar_transformer - Transformer-based deep learning architecture for 3D point matching in sparse radar point clouds
-
iMoonLab/yolov13 - Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".
-
nianticlabs/marepo - [CVPR 2024 Highlight] Map-Relative Pose Regression for Visual Re-Localization
-
ut-amrl/creste_public - [RSS 2025] CREStE: Scalable Mapless Navigation with Internet Scale Priors and Counterfactual Guidance
-
JohannaXie/GauSS-MI - [RSS 2025] GauSS-MI: Gaussian Splatting Shannon Mutual Information for Active 3D Reconstruction
-
hnuzhy/YOTO - [RSS2025] Code for my paper "You Only Teach Once: Learn One-Shot Bimanual Robotic Manipulation from Video Demonstrations"
-
Qi-Zhangyang/GPT4Scene-and-VLN-R1 - GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models
-
tsinghua-fib-lab/Mem4Nav - Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System
-
PRBonn/PINGS - 📌 PINGS: Gaussian Splatting Meets Distance Fields within a Point-Based Implicit Neural Map [RSS' 25]
-
Tencent/DepthCrafter - [CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
-
siyuhsu/vla-cache - [NeurIPS 2025] VLA-Cache: Efficient Vision-Language-Action Manipulation via Adaptive Token Caching
-
LMCache/LMCache - Supercharge Your LLM with the Fastest KV Cache Layer
-
openai/openai-cs-agents-demo - Demo of a customer service use case implemented with the OpenAI Agents SDK
-
3DTopia/MaterialAnything - [CVPR 2025 Highlight] Material Anything: Generating Materials for Any 3D Object via Diffusion
-
LeCAR-Lab/ASAP - [RSS 2025] "ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills"
-
zhaihongjia/PanoGS - [CVPR 2025] PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding
-
GeeeekExplorer/nano-vllm - Nano vLLM
-
Fediory/HVI-CIDNet - [CVPR2025 && NTIRE2025] HVI: A New Color Space for Low-light Image Enhancement (Official Implementation)
-
isaac-sim/IsaacSim - NVIDIA Isaac Sim™ is an open-source application on NVIDIA Omniverse for developing, simulating, and testing AI-driven robots in realistic virtual environments.
-
jzhzhang/3DAwareNav - [CVPR 2023] We propose a framework for the challenging 3D-aware ObjectNav based on two straightforward sub-policies. The two sub-polices, namely corner-guided exploration policy and category-aware identification policy, simultaneously perform by utilizing online fused 3D points as observation.
-
buaa-colalab/OctoNav-R1 - Code for OctoNav-Bench and OctoNav-R1
-
Fanqi-Lin/OneTwoVLA - Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"
-
realcrane/3D-student-splatting-and-scooping - This is the source code of our CVPR 2025 Best Paper Honourable Mention paper: 3D Student Splatting and Scooping
-
microsoft/qlib - Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, including supervised learning, market dynamics modeling, and RL, and is now equipped with https://github.com/microsoft/RD-Agent to automate R&D process.
-
agent0ai/agent-zero - Agent Zero AI framework
-
Shubhamsaboo/awesome-llm-apps - Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
-
facebookresearch/habitat-lab - A modular high-level library to train embodied AI agents across a variety of tasks and environments.
-
karpathy/minGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
-
allenzren/open-pi-zero - Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
-
Physical-Intelligence/real-time-chunking-kinetix - Simulated experiments for "Real-Time Execution of Action Chunking Flow Policies".
-
B0B8K1ng/WMNavigation - [IROS'25 Oral] WMNav: Integrating Vision-Language Models into World Models for Object Goal Navigation
-
nunchaku-ai/nunchaku - [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
-
InternRobotics/NavDP - Official implementation of the paper: "NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance"
-
GeWu-Lab/AnyTouch - The repo for "AnyTouch: Learning Unified Static-Dynamic Representation across Multiple Visuo-tactile Sensors", ICLR 2025
-
JIA-Lab-research/LISA - Project Page for "LISA: Reasoning Segmentation via Large Language Model"
-
InternRobotics/InternUtopia - A simulation platform for versatile Embodied AI research and developments.
-
JunweiLiang/awesome_lists - Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)
-
Zeying-Gong/Falcon - Official Code for "From Cognition to Precognition: A Future-Aware Framework for Social Navigation" (ICRA 2025)
-
Zeying-Gong/ascent - [RAL‘26] Stairway to Success: An Online Floor-Aware Zero-Shot Object-Goal Navigation Framework via LLM-Driven Coarse-to-Fine Exploration
-
GradientSpaces/Rectified-Point-Flow - [NeurIPS 2025, Spotlight] Rectified Point Flow: Generic Point Cloud Pose Estimation
-
THU-SI/Spatial-MLLM - [NeurIPS 2025] Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
-
openvla/openvla - OpenVLA: An open-source vision-language-action model for robotic manipulation.
-
Eku127/DualMap - [RAL-25] An online open-vocabulary mapping system that enables natural language querying to navigate dynamic scenes, with ROS support.
-
MIT-SPARK/VGGT-SLAM - VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold
-
SunYangtian/UniGeo - UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation
-
Zie619/n8n-workflows - all of the workflows of n8n i could find (also from the site itself)
-
Paper2Poster/Paper2Poster - [NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
-
resemble-ai/chatterbox - SoTA open-source TTS
-
VITA-Group/VLM-3R - [CVPR 2026] VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
-
Fosowl/agenticSeek - Fully Local Manus AI. No APIs, No $200 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity. 🔔 Official updates only via twitter @Martin993886460 (Beware of fake account)
-
YanyuanQiao/Open-Nav - [ICRA 2025] Official implementation of Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
-
facebookresearch/habitat-challenge - Code for the habitat challenge
-
DreamTechAI/Direct3D-S2 - [NeurIPS 2025] Direct3D‑S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
-
AILab-CVC/YOLO-World - [CVPR 2024] Real-Time Open-Vocabulary Object Detection
-
GengzeZhou/NavGPT-2 - [ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
-
hojonathanho/diffusion - Denoising Diffusion Probabilistic Models
-
AUTOMATIC1111/stable-diffusion-webui - Stable Diffusion web UI
-
hanruihua/neupan_ros - ROS Wrapper of NeuPAN planner
-
AgibotTech/agibot_x1_train - The reinforcement learning training code for AgiBot X1.
-
siyuanliii/masa - Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
-
OpenDriveLab/UniVLA - [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions
-
JiuhaiChen/BLIP3o - Official implementation of BLIP3o-Series
-
apple/ml-fastvlm - This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
-
xming521/WeClone - 🚀 One-stop solution for creating your AI twin from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. 从聊天记录创造数字分身的一站式解决方案
-
bagh2178/UniGoal - [CVPR 2025] UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
-
IDEA-Research/GroundingDINO - [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
-
harry0703/MoneyPrinterTurbo - 利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
-
QitaoZhao/DiffusionSfM - [CVPR 2025] "DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion" official implementation.
-
Brummi/anycam - Official repository for "AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos" (CVPR 2025)
-
lyp-deeplearning/LiftFeat - Code for "LiftFeat: 3D Geometry-Aware Local Feature Matching", ICRA2025
-
huggingface/nanoVLM - The simplest, fastest repository for training/finetuning small-sized VLMs.
-
gradslam/gradslam - gradslam is an open source differentiable dense SLAM library for PyTorch
-
MrZihan/GridMM - Official implementation of GridMM: Grid Memory Map for Vision-and-Language Navigation (ICCV'23).
-
liangpan99/TokenHSI - [CVPR 2025 Oral] TokenHSI: Unified Synthesis of Physical Human-Scene Interactions through Task Tokenization
-
MrZihan/HNR-VLN - Official implementation of Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation (CVPR'24 Highlight).
-
natolambert/rlhf-book - Textbook on reinforcement learning from human feedback
-
lllyasviel/FramePack - Lets make video diffusion practical!
-
DefaultRui/BEV-Scene-Graph - [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
-
chen-judge/MapGPT - [ACL 24] The official implementation of MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation.
-
FarInHeight/To-Match-or-Not-to-Match - Official code for "To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition" CVPR IMW 2025
-
amaralibey/MixVPR - MixVPR: Feature Mixing for Visual Place Recognition (WACV 2023)
-
amaralibey/gsv-cities - GSV-Cities: a large-scale dataset for visual place recognition
-
jiangxinke/Agentic-RAG-R1 - Agentic RAG R1 Framework via Reinforcement Learning
-
NVIDIA-AI-IOT/ros2_nanollm - ROS2 nodes for LLM, VLM, VLA
-
ZiYang-xie/WorldGen - 🌍 WorldGen - Generate Any 3D Scene in Seconds
-
facebookresearch/flow_matching - A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
-
ybgdgh/L3MVN - Leveraging Large Language Models for Visual Target Navigation
-
ibaiGorordo/vggt-pytorch-inference - Repository for running the VGGT model in PyTorch
-
facebookresearch/nwm - Official code for the CVPR 2025 paper "Navigation World Models".
-
DefaultRui/VLN-VER - [CVPR24] Volumetric Environment Representation for Vision-Language Navigation
-
EricTan7/RAM - [CVPR2025] Official implementation of RAM
-
Jirl-upenn/VLMnav - End-to-End Navigation with VLMs
-
lllyasviel/ControlNet - Let us control diffusion models!
-
naokiyokoyama/ovon - Open Vocabulary Object Navigation
-
bdaiinstitute/vlfm - The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)
-
isaac-sim/IsaacLab - Unified framework for robot learning built on NVIDIA Isaac Sim
-
NVlabs/HOVER - HOVER
-
cvlab-kaist/ZeroCo - CVPR 2025 (Highlight) : Official implementation of "Cross-View Completion Models are Zero-shot Correspondence Estimators"
-
SWE-agent/SWE-agent - SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]
-
KTH-RPL/OneMap - [ICRA'25] One Map to Find Them All: Real-time Open-Vocabulary Mapping for Zero-shot Multi-Object Navigation
-
isri-aist/RoboManipBaselines - A software framework integrating various imitation learning methods and benchmark environments for robotic manipulation
-
AlbertoJaenal/MapAbstractionVPR - Implementation for Image database abstracion
-
THU-SI/VideoScene - [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
-
subframe7536/maple-font - Maple Mono: Open source monospace font with round corner, ligatures and Nerd-Font icons for IDE and terminal, fine-grained customization options. 带连字和控制台图标的圆角等宽字体,中英文宽度完美2:1,细粒度的自定义选项
-
sintel-dev/Orion - Unsupervised time series anomaly detection library
-
lus6-Jenny/RINGSharp - [IEEE T-RO 2025] RING#: PR-by-PE Global Localization with Roto-translation Equivariant Gram Learning.
-
yuliangguo/depth_any_camera - [CVPR 2025] Depth Any Camera: Zero-Shot Metric Depth Estimation from Any Camera
-
SpatialVLA/SpatialVLA - 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.
-
apple/ml-matrix3d - [CVPR 2025 Highlight] Matrix3D: Large Photogrammetry Model All-in-One
-
FlagOpen/RoboBrain - [CVPR 2025] RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. Official Repository.
-
arajv/SayNav - Grounding Large Language Models for Dynamic Planning to Navigation in New Environments
-
BAAI-DCAI/SpatialBot - The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.
-
facebookresearch/RAM - A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
-
honghd16/GSA-VLN - Official repository of General Scene Adaptation for Vision-and-Language Navigation (ICLR'2025)
-
MarSaKi/ETPNav - [TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"
-
CrystalSixone/VLN-GOAT - Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)
-
GAMMA-UMD-Outdoor-Navigation/BehAV - BehAV: Behavioral Rule Guided Autonomy Using VLM for Robot Navigation in Outdoor Scenes (ICRA'25)
-
dillonloh/AdaVLN - IsaacSim Extension for Dynamic Objects in Matterport3D Environments for AdaVLN research
-
vlmaps/vlmaps - [ICRA2023] Implementation of Visual Language Maps for Robot Navigation
-
GradientSpaces/WildGS-SLAM - [CVPR 2025] WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
-
SakanaAI/AI-Scientist-v2 - The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
-
zd11024/NaviLLM - [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'
-
LlamaFamily/Llama-Chinese - Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
-
RoboVerseOrg/RoboVerse - RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
-
yuancaimaiyi/collaborationSfM - 众包SfM
-
hanruihua/NeuPAN - [TRO 2025] NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning.
-
NVIDIAGameWorks/kaolin - A PyTorch Library for Accelerating 3D Deep Learning Research
-
wyf3/llm_related - 复现大模型相关算法及一些学习记录
-
rvp-group/Splat-LOAM - [ICCV 25] 2D Gaussian Splatting based LiDAR Odometry And Mapping
-
MAC-VO/MAC-VO - [ICRA 2025 Best Paper] MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry
-
ffrivera0/reloc3r - [CVPR 2025] Relative camera pose estimation and visual localization with Reloc3r
-
lpiccinelli-eth/UniK3D - [CVPR 2025] UniK3D: Universal Camera Monocular 3D Estimation
-
yzqin/dexmv-sim - DexMV: Imitation Learning for Dexterous Manipulation from Human Videos, ECCV 2022
-
LSXI7/MINIMA - [CVPR 2025] MINIMA: Modality Invariant Image Matching
-
om-ai-lab/VLM-R1 - Solve Visual Understanding with Reinforced VLMs
-
shengjun-zhang/GGN - [NeurIPS 2024] Gaussian Graph Network: Learning Efficient and Generalizable Gaussian Representations from Multi-view Images
-
mindverse/Second-Me - Train your AI self, amplify you, bridge the world
-
yanyan-li/4DGS-SLAM - Instead of removing dynamic objects as distractors and reconstructing only static environments, this paper proposes an efficient architecture that incrementally tracks camera poses and establishes the 4D Gaussian radiance fields in unknown scenarios by using a sequence of RGB-D images.
-
manycore-research/SpatialLM - [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
-
nv-tlabs/3dgrut - Ray tracing and hybrid rasterization of Gaussian particles
-
nianticlabs/ace - [CVPR 2023 - Highlight] Accelerated Coordinate Encoding (ACE): Learning to Relocalize in Minutes using RGB and Poses
-
sunfanyunn/LayoutVLM - Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025)
-
roomtour3d/roomtour3d-NaviLLM - [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation
-
open-mmlab/OpenPCDet - OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
-
PengYu-Team/GEODE_dataset - Extending the Robustness of LiDAR SLAM to Geometrically Degenerate Scenarios
-
PRBonn/kiss-slam - A LiDAR SLAM system that just works
-
VSLAM-LAB/VSLAM-LAB - A Comprehensive Framework for Visual SLAM Systems and Datasets
-
HCI-LMC/VLN-SUSA - [AAAI 2026] Official code for "Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation"
-
QVPR/Patch-NetVLAD - Code for the CVPR2021 paper "Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition"
-
Xiaoming-Zhao/PointNav-VO - [ICCV 2021] Official implementation of "The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation"
-
jiachenzhu/DyT - Code release for DynamicTanh (DyT)
-
HKUDS/AI-Researcher - [NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
-
facebookresearch/vggt - [CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
-
LeeBY68/Hier-SLAM - 🌳 [ICRA'25] Hier-SLAM: Semantic Gaussian Splatting SLAM with Hierarchical Categorical Representation
-
graphdeco-inria/hierarchical-3d-gaussians - Official implementation of the SIGGRAPH 2024 paper "A Hierarchical 3D Gaussian Representation for Real-Time Rendering of Very Large Datasets"
-
ali-vilab/MangaNinjia - [CVPR 2025 Highlight] Official implementation of "MangaNinja: Line Art Colorization with Precise Reference Following"
-
FoundationVision/GLEE - [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
-
InternRobotics/EmbodiedScan - [CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
-
ikaijua/Awesome-AITools - Collection of AI-related utilities. Welcome to submit pull requests /收藏AI相关的实用工具,欢迎提交pull requests
-
NVlabs/FoundationStereo - [CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
-
THU-MIG/yoloe - YOLOE: Real-Time Seeing Anything [ICCV 2025]
-
NVlabs/curobo - CUDA Accelerated Robot Library
-
whu-lyh/SaliencyI2PLoc - Official code of SaliencyI2PLoc
-
robot-learning-freiburg/LCDNet - PyTorch code for training LCDNet for loop closure detection in LiDAR SLAM. http://rl.uni-freiburg.de/research/lidar-slam-lc
-
MarSaKi/VLN-BEVBert - [ICCV 2023} Official repo of "BEVBert: Multimodal Map Pre-training for Language-guided Navigation"
-
crepuscularlight/SemanticLoopClosure - Master thesis regarding semantic loop closure
-
Ghiara/LEGION - Official implementation of paper on Nature Machine Intelligence: "Preserving and Combining Knowledge in Robotic Lifelong Reinforcement Learning"
-
OpenHands/OpenHands - 🙌 OpenHands: AI-Driven Development
-
Zhefan-Xu/isaac-go2-ros2 - Unitree Go2 simulation platform for testing navigation, decision-making and autonomous tasks. (NVIDIA Isaac/ROS2)
-
YicongHong/Recurrent-VLN-BERT - Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
-
gaoxiangjun/Mani-GS - [CVPR' 2025'] Mani-GS: Gaussian Splatting Manipulation with Triangular Mesh
-
convexsplatting/convex-splatting - [CVPR 2025 - Highlight] Original implementation of "3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes"
-
jzhzhang/NaVid-VLN-CE - [RSS 2024 & RSS 2025] VLN-CE evaluation code of NaVid and Uni-NaVid
-
jacobkrantz/VLN-CE - Vision-and-Language Navigation in Continuous Environments using Habitat
-
JeffLIrion/python-graphslam - Graph SLAM solver in Python
-
Stability-AI/generative-models - Generative Models by Stability AI
-
vdorbala/LGX - Code for LGX (Language Guided Exploration). We use LLMs to perform embodied robot navigation in a zero-shot manner.
-
PKU-VCL-3DV/SLAM3R - [CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R
-
fanegg/Feat2GS - [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting
-
MrZihan/Sim2Real-VLN-3DFF - Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).
-
csiro-robotics/Pair-VPR - [IEEE RA-L 2025] The official repository for Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers
-
facebookresearch/fast3r - [CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
-
XiaohanLei/GaussNav - PyTorch implementation of paper: GaussNav: Gaussian Splatting for Visual Navigation
-
dmar-bonn/active-gs - [RA-L2025] ActiveGS: Active Scene Reconstruction Using Gaussian Splatting
-
WU-CVGL/Omni-Scene - [CVPR2025] Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction
-
liw95/LightLoc - [CVPR2025] LightLoc: Learning Outdoor LiDAR Localization at Light Speed
-
hzxie/GaussianCity - The official implementation of "GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation". (CVPR 2025)
-
showlab/ShowUI - [CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
-
iris0329/SeeGround - [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
-
pengwangucla/DeLS-3D - The code for DeLS-3D of CVPR 2018
-
rpng/calc - Convolutional Autoencoder for Loop Closure
-
CASIA-LMC-Lab/FastSAM - Fast Segment Anything
-
HKUST-Aerial-Robotics/SG-Reg - [T-RO 2025] SG-Reg: Generalizable and Efficient Scene Graph Registration
-
rmurai0610/MASt3R-SLAM - [CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
-
xuxw98/ESAM - [ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
-
GeLuzhou/Dynamic-GSG - [IROS 25] Dynamic 3D Gaussian Scene Graphs for Environment Adaptation
-
sair-lab/AirCode - [RA-L 2022] AirCode: A Robust Object Encoding Method
-
jingyaogong/minimind - 🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
-
url-kaist/MambaGlue - MambaGlue: Fast and Robust Local Feature Matching With Mamba @ ICRA'25
-
fraunhoferhhi/AT-GS - Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction
-
sunsmarterjie/yolov12 - [NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors
-
HKUDS/GraphGPT - [SIGIR'2024] "GraphGPT: Graph Instruction Tuning for Large Language Models"
-
luigifreda/pyslam - pySLAM is a hybrid Python/C++ Visual SLAM pipeline supporting monocular, stereo, and RGB-D cameras. It provides a broad set of modern local and global feature extractors, multiple loop-closure strategies, a volumetric reconstruction module, integrated depth-prediction models, and semantic segmentation capabilities for enhanced scene understanding.
-
wangyizhao/PRIOR-SLAM - PRIOR-SLAM: Enabling Visual SLAM for Loop Closure under Large Viewpoint Variations
-
Vision-CAIR/MiniGPT-4 - Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
-
cvg/limap - A toolbox for mapping and localization with line features.
-
BJHYZJ/DovSG - [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation
-
yang-zj1026/NaVILA-Bench - Vision-Language Navigation Benchmark in Isaac Lab
-
CUT3R/CUT3R - Official implementation of Continuous 3D Perception Model with Persistent State
-
QwenLM/Qwen3 - Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
-
huggingface/open-r1 - Fully open reproduction of DeepSeek-R1
-
fudan-zvg/DG-SLAM - [NeurIPS 2024] DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization
-
open-webui/open-webui - User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
-
deepseek-ai/DeepSeek-Coder - DeepSeek Coder: Let the Code Write Itself
-
Irvingao/Point-DETR3D - [AAAI 2024] Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-Supervised 3D Object Detection
-
HaoyiZhu/SPA - [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
-
zezhishao/DailyArXiv - Daily ArXiv Papers.
-
microsoft/MoGe - [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
-
NVlabs/InstantSplat - InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds
-
Nanne/pytorch-NetVlad - Pytorch implementation of NetVlad including training on Pittsburgh.
-
VITA-Group/MM3DGS-SLAM - [IROS 2024] MM3DGS SLAM: Multi-modal 3D Gaussian Splatting for SLAM Using Vision, Depth, and Inertial Measurements
-
hmz-15/Interactive-Predicate-Learning - InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning (RSS 2024)
-
google-deepmind/mujoco_menagerie - A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.
-
GarlanLou/LF-GNSS - LF-GNSS: A Fundamental Framework for Exploring Learning and Filtering Integration in GNSS
-
Aceinna/gnss-ins-sim - Open-source GNSS + inertial navigation, sensor fusion simulator. Motion trajectory generator, sensor models, and navigation
-
modelscope/FunClip - Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
-
naver/mast3r - Grounding Image Matching in 3D with MASt3R
-
opendilab/LightZero - [NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
-
naver/dust3r - DUSt3R: Geometric 3D Vision Made Easy
-
OpenDriveLab/AgiBot-World - [IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
-
facebookresearch/DiT - Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
-
RoboTwin-Platform/RoboTwin - RoboTwin 2.0 Offical Repo
-
YvanYin/Metric3D - The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
-
modelscope/ms-agent - MS-Agent: a lightweight framework to empower agentic execution of complex tasks
-
Genesis-Embodied-AI/Genesis - A generative world for general-purpose robotics & embodied AI learning.
-
ZexinHe/Neural-LightRig - [CVPR2025] Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion
-
gramuah/ros4vsn - Evaluation of Visual Semantic Navigation Models in Real Robots
-
noodle-lab/GaussianSpa - Project website: https://noodle-lab.github.io/gaussianspa/
-
PDFMathTranslate/PDFMathTranslate - [EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero
-
myhhub/stock - stock股票.获取股票数据,计算股票指标,筹码分布,识别股票形态,综合选股,选股策略,股票验证回测,股票自动交易,支持PC及移动设备。
-
PKU-YuanGroup/Open-Sora-Plan - This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
-
hustvl/DiffusionDrive - [CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving
-
ispc-lab/HRegNet - [ICCV 2021] HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration
-
nerfstudio-project/gsplat - CUDA accelerated rasterization of gaussian splatting
-
ranahanocka/point2mesh - Reconstruct Watertight Meshes from Point Clouds [SIGGRAPH 2020]
-
TianxingChen/G3Flow - [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation
-
TheBlewish/Automated-AI-Web-Researcher-Ollama - A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from various relevant websites and do research for you all on its own! And more, not limited to but including saving the findings for you!
-
facebookresearch/neuralfeels - Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation
-
ori-drs/oxford_spires_dataset - [IJRR 2025] Lidar-visual dataset with ground truth 3D map for SLAM/NeRF
-
blazzbyte/OpenInterpreterUI - Simplify code execution with Open Interpreter UI Project with Streamlit. A user-friendly GUI for Python, JavaScript, and more. Pay-as-you-go, no subscriptions. Ideal for beginners.
-
ChenYutongTHU/SplatFormer - [ICLR' 25] SplatFormer: Point Transformer for Robust 3D Gaussian Splatting
-
akawincent/ZED-data-collector - In this project, ZED camera is used to extract image, IMU, pose data and convert them into a dataset format as ground truth for evaluation of other SLAM systems
-
Parskatt/RoMa - [CVPR 2024] RoMa: Robust Dense Feature Matching; RoMa is the robust dense feature matcher capable of estimating pixel-dense warps and reliable certainties for almost any image pair.
-
microsoft/autogen - A programming framework for agentic AI
-
KwanWaiPang/Gaussian-SLAM_comment - Gaussian-SLAM的中文注释
-
open-mmlab/mmdetection - OpenMMLab Detection Toolbox and Benchmark
-
InternRobotics/VLM-Grounder - [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
-
VladimirYugay/Gaussian-SLAM - Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting
-
MisEty/RTG-SLAM - RTG-SLAM: Real-time 3D Reconstruction at Scale Using Gaussian Splatting (ACM SIGGRAPH 2024)
-
cvg/NoPoSplat - [ICLR'25 Oral] No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
-
megvii-research/MCTrack - [IROS2025]This is the offical implementation of the paper "MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving"
-
nv-tlabs/SCube - [NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
-
ChenHoy/DROID-Splat - End-to-End SLAM with camera calibration, monocular prior integration and dense Rendering
-
openinterpreter/open-interpreter - A natural language interface for computers
-
robot-learning-freiburg/CL-SLAM - Continual SLAM: Beyond Lifelong Simultaneous Localization and Mapping through Continual Learning. http://continual-slam.cs.uni-freiburg.de
-
ywyeli/Place3D - [NeurIPS'24 Spotlight] Is Your LiDAR Placement Optimized for 3D Scene Understanding?
-
TommyZihao/openvino_tonypi - 基于OpenVINO,本地部署大模型智能体Agent,控制TonyPi人形机器人
-
donydchen/mvsplat - 🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
-
songw-zju/LiDAR2Map - The official implementation of "LiDAR2Map: In Defense of LiDAR-Based Semantic Map Construction Using Online Camera Distillation" (CVPR 2023)
-
zhaihongjia/SplatLoc - [TVCG 2025] SplatLoc: 3D Gaussian Splatting-based Visual Localization for Augmented Reality
-
NVIDIA/TensorRT-LLM - TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.
-
NanmiCoder/MediaCrawler - 小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫
-
microsoft/BitNet - Official inference framework for 1-bit LLMs
-
hkchengrex/Cutie - [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
-
520xyxyzq/3DGS-CD - 3DGS-based change detection for physical object rearrangement
-
facebookresearch/lingua - Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
-
google/nerfies - This is the code for Deformable Neural Radiance Fields, a.k.a. Nerfies.
-
Owen718/LongPrompt-LLamaGen - This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompts. And it's also powered by additional prompt refining features for improved performance.
-
openai/improved-diffusion - Release for Improved Denoising Diffusion Probabilistic Models
-
RuijieZhu94/MotionGS - [NeurIPS 2024] MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting
-
hzy46/Deep-Learning-21-Examples - 《21个项目玩转深度学习———基于TensorFlow的实践详解》配套代码
-
StanfordVL/3DSceneGraph - The data skeleton from "3D Scene Graph: A Structure for Unified Semantics, 3D Space, and Camera" http://3dscenegraph.stanford.edu
-
cvg/depthsplat - [CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth
-
HKUDS/LightRAG - [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
-
Nightmare-n/DepthAnyVideo - Depth Any Video with Scalable Synthetic Data (ICLR 2025)
-
linyicheng1/EdgePoint - EdgePoint: Learning Efficient Keypoint Extraction and Description for Edge Devices
-
minwoo0611/HeLiOS - [ICRA2025] HeLiOS: Heterogeneous LiDAR Place Recognition
-
uzh-rpg/bflow - Official implementation of "Dense Continuous-Time Optical Flow from Event Cameras"
-
VITA-Group/LightGaussian - [NeurIPS 2024 Spotlight]"LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang
-
uzh-rpg/deep_ev_tracker - Repository relating to "Data-driven Feature Tracking for Event Cameras" (CVPR, 2023, Award Candidate) and "Data-driven Feature Tracking for Event Cameras with and without Frames" (T-PAMI 2025)
-
IRMVLab/DVLO - [ECCV 2024 Oral] DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment
-
QiZS-BIT/GSPR - [IEEE IROS'25] GSPR: Multimodal Place Recognition using 3D Gaussian Splatting for Autonomous Driving
-
hustvl/osp - [ECCV 2024] Occupancy as Set of Points
-
HuangJunJie2017/BEVDet - Code base of the BEVDet series .
-
city-super/Octree-AnyGS - Octree-GS
-
buaacyw/MeshAnythingV2 - [ICCV 2025] From anything to mesh like human artists. Official impl. of "MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization"
-
roboflow/supervision - We write your reusable computer vision tools. 💜
-
yifanlu0227/ChatSim - [CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
-
cjy1992/interp-e2e-driving - Interpretable End-to-end Urban Autonomous Driving with Latent Deep Reinforcement Learning
-
Robertwyq/PanoOcc - [CVPR 2024] PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation
-
TheAlgorithms/Python - All Algorithms implemented in Python
-
morrisfl/UniFEx - Framework for computationally efficient training of universal image feature extraction models.
-
PeidongLi/SSR - [ICLR 2025] The official implementation of SSR
-
hanyangyu1021/LMGaussian - official implementation of LM-Gaussian
-
pytorch/pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration
-
eth-ait/GaussianHaircut - Gaussian Haircut: Human Hair Reconstruction with Strand-Aligned 3D Gaussians
-
pyg-team/pytorch-frame - Tabular Deep Learning Library for PyTorch
-
jkulhanek/wild-gaussians - [NeurIPS'24] WildGaussians: 3D Gaussian Splatting In the Wild
-
bassamlab/SigmaRL - SigmaRL: A Sample-Efficient and Generalizable Multi-Agent Reinforcement Learning Framework for Motion Planning
-
DLR-MI/UTrack - Multi-Object Tracking with Uncertain Detections [ECCV 2024 UnCV]
-
stanfordnlp/dspy - DSPy: The framework for programming—not prompting—language models
-
TempleRAIL/drl_vo_nav - [T-RO 2023] DRL-VO: Learning to Navigate Through Crowded Dynamic Scenes Using Velocity Obstacles
-
lucasbrynte/gasfm - Implementation of the CVPR 2024 paper "Learning Structure-from-Motion with Graph Attention Networks".
-
SPengLiang/OccupancyM3D - [CVPR 2024] Learning Occupancy for Monocular 3D Object Detection
-
zhangganlin/GlORIE-SLAM - GlORIE-SLAM: Globally Optimized RGB-only Implicit Encoding Point Cloud SLAM
-
jbriales/rgbd_benchmark_tools - Tools for TUM RGBD Dataset Benchmark
-
yastrebksv/TennisProject - Tennis analysis using deep learning and machine learning
-
cvg/GeoCalib - GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
-
NVIDIA/TransformerEngine - A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory utilization in both training and inference.
-
lus6-Jenny/RING - [IEEE T-RO 2023] Source code of RING and RING++ for loop closure detection in LiDAR SLAM.
-
hacksider/Deep-Live-Cam - real time face swap and one-click video deepfake with only a single image
-
GANWANSHUI/GaussianOcc - (ICCV 2025) GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting
-
GradientSpaces/LoopSplat - [3DV 2025, Oral] LoopSplat: Loop Closure by Registering 3D Gaussian Splats
-
zhaofuq/LOD-3DGS - LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian(Published in SIGGRAPH Asia 2024)
-
hjr37/CP-SLAM - CP-SLAM: Collaborative Neural Point-based SLAM
-
cvg/nicer-slam - [3DV'24 Best Paper Honorable Mention] NICER-SLAM: Neural Implicit Scene Encoding for RGB SLAM
-
lpiccinelli-eth/UniDepth - Universal Monocular Metric Depth Estimation
-
JeongminB/E-D3DGS - [ECCV 2024] Official repository for "Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting"
-
spla-tam/SplaTAM - SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)
-
sparolab/SOLiD - SOTA LiDAR Global Descriptor in LiDAR Place Recognition (accepted in RA-L'24 w/ ICRA'25)
-
IPNL-POLYU/UrbanNavDataset - UrbanNav:An Open-sourced Multisensory Dataset for Benchmarking Positioning Algorithms Designed for Urban Areas
-
YuxueYang1204/TrimGS - Trim 3D Gaussian Splatting for Accurate Geometry Representation
-
open-mmlab/mmtracking - OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
-
huggingface/transformers - 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
-
openai/openai-python - The official Python library for the OpenAI API
-
llmbev/talk2bev - Talk2BEV: Language-Enhanced Bird's Eye View Maps (ICRA'24)
-
liuyuan-pal/SyncDreamer - [ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image
-
fudan-zvg/4d-gaussian-splatting - [ICLR 2024] Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting
-
qinzheng93/GeoTransformer - [CVPR2022] Geometric Transformer for Fast and Robust Point Cloud Registration
-
yanyan-li/GeoGaussian - GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering
-
Parskatt/DeDoDe - [3DV 2024 Oral] DeDoDe 🎶 Detect, Don't Describe --- Describe, Don't Detect, for Local Feature Matching
-
ericzzj1989/BALF - [WACV 2024] BALF: Simple and Efficient Blur Aware Local Feature Detector
-
lyakaap/NetVLAD-pytorch - PyTorch implementation of NetVLAD & Online Hardest Triplet Loss.
-
xiaobiaodu/DreamCar - [RA-L 2024] DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction
-
nianticlabs/acezero - [ECCV 2024 - Oral] ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.
-
meta-llama/llama-models - Utilities intended for use with Llama models.
-
cs230-stanford/cs230-code-examples - Code examples in pyTorch and Tensorflow for CS230
-
ddbourgin/numpy-ml - Machine learning, in numpy
-
tarashakhurana/4d-occ-forecasting - CVPR 2023: Official code for `Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting'
-
uoip/stereo_msckf - Python implementation of Multi-State Constraint Kalman Filter (MSCKF) for Vision-aided Inertial Navigation.
-
fundamentalvision/BEVFormer - [ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
-
NVlabs/FB-BEV - Official PyTorch implementation of FB-BEV & FB-OCC - Forward-backward view transformation for vision-centric autonomous driving perception
-
OpenDriveLab/OccNet - [ICCV 2023] OccNet: Scene as Occupancy
-
ViewFormerOcc/ViewFormer-Occ - [ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers
-
MCG-NJU/SparseOcc - [ECCV 2024] Fully Sparse 3D Occupancy Prediction & RayIoU Evaluation Metric
-
VISION-SJTU/SparseOcc - Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 2024)
-
weiyithu/SurroundOcc - [ICCV 2023] SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving
-
autonomousvision/occupancy_networks - This repository contains the code for the paper "Occupancy Networks - Learning 3D Reconstruction in Function Space"
-
Ferry-Li/SI-SOD - ICML2024: Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection
-
Ferry-Li/SI_Metric - A portable computation of Size-Invariant Metrics for ICML2024: Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection
-
autonomousvision/mip-splatting - [CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting
-
Vincentqyw/image-matching-webui - 🤗 image matching webui
-
LiheYoung/Depth-Anything - [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
-
microsoft/graphrag - A modular graph-based Retrieval-Augmented Generation (RAG) system
-
rvp-group/vbr-devkit - Vision Benchmark in Rome Development Kit
-
utiasSTARS/pykitti - Python tools for working with KITTI data.
-
huang-yh/GaussianFormer - [ECCV 2024] Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
-
TQTQliu/MVSGaussian - [ECCV 2024] MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo
-
buaacyw/MeshAnything - [ICLR 2025] From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"
-
swc-17/SparseDrive - SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation
-
minghanqin/LangSplat - Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
-
Xinyu-Yi/TransPose - A real-time motion capture system that estimates poses and global translations using only 6 inertial measurement units
-
Awesome3DGS/3D-Gaussian-Splatting-Papers - 3D高斯论文,持续更新,欢迎交流讨论。
-
cvg/glue-factory - Training library for local feature detection and matching
-
cvg/LightGlue - LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
-
muskie82/MonoGS - [CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM
-
lukas-blecher/LaTeX-OCR - pix2tex: Using a ViT to convert images of equations into LaTeX code.
-
tjiiv-cprg/EPro-PnP - [CVPR 2022 Best Student Paper] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation
-
ChiWeiHsiao/DeepVO-pytorch - PyTorch Implementation of DeepVO
-
cvg/nice-slam - [CVPR'22] NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
-
yanyan-li/SLAM-BOOK - 这是一本关于SLAM的书稿,希望能清楚的介绍SLAM系统中的使用的几何方法和深度学习方法。书稿最后应该会达到200页左右,书稿每章对应的代码也会被整理出来。
-
Shiaoming/Python-VO - A simple python implemented frame-by-frame visual odometry with SuperPoint feature detector and SuperGlue feature matcher.
-
openxrlab/xrdslam - Platform for Deep Learning based SLAM
-
H-EmbodVis/DOMINO - Towards Generalizable Robotic Manipulation in Dynamic Environments
-
xianyu110/clawbot - Clawdbot完整配置指南:从安装到Claude Code中转
-
kepano/obsidian-skills - Agent skills for Obsidian. Teach your agent to use Markdown, Bases, JSON Canvas, and use the CLI.
-
leofan90/Awesome-World-Models - A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related websites.
-
mlherd/Dataset-of-Gazebo-Worlds-Models-and-Maps - A set of Gazebo worlds models and maps that I used for testing Navigation2
-
wuxingsanren/wildcat-vip-account - 野猫 - 每天分享最新的百度网盘SVIP、迅雷超级会员、手机话费折扣充值、霸王餐免费吃VIP(美团、饿了么、大众点评、肯德基、麦当劳、星巴克)、饿了么超级会员、美团外卖会员&红包券、爱奇艺VIP会员、腾讯视频VIP、优酷VIP会员、哔哩哔哩大会员、百度文库VIP、QQ音乐VIP、网易云黑胶VIP、喜马拉雅VIP、樊登读书会VIP、千图网VIP、包图网VIP、摄图网VIP、CSDN下载VIP、天眼查VIP、苹果ID等等各类VIP帐号,随取随用,完全免费,绝无套路,同时提供:百度文库VIP下载、图库素材VIP下载、学术文献VIP下载(知网维普万方读秀龙源超星、英文数据库、法律数据库、医学数据库、金融数据库)、全网视频VIP解析、全网音乐MP3免费听及下载、微信域名拦截检测API ,欢迎推荐分享给
-
curionox/lifekline - 人生K线 - 基于AI的八字命理可视化工具
-
OpenDriveLab/WholebodyVLA - [ICLR 2026] Towards Unified Latent VLA for Whole-body Loco-manipulation Control
-
thomaschabal/fom-nav - Official implementation of "FOM-Nav: Frontier-Object Maps for Object Goal Navigation". Code release expected in December 2025.
-
DennisRotondi/awesome-3D-scene-graphs - Awesome 3D Scene Graphs: a curated list of 3D scene graph generation and related resources!
-
jc-bao/awesome-mujoco - A collection of awesome projects using MuJoCo.
-
yukangcao/Awesome-4D-Spatial-Intelligence - A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)
-
0311lzy/PVSet_data - This is a 10-meter resolution photovoltaic power station distribution map extracted using the SolarSegNet model, integrating Sentinel-1 and Sentinel-2 imagery. The coverage area includes 14 coastal provincial-level administrative regions and special administrative regions of China in 2024.
-
BaiShuanghao/Awesome-Robotics-Manipulation - A comprehensive list of papers about Robot Manipulation, including papers, codes, and related websites.
-
jiangranlv/embodied-ai-start - [PKU EPIC Lab] 面向小白的具身智能入门指南
-
bagh2178/GC-VLN - [CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation
-
KwanWaiPang/Awesome-Transformer-based-SLAM - Paper Survey for Transformer-based SLAM
-
KwanWaiPang/Awesome-VLN - Paper Survey for Visual Language Navigation
-
wengminghe/Dynamic-DINO - [ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection"
-
GWxuan/IGL-Nav - [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
-
manycore-research/InteriorGS - InteriorGS: 3D Gaussian Splatting Dataset of Semantically Labeled Indoor Scenes
-
MoonshotAI/Kimi-K2 - Kimi K2 is the large language model series developed by Moonshot AI team
-
knemik97/Manifesto-against-the-Plagiarist-Yunhe-Wang - 讨贼王云鹤檄文
-
HW-whistleblower/True-Story-of-Pangu - 诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。
-
zijie0/HumanSystemOptimization - 健康学习到150岁 - 人体系统调优不完全指南
-
yuanpengtu/PlayerOne - PlayerOne: Egocentric World Simulator
-
TurtleZhong/Map-based-Visual-Localization - A general framework for map-based visual localization. It contains 1) Map Generation which support traditional features or deeplearning features. 2) Hierarchical-Localizationvisual in visual(points or line) map. 3)Fusion framework with IMU, wheel odom and GPS sensors.
-
ZJU-LLMs/Foundations-of-LLMs - A book for Learning the Foundations of LLMs
-
PRBonn/2DGS-SLAM - 2DGS-SLAM: Globally Consistent RGB-D SLAM with 2D Gaussian Splatting
-
DEEP-PolyU/Awesome-GraphRAG - Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and opensource projects) on graph-based retrieval-augmented generation.
-
Xnhyacinth/Awesome-LLM-Long-Context-Modeling - 📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
-
cchester25/FAST_LIVO2_Noted - 从小白的视角去分析多源融合SLAM的SOTA框架
-
KalyanKS-NLP/llm-engineer-toolkit - A curated list of 120+ LLM libraries category wise.
-
MoonshotAI/Kimi-VL - Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
-
KwanWaiPang/Awesome-Learning-based-VO-VIO - Paper Survey for Learning-based Odometry
-
mli/paper-reading - 深度学习经典、新论文逐段精读
-
formulahendry/955.WLB - 955 不加班的公司名单 - 工作 955,work–life balance (工作与生活的平衡)
-
iminolee/Awesome-Vision-and-Language-Navigation - A curated list of awesome Vision-and-Language Navigation(VLN) resources (continually updated)
-
zhangyuejoslin/VLN-Survey-with-Foundation-Models - [TMLR 2024] repository for VLN with foundation models
-
TJU-Aerial-Robotics/YOPO-Tracker - An End-to-End Agile Tracking and Navigation Method for UAVs
-
DoongLi/ICRA2025-Paper-List - ICRA2025 Paper List
-
LinusNEP/EnvoDat - EnvoDat: A Large-Scale Multisensory Dataset for Robotic Spatial Awareness and Semantic Reasoning in Heterogeneous Environments
-
Songwxuan/Embodied-AI-Paper-TopConf - [Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV, ECCV).
-
jonyzhang2023/awesome-embodied-vla-va-vln - A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
-
wz0919/VLN-SRDF - Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
-
aikit-wrc/robosense_ac_slam - A Fast and Tightly-coupled Sparse-Direct LiDAR-Inertial-Visual Odometry (LIVO).
-
haoranD/Awesome-Embodied-AI - A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
-
peakpang/UGP - [CVPR 2025 Highlight] Unlocking Generalization Power in LiDAR Point Cloud Registration
-
fffaraz/awesome-cpp - A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
-
flyingGH/open_vio - 基于vins-fusion 修改,提取关键帧用于三维重建
-
PetWorm/IMU-Preintegration-Propogation-Doc - 中文文档:IMU预积分总结与公式推导
-
MIT-SPARK/Kimera - Index repo for Kimera code
-
AdrianWilczynski/OneDarkPro - "One Dark Pro" theme for Visual Studio generated using Alexander Teinum's "Dainty for Visual Studio", saved with "Visual Studio Color Theme Designer" and tweaked to closer match Binaryify's "One Dark Pro" theme for Visual Studio Code.
-
szx-0633/DeepSeek-R1-learning-note - My learning note about DeepSeek-R1 reasoning LLM
-
John19187/v2ray-SSR-Clash-Verge-Shadowrocke - 2026年免费高速(25.6M/S)v2ray、ss、sing-box、Clash、Verge、SSR、Shadowrocke-小火箭机场节点订阅指南,翻墙梯子,电脑、手机、iOS、安卓、windows、Mac、Linux、路由器翻墙、科学上网、解锁YouTube、Netflix、TikTok、ChatGPT、bilibili港澳台。科学上网、梯子、VPN测评,适用Clash、V2ray、小火箭、sing-box等客户端
-
Lee-JaeWon/2024-Arxiv-Paper-List-Gaussian-Splatting - 2024 Gaussian Splatting Paper List(Arxiv)
-
StarCycle/Awesome-Embodied-AI-Job - Lumina Robotics Talent Call | Lumina社区具身智能招贤榜 | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, etc
-
Pawdroid/Free-servers - 🚀 免费订阅地址,🚀 免费节点,🚀 6小时更新一次,共享节点,节点质量高可用,完全免费。免费clash订阅地址,免费翻墙、免费科学上网、免费梯子、免费ss/v2ray/trojan节点、谷歌商店、翻墙梯子。🚀 Free subscription address, 🚀 Free node, 🚀 Updated every 6 hours, shared node, high-quality node availability, completely free. Free clash subscription address, free ss/v2ray/trojan node.
-
getActivity/EmojiPackage - 表情包资源合集,张张都是经典
-
BestJunYu/Awesome-Physics-aware-Generation - Physical laws underpin all existence, and harnessing them for generative modeling opens boundless possibilities for advancing science and shaping the future!
-
deepseek-ai/awesome-deepseek-integration - Integrate the DeepSeek API into popular software
-
LongHZ140516/PaperGallery - A curated gallery and toolkit designed to provide inspiration for scientific illustrations, project sites, and visual storytelling in research.
-
shinyypig/latex-vscode-config - Use LaTeX in VSCode.
-
cheryyunl/awesome-generalist-agents - A curated list of papers for generalist agents
-
jinyummiao/map-in-mono-reloc - a paper list of visual re-localization algorithms
-
Vincentqyw/Recent-Stars-2025 - 🔥SLAM, VIsual localization, keypoint detection, Image matching, Pose/Object tracking, Depth/Disparity/Flow Estimation, 3D-graphic, etc. related papers and code
-
youngguncho/awesome-slam-datasets - A curated list of awesome datasets for SLAM
-
zju3dv/MatchAnything - Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.
-
Hannibal046/Awesome-LLM - Awesome-LLM: a curated list of Large Language Model
-
sirius1024/iterm2-with-oh-my-zsh - iTerm2 + Oh My Zsh 打造舒适终端体验
-
siyuanliii/SLAck - Official Implementation of ECCV2024 paper: SLAck
-
CASIA-LONG/Active-SLAM-Paper-List - This repository primarily organizes papers, code, and other relevant materials related to Active SLAM and Robotic Exploration.
-
serhii-londar/open-source-mac-os-apps - 🚀 Awesome list of open source applications for macOS. https://t.me/s/opensourcemacosapps
-
linyicheng1/Quaternion-Kinematics-for-the-Error-State-Kalman-Filter - Quaternion Kinematics for the Error-State Kalman Filter (中文全文翻译)
-
SJTU-ViSYS/M2DGR-plus - Extension and update of M2DGR: a novel Multi-modal and Multi-scenario SLAM Dataset for Ground Robots (ICRA2022 & ICRA2024)
-
HKUST-Aerial-Robotics/OmniNxt - [IROS'24 Oral] A Fully Open-source and Compact Aerial Robot with Omnidirectional Visual Perception
-
GeekLiB/Lee-SLAM-source - SLAM 开发学习资源与经验分享
-
uzh-rpg/event-based_vision_resources - Event-based Vision Resources. Community effort to collect knowledge on event-based vision technology (papers, workshops, datasets, code, videos, etc)
-
bikhanal/awesome-360-depth-estimation - State-of-the-art papers for depth estimation of 360 images.
-
L3Y1Q2/MyBrain - Knowledge makes up the brain
-
changh95/visual-slam-roadmap - Roadmap to become a Visual-SLAM developer in 2026
-
SJTU-ViSYS/M2DGR - M2DGR: a Multi-modal and Multi-scenario Dataset for Ground Robots(RA-L2021 & ICRA2022)
-
IntelliSensing/UAV-VisLoc - UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization
-
luohongk/slam-handbook-chinese - 本项目主要是关于slam handbook的中文版本
-
RipplePiam/MobaXterm-Chinese-Simplified - MobaXterm 简体中文汉化版🌏🖥🖥🖥 【💌慢工精心制作,"提示"也汉化💻】 【😍控件布局精细调整】
-
OpenDriveLab/End-to-end-Autonomous-Driving - [IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
-
maziarraissi/Applied-Deep-Learning - Applied Deep Learning Course
-
520xyxyzq/awesome-object-SLAM - A curated list of Object SLAM papers and resources
-
HuaiyuanXu/3D-Occupancy-Perception - [Information Fusion 2025] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective
-
chicleee/Image-Matching-Paper-List - A personal list of papers and resources of image matching and pose estimation, including perspective images and panoramas.
-
SilenceOverflow/Awesome-SLAM - A curated list of SLAM resources
-
52CV/awesome-huggingface - 🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
-
HCPLab-SYSU/Embodied_AI_Paper_List - [Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI
-
william-sto/JusticeNeverTooLate - 字节跳动瓜最终真实情况,用事实说话,正义会迟到但不会缺席!
-
amusi/CVPR2026-Papers-with-Code - CVPR 2026 论文和开源项目合集
-
zhuhu00/Awesome_Dynamic_SLAM - Dynamic SLAM, Life-long SLAM Research(Lidar, Visual, Sensor Fusion etc.)
-
hongwenjun/tmux_for_windows - tmux是一个开源工具,用于在一个终端窗口中运行多个终端会话。本工具从msys2里提取,可以在Git for Windows的Git Bash (MingW64)下正常使用。
-
HumanAIGC/AnimateAnyone - Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
-
Open3DVLab/StreetSurfGS - StreetSurfGS: Scalable Large Scene Surface Reconstruction with Gaussian Splatting for Urban Street Scences
-
TianxingChen/Embodied-AI-Guide - [Lumina具身智能社区] 具身智能技术指南 Embodied-AI-Guide
-
Open3DVLab/GigaGS - [AAAI 2025] GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction
-
Thinklab-SJTU/Awesome-LLM4AD - A curated list of awesome LLM/VLM/VLA/World Model for Autonomous Driving(LLM4AD) resources (continually updated)
-
ai-vip/stable-diffusion-tutorial - 全网最全Stable Diffusion全套教程,从入门到进阶,耗时三个月制作
-
AlbertSlam/Lee-SLAM-source - SLAM 开发学习资源与经验分享
-
DeepLabc/LargeScale_3DGS - 3D Gaussian Splatting Papers Relating to Large-Scale Scene.
-
sjtuyinjie/awesome-LiDAR-Visual-SLAM - A curated list of resources relevant to LiDAR-Visual-Fusion-SLAM
-
perkfly/reverse-interview-zh - 技术面试最后反问面试官的话
-
kwea123/gaussian_splatting_notes - A detailed formulae explanation on gaussian splatting
-
623637646/996.Leave - 逃离996
-
Meltwin/Noetic-Ubuntu22.04 - Manual instructions on how to install ROS1 Noetic on Ubuntu 22.04
-
StevenCui/VIO-Doc - 主流VIO论文推导及代码解析
-
ericzzj1989/Awesome-Image-Matching - Bibliographic list for papers of image matching
-
miss-mumu/developer2gwy - 公务员从入门到上岸,最佳程序员公考实践教程
-
llamastack/llama-stack-apps - Agentic components of the Llama Stack APIs
-
0voice/expert_readed_books - 2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,思想类,数学类,人物传记书籍
-
jianzongwu/Awesome-Open-Vocabulary - (TPAMI 2024) A Survey on Open Vocabulary Learning
-
lvchuandong/Awesome-Multi-Camera-3D-Occupancy-Prediction - Awesome papers and code about Multi-Camera 3D Occupancy Prediction, such as TPVFormer, SurroundOcc, PanoOcc, OccFormer, FB-OCC, SelfOcc, COTR, SparseOcc, GaussianFormer, GaussianOcc, STCOcc, OccMamba. In this repository, you will see the latest 3D occupancy prediction papers and code.
-
pubsys/ReviewSystem - 审稿系统的自述
-
weisongwen/UrbanNavDataset - UrbanNav: an Open-Sourcing Localization Data Collected in Asian Urban Canyons, Including Tokyo and Hong Kong
-
datawhalechina/pumpkin-book - 南瓜书:《机器学习》(西瓜书)公式详解
-
pengsida/learning_research - 本人的科研经验
-
amusi/Deep-Learning-Interview-Book - 深度学习面试宝典(含数学、机器学习、深度学习、计算机视觉、自然语言处理和SLAM等方向)
-
duoan/TorchCode - 🔥 LeetCode for PyTorch — practice implementing softmax, attention, GPT-2 and more from scratch with instant auto-grading. Jupyter-based, self-hosted or try online.
-
Infrasys-AI/AISystem - AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
-
qiuzh20/gated_attention - The official implementation for [NeurIPS2025 Oral] Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free
-
QwenLM/Qwen3-Omni - Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
-
QwenLM/Qwen2.5-Omni - Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
-
IDEA-Research/Grounded-SAM-2 - Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
-
nv-tlabs/GEN3C - [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
-
LaVi-Lab/VG-LLM - The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'
-
facebookresearch/dinov3 - Reference PyTorch implementation and models for DINOv3
-
facebookresearch/dinov2 - PyTorch code and models for the DINOv2 self-supervised learning method.
-
InternRobotics/InternNav - InternRobotics' open platform for building generalized navigation foundation models.
-
HeegerGao/FLIP - Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks
-
datawhalechina/easy-rl - 强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
-
RL4VLM/RL4VLM - Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
-
ByteDance-Seed/Seed1.5-VL - Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
-
Robotics-STAR-Lab/DynamicPose - [IROS 2025] DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects
-
microsoft/ai-agents-for-beginners - 12 Lessons to Get Started Building AI Agents
-
NVIDIA/Isaac-GR00T - NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
-
google-gemini/gemini-fullstack-langgraph-quickstart - Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
-
datawhalechina/happy-llm - 📚 从零开始构建大模型
-
Liuziyu77/Visual-RFT - Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’
-
bagh2178/SG-Nav - [NeurIPS 2024] SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
-
facebookresearch/EdgeTAM - [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"
-
zhanshijinwat/Steel-LLM - Train a 1B LLM with 1T tokens from scratch by personal
-
facebookresearch/co-tracker - CoTracker is a model for tracking any point (pixel) on a video.
-
arclab-hku/DEIO - (ICCV2025) Learning-based Event-Inertial Odometry
-
IDEA-Research/Grounded-Segment-Anything - Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
-
facebookresearch/segment-anything - The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
-
xinyu1205/recognize-anything - Open-source and strong foundation image recognition models.
-
CompVis/depth-fm - [AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matching
-
luhengshiwo/LLMForEverybody - 每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈
-
luohongk/Embodied-Navigation - 关于Embodied-Navigation的仓库,主要用于整理我在定位,感知,规控,3D Vision, VLN中的部分知识
-
GAP-LAB-CUHK-SZ/gaustudio - A Modular Framework for 3D Gaussian Splatting and Beyond
-
HCPLab-SYSU/LH-VLN - Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25)
-
HandsOnLLM/Hands-On-Large-Language-Models - Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
-
CurryYuan/ZSVG3D - [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding
-
datawhalechina/tiny-universe - 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe
-
robot-pesg/BotanicGarden - BotanicGarden: A high-quality dataset for robot navigation in unstructured natural environments
-
QwenLM/Qwen3-VL - Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
-
DjangoPeng/LLM-quickstart - Quick Start for Large Language Models (Theoretical Learning and Practical Fine-tuning) 大语言模型快速入门(理论学习与微调实战)
-
zju3dv/LoFTR - Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
-
hesamsheikh/ml-retreat - Machine Learning Journal for Intermediate to Advanced Topics.
-
DataExpert-io/data-engineer-handbook - This is a repo with links to everything you'd ever want to learn about data engineering
-
florinshen/FlashSplat - [ECCV2024] [3DV Nectar 2025] FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally
-
yzslab/gaussian-splatting-lightning - A 3D Gaussian Splatting framework with various derived algorithms and an interactive web viewer
-
CyberOrigin2077/Cyber - This repo is designed for General Robotic Operation System
-
Tencent-Hunyuan/HunyuanDiT - Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
-
microsoft/OmniParser - A simple screen parsing tool towards pure vision based GUI agent
-
TommyZihao/vlm_arm - 机械臂+大模型+多模态=人机协作具身智能体
-
cumtcssuld/RSP_of_CUMTCS - 【矿大计算机学院资源共享计划(Resource SharingPlan of CUMTCS)】本仓库由矿大计算机学院学生会学习部牵头维护,由计算机学院全体同学共建共享。欢迎大家积极的参加到本资源库的建设中来吧!(每当有重大更新,我们都会将整个库克隆到码云,点击下边链接,到我们的码云仓库可以获得更好的下载体验)
-
ut-amrl/ObVi-SLAM - Long-Term Object Visual SLAM
-
Infrasys-AI/AIInfra - AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
-
CompVis/stable-diffusion - A latent text-to-image diffusion model
-
AnyLoc/Revisit-Anything - Code release for Revisit Anything: Visual Place Recognition via Image Segment Retrieval (ECCV 2024)
-
be2rlab/gsplatloc - [IROS 2025] GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization
-
TommyZihao/Train_Custom_Dataset - 标注自己的数据集,训练、评估、测试、部署自己的人工智能算法
-
isl-org/ZoeDepth - Metric depth estimation from a single image
-
datawhalechina/leedl-tutorial - 《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
-
Fafa-DL/Lhy_Machine_Learning - 李宏毅2021/2022/2023春季机器学习课程课件及作业
-
yubaoliu/RDS-SLAM - DS-SLAM: Real-Time Dynamic SLAM Using Semantic Segmentation Methods
-
SakanaAI/AI-Scientist - The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
-
datawhalechina/self-llm - 《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
-
facebookresearch/sam2 - The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
-
hustvl/4DGaussians - [CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
-
heucoder/ML-DL_book - 机器学习、深度学习一些个人认为不错的书籍。
-
verlab/accelerated_features - Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
-
openclaw/openclaw - Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
-
Molunerfinn/PicGo - 🚀 The Ultimate Image Uploader for Efficient Creators. Supports Obsidian, Typora, VS Code etc. and 60+ image hosting services (S3, GitHub, Cloudflare R2, Imgur, Aliyun OSS...). Paste, upload, done.
-
zimya/zhihu_obsidian - Zhihu on Obsidian | 知乎 Obsidian 插件
-
OpenCut-app/OpenCut - The open-source CapCut alternative
-
chanhx/crabviz - Generate interactive call graphs for various languages
-
plait-board/drawnix - 开源白板工具(SaaS),一体化白板,包含思维导图、流程图、自由画等。All in one open-source whiteboard tool with mind, flowchart, freehand and etc.
-
shareAI-lab/learn-claude-code - Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1
-
google-gemini/gemini-cli - An open-source AI agent that brings the power of Gemini directly into your terminal.
-
n8n-io/n8n - Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
-
123xiao/sex-agreement-app - X行为同意协议系统
-
microsoft/vscode - Visual Studio Code
-
mastra-ai/mastra - From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
-
Binaryify/OneDark-Pro - Atom's iconic One Dark theme for Visual Studio Code
-
MegaScenes/web-viewer - web viewer for 3d reconstructions
-
coaidev/coai - 🚀 Next Generation Multi-tenant AI One-Stop Solution. Builtin Admin & Billing System. Enterprise-Grade Unified LLM Gateway Support for 200+ Models And 35+ Providers, Load Balacing w/ Priority-base Routing, Cost Management, Chat Share, Cloud Sync, Credit/Subscription Billing, All File Parsing, Web Search, Built-in Model Cache.
-
Eugeny/tabby - A terminal for a more modern age
-
amir9480/vscode-cpp-helper - vscode extension to create implementation for c++ function prototypes.
-
hcengineering/platform - Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)
-
conwnet/github1s - One second to read GitHub code with VS Code.
-
ocsjs/ocsjs - OCS 网课助手,刷课脚本,网课脚本,帮助大学生解决网课难题,支持【超星学习通】【知道智慧树】【职教云】【智慧职教】【中国大学MOOC】等网课 , 可以在 脚本猫 以及 油猴 等开源脚本管理器下运行。
-
immich-app/immich - High performance self-hosted photo and video management solution.
-
clash-verge-rev/clash-verge-rev - A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience
-
fengtt42/U2UData-2 - THU-EAI Lab
-
luohongk/AcademicHomepage - This project is a beautiful personal academic homepage template created by me. Welcome to use it.
-
Maserhe/VScode-Markdown-theme-Maserhe - vscode 自定义Markdown排版风格,以及代码块样式风格。
-
wzzheng/GaussianFormer - Project Page for GaussianFormer
-
i2Nav-WHU/i2Nav-Robot - A Large-Scale Indoor-Outdoor Robot Dataset for Multi-Sensor Fusion Navigation and Mapping
-
lesaf92/ros_noetic_ubuntu22 - Instructions for installing ROS Noetic on Ubuntu 22.04
-
caol64/wenyan - 文颜- Markdown文章排版美化工具,支持微信公众号、今日头条、知乎等平台。
-
jordanbaird/Ice - Powerful menu bar manager for macOS
-
lwouis/alt-tab-macos - Windows alt-tab on macOS
-
exelban/stats - macOS system monitor in your menu bar
-
ejbills/DockDoor - Window peeking, alt-tab and other enhancements for macOS
-
Caldis/Mos - 一个用于在 macOS 上平滑你的鼠标滚动效果或单独设置滚动方向的小工具, 让你的滚轮爽如触控板 | A lightweight tool used to smooth scrolling and set scroll direction independently for your mouse on macOS
-
gao-sun/eul - 🖥️ macOS status monitoring app written in SwiftUI.
- pppscn/SmsForwarder - 短信转发器——监控Android手机短信、来电、APP通知,并根据指定规则转发到其他手机:钉钉群自定义机器人、钉钉企业内机器人、企业微信群机器人、飞书机器人、企业微信应用消息、邮箱、bark、webhook、Telegram机器人、Server酱、PushPlus、手机短信等。包括主动控制服务端与客户端,让你轻松远程发短信、查短信、查通话、查话簿、查电量等。(V3.0 新增)PS.这个APK主要是学习与自用,如有BUG请提ISSUE,同时欢迎大家提PR指正
-
nelvko/clash-for-linux-install - 😼 优雅地使用基于 clash/mihomo 的代理环境
-
yuaotian/go-cursor-help - 解决Cursor在免费订阅期间出现以下提示的问题: Your request has been blocked as our system has detected suspicious activity / You've reached your trial request limit. / Too many free trial accounts used on this machine.
-
dockur/macos - MacOS inside a Docker container.
-
wnlen/clash-for-linux - 🐧 在 Linux 上提供一套完整的 Clash / Mihomo(Clash Meta) 代理与管理面板
-
VocabVictor/clash-for-AutoDL - AutoDL平台服务器适配梯子, 使用 Clash 作为代理工具
-
techahold/rustdeskinstall - Easy install Script for Rustdesk
-
liguodongiot/llm-resource - LLM全栈优质资源汇总
-
atakandag/data_collection_vloc - Data Collection with Zed2 and Ouster LiDAR and 3D Reconstruction with Rtabmap
-
saicaca/fuwari - ✨A static blog template built with Astro.
-
RomanHauksson/academic-project-astro-template - Astro template to help you build a website for your research paper, based on the Nerfies project page
-
luohongk/SurveyAlgo - 💪[SurveyAlgo] 测绘算法库! 本项目立足于测绘程序设计竞赛创建的测绘类算法仓库(An open-source code of surveying and mapping algorithms for programming design.)
-
xuankuzcr/xuankuzcr.github.io - My personal website.
-
Jun-CEN/Jun-CEN.github.io - My personal webset: cen-jun.com
-
zouzhekang/YJYpaper - 一个用来记录武汉大学杨景媛论文问题的仓库
-
wdndev/llm_interview_note - 主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
-
f/prompts.chat - f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
-
David-patrick-chuks/Riona-AI-Agent - Riona Ai Agent 🌸 is built using Node.js and TypeScript 🛠️, designed for seamless job execution 📸. It's lightweight, efficient, and still evolving 🚧—exciting new features coming soon! 🌟
-
PKUFlyingPig/cs-self-learning - 计算机自学指南
-
vernesong/OpenClash - A Clash Client For OpenWrt
-
beichensky/Font - FiraCode 和 Operator Mono 字体
-
Robotics-STAR-Lab/ApexNav - [RA-L'25] An Reliable and Efficient Framework for Zero-Shot Object Navigation
-
PrideLab/PRIDE-PPPAR - An open‑source software for Multi-GNSS PPP ambiguity resolution
-
0voice/algorithm-structure - 2021年最新总结 500个常用数据结构,算法,算法导论,面试常用,大厂高级工程师整理总结
-
kevin2431/Traj-LO - [RA-L 2024] In Defense of LiDAR-Only Odometry Using an Effective Continuous-Time Trajectory
-
rtklibexplorer/RTKLIB - A version of RTKLIB optimized for low cost GNSS receivers, especially u-blox receivers. It is based on RTKLIB 2.4.3. This software is provided “AS IS” without any warranties of any kind so please be careful, especially if using it in any kind of real-time application. Click on the "Releases" label below to see the latest Windows pre-release.
-
MichaelBeechan/PPP-RTK - SPP、RTD、PPP、RTK、PPP-RTK、RAIM、ARAIM et al
-
Azure1210/elegantbook-magic-revision - Elegentbook魔改版本!
-
fky2015/resume-ng - A LaTeX resume template designed for optimal information density and aesthetic appeal.
-
SLAM-Handbook-contributors/slam-handbook-public-release - Release repo for our SLAM Handbook
-
lliei0x/Moderncv-LateX - LaTeX简历模版👀📑
-
HouJP/resume - 使用LaTeX编译生成的中英文个人简历
-
whutug/whu-thesis - 武汉大学毕业论文 LaTeX 模版 2025
-
openai/harmony - Renderer for the harmony response format to be used with gpt-oss
-
prefix-dev/pixi - Powerful system-level package manager for Linux, macOS and Windows written in Rust – building on top of the Conda ecosystem.
-
rustdesk/rustdesk - An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
-
lapce/lapce - Lightning-fast and Powerful Code Editor written in Rust
-
typst/typst - A markup-based typesetting system that is powerful and easy to learn.
-
makeecat/Peng - A minimal quadrotor autonomy framework in Rust (Mac, Linux, Windows)
-
tailscale/tailscale - The easiest, most secure way to use WireGuard and 2FA.
-
fatedier/frp - A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
-
XTLS/Xray-core - Xray, Penetrates Everything. Also the best v2ray-core. Where the magic happens. An open platform for various uses.
-
yorukot/superfile - Pretty fancy and modern terminal file manager
-
JanDeDobbeleer/oh-my-posh - The most customisable and low-latency cross platform/shell prompt renderer
-
sourcegraph/sourcegraph-public-snapshot - Code AI platform with Code Search & Cody
-
ayamir/nvimdots - A well configured and structured Neovim.
-
gaboolic/rime-shuangpin-fuzhuma - 墨奇音形,打造最强双拼辅助码rime输入方案,让天下双拼用户人人用得上辅助码。基于雾凇-白霜词库,支持小鹤双拼、自然码双拼、搜狗双拼、微软双拼等多种双拼,辅助码支持墨奇码(原创拆分开源支持4万字)、自然码部首辅、小鹤音形(鹤形辅)等,支持双拼和辅助码之间排列组合,支持整句/字词输入。不认识的字可以笔画、部件拆字、仓颉码反查。支持aw、aj模式输入英文、日文,支持双拼并击输入、emoji、快符、日期、大写数字、计算器等高级功能。雾凇鹤|雾凇自然|墨奇码|墨奇音形
-
neovim/neovim - Vim-fork focused on extensibility and usability
-
amix/vimrc - The ultimate Vim configuration (vimrc)
-
vim/vim - The official Vim repository
-
linrongbin16/lin.vim - Lin Rongbin's (Neo)Vim Distribution
-
yanchi-3dv/diff-gaussian-rasterization-for-gsslam - The modified differential Gaussian rasterization in the CVPR 2024 highlight paper: GS-SLAM: Dense Visual SLAM with 3D Gaussian Splatting.
-
computerhistory/AlexNet-Source-Code - This package contains the original 2012 AlexNet code.
-
carlinds/splatad - SplatAD: Real-Time Lidar and Camera Rendering with 3D Gaussian Splatting for Autonomous Driving
-
YOUNG-bit/open_semantic_slam - ICRA2025: OpenGS-SLAM: Open-Set Dense Semantic SLAM with 3D Gaussian Splatting for Object-Level Scene Understanding
-
qdLMF/LightGlue-with-FlashAttentionV2-TensorRT - A cutlass cute implementation of headdim-64 flashattentionv2 TensorRT plugin for LightGlue. Run on Jetson Orin NX 8GB with TensorRT 8.5.2.
-
deepseek-ai/DeepGEMM - DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
- TapXWorld/ChinaTextbook - 所有小初高、大学PDF教材。
-
Anduin2017/HowToCook - 程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
-
linyicheng1/Dockers - 一些常用的Dockerfile 文件,能够快速部署运行一些常用算法,避免重复配置环境
-
jaeseok4104/slam-docker - SLAM Docker for research
-
Achuan-2/SlideSCI - PPT plugin, supports one-click to add image titles, copy and paste positions, one-click image alignment, and one-click to insert Markdown (including bold, hyperlinks, and other inline styles, as well as code blocks, LaTeX, and other block-level styles)! PPT插件,支持一键添加图片标题,复制粘贴位置、一键图片对齐、一键插入Markdown(加粗、超链接等行内样式、代码块、LaTeX等块级样式)、便捷导出图片!
-
2dust/v2rayN - A GUI client for Windows, Linux and macOS, support Xray and sing-box and others
-
mahoshojo0805/ContestPrograms - 测绘技能大赛程序
-
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning - This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
-
i2Nav-WHU/KF-GINS-Matlab - An EKF-based GNSS/INS Integrated Navigation Systems in Matlab (Matlab Version of KF-GINS)
-
zhao-zhibo/INS - INS.IMU. Inertial navigation mechanical arrangement algorithm, based on Yan Gongmin's PSINS 惯导机械编排算法,以严恭敏的PSINS为基础,可以完成武汉大学的机械编排课程作业.
- JunyaoHu/academic-project-page-template-vue - A vue-based project page template for academic papers. (in development) https://junyaohu.github.io/academic-project-page-template-vue
-
zh-google-styleguide/zh-google-styleguide - Google 开源项目风格指南 (中文版)
-
kahowang/FAST_LIO_SAM - Front_end : fastlio2 Back_end : lio_sam
- llvm/llvm-project - The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
- Relja/netvlad - NetVLAD: CNN architecture for weakly supervised place recognition
- stereolabs/zed-python-api - Python API for the ZED SDK
-
RayeRen/acad-homepage.github.io - AcadHomepage: A Modern and Responsive Academic Personal Homepage
-
localsend/localsend - An open-source cross-platform alternative to AirDrop
-
chen08209/FlClash - A multi-platform proxy client based on ClashMeta,simple and easy to use, open-source and ad-free.
-
labuladong/fucking-algorithm - Crack LeetCode, not only how, but also why.
-
codecrafters-io/build-your-own-x - Master programming by recreating your favorite technologies from scratch.
- tonsky/FiraCode - Free monospaced font with programming ligatures
- krahets/hello-algo - 《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持简中、繁中、English、日本語,提供 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 等代码实现