Machine Learning

๐Ÿš€ I recently wrote a overview article about **Model Merging**

โœ๏ธ I recently wrote a overview article about **Model Merging**, covering nearly 30 methods with detailed mathematical derivations.

Sep 1, 2025

Model Merging Overview
Model Merging Overview

Please refer to this ariticle for details.

Aug 22, 2025

๐Ÿš€ I have sorted out several projects I have done in recent months
๐Ÿš€ I have sorted out several projects I have done in recent months

โœ๏ธ In recent months, I have done some interesting projects, including ViT on CIFAR-10, Simple ML, ASR, etc.

Jul 1, 2025

๐ŸŽ™๏ธ Auto Speech Recognition
๐ŸŽ™๏ธ Auto Speech Recognition

I dove into the world of Automatic Speech Recognition (ASR) by building a Large Vocabulary Continuous Speech Recognition (LVCSR) system using the Kaldi toolkit.

Jun 25, 2025

Simple-ML
Simple-ML

TripDataset Machine Learning Project This project is a complete implementation of machine learning pipelines applied to the TripDataset, focusing on data preprocessing, classification, and regression tasks, including: ๐Ÿงน Data preprocessing and cleaning (handling missing values, outlier detection, normalization, and feature engineering) ๐Ÿค– Model training for classification and regression (various ML algorithms for categorical and continuous prediction tasks) ๐Ÿ“Š Performance evaluation and metrics (accuracy, F1-score, RMSE, and other evaluation techniques) ๐Ÿ” Exploratory data analysis and visualization (insightful plots for feature relationships, distribution, and model performance)

Jun 7, 2025

๐Ÿงฉ ViT on CIFAR-10
๐Ÿงฉ ViT on CIFAR-10

This project is a complete implementation of Vision Transformer (ViT) applied to small-scale datasets (especially CIFAR-10), including extensive exploration.

May 31, 2025

ViT on CIFAR-10
ViT on CIFAR-10

ViT-torch: Vision Transformer on CIFAR-10 (PyTorch) This project is a complete implementation of Vision Transformer (ViT) applied to small-scale datasets (especially CIFAR-10), including: ๐ŸŽฏ Model implementations with various configurations (native ViT, ResNet+ViT hybrid, different patch/heads/blocks setups, Stochastic Depth/DropPath, etc.) ๐ŸŒน Training and evaluation scripts (with learning rate schedulers: Warmup/Linear/Cosine/Constant-Cosine/Warmup-Constant-Cosine) ๐Ÿงฉ Data augmentation (RandomCrop+Paste, MixUp, CutMix, RandAugment, and batch random augmentation) ๐Ÿ“ˆ Visualization and analysis (attention maps, attention distance, gradient rollout, feature maps, positional embedding similarity)

May 31, 2025

Weight Spectra Induced Efficient Model Adaptation
Weight Spectra Induced Efficient Model Adaptation

PEFT from fully fine-tuning perspective

May 29, 2025

๐ŸŽ™๏ธ Voice Activity Detection
๐ŸŽ™๏ธ Voice Activity Detection

I conducted extensive experiments comparing frame division methods and model performances, with rich visualizations.

May 15, 2025

๐ŸŽ‰ I have opensourced my VAD project recently
๐ŸŽ‰ I have opensourced my VAD project recently

๐Ÿ“ˆ I conducted extensive experiments comparing frame division methods and model performances, with rich visualizations.

May 5, 2025