Pull requests: NVIDIA/Model-Optimizer
Add DeepSeek MoE detection and export mapping in HF PTQ/export path
#1125
opened Mar 26, 2026 by
Charles-JCJ
Refine _extract_layer_prefixes to better handle mtp modules
#1124
opened Mar 26, 2026 by
Edwardf0t1
Remove --skip-softmax option from hf_sa (only allow calibration option)
#1123
opened Mar 25, 2026 by
rohansjoshi
Remove custom DistillationProvider and simplify mbridge distillation and hf export
#1122
opened Mar 25, 2026 by
kevalmorabia97
Added fallback to preload cudnn dlls from nvidia cudnn venv package or torch venv package
#1119
opened Mar 25, 2026 by
hthadicherla
Add custom MoE quantization guide for HuggingFace models
#1118
opened Mar 25, 2026 by
cjluo-nv
[BugFix][5271237][ONNX] Add Q/DQ placement for Conv->LayerNorm patterns
#1117
opened Mar 24, 2026 by
ajrasane
Add nvfp4_mse and nvfp4_local_hessian options to the ptq script.
cherry-pick
After code freeze, cherry-pick into release branch for next rc. Only for bug fixes and doc updates
#1113
opened Mar 24, 2026 by
bkartal-dev
Add bypass distillation (blockwise local KD) to puzzletron pipeline
#1111
opened Mar 24, 2026 by
Separius
[OMNIML-3776]: add clear docs restrict the model types
#1105
opened Mar 23, 2026 by
shengliangxu
fix: EAGLE mix_hidden_states in-place op crash (#1088)
#1104
opened Mar 23, 2026 by
javierdejesusda
6 tasks done
Added general graph surgery run function for easier scalability with Olive.
#1096
opened Mar 23, 2026 by
hthadicherla
[OMNIML-3689] PTQ quant_cfg semantic correction. Design in doc _quant_cfg.rst
#1094
opened Mar 22, 2026 by
shengliangxu
Exclude small-channel Conv nodes from FP8 quantization
#1083
opened Mar 20, 2026 by
nv-samcheng
[3/n] Add skip-softmax to Triton flash attention kernel
#1081
opened Mar 20, 2026 by
kaix-nv
fix: [modelopt 0.43.0][GB200][llm_ptq / sglang] Llama-3.1-8B-Inst (#5997673)
#1080
opened Mar 20, 2026 by
ChenhanYu
fix: [ModelOpt-Windows][modelopt 0.43.0] [genai_llm][README]: Sho (#5997787)
#1077
opened Mar 19, 2026 by
ChenhanYu
fix: Feature: Add validation for loaded modelopt state files (#1041)
#1074
opened Mar 19, 2026 by
ChenhanYu