* add support for qwen3.5 vl model
* enable detection of VLM models and allow using non-Hopper GPUs for GPT-OSS
* add broader vlm support
* add general vlm support
* support gemma3n
* address coderabbit review comments
- Fix eos_token_id truthiness check (0 is valid)
- Add isinstance guard for RopeParameters in mrope detection
- Add hasattr fallback for non-dict rope objects
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix CI: import sorting, pylint, and test mocks
- Fix isort ordering in vlm_utils.py and model.py
- Fix pylint: use 'from torch import nn', mark unused-argument
- Mock needs_sdpa and get_module_class_from_name in unit tests
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix ruff formatting for CI version (0.12.11)
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* address remaining review comments
- accelerator.py: fall back to a warning plus the default wrap policy
  instead of raising ValueError when no _no_split_modules entries resolve;
  try the underlying HF model as a secondary target
- model.py: use torch.cuda.current_device() instead of a hardcoded 0
- vlm_utils.py: add a trust_remote_code param (default False) to all
  config-loading functions; use init_empty_weights for the CausalLM shell;
  copy quantization metadata from the VLM
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix mamba kernel comments and exception handling
- Remove fabricated claim about C API incompatibility
- Accurately describe the issue as PyTorch/CUDA ABI mismatch
- Broaden exception handling to catch AttributeError
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
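The "eos_token_id truthiness check (0 is valid)" fix listed above can be illustrated with a minimal sketch. The commit's actual code is not shown in this excerpt, so the function names `pick_eos_buggy` and `pick_eos_fixed` are hypothetical and serve only to contrast a truthiness test with an explicit `None` test.

```python
def pick_eos_buggy(config_eos, tokenizer_eos):
    # Truthiness check: 0 is falsy, so a valid id of 0 is treated as "missing".
    return config_eos if config_eos else tokenizer_eos


def pick_eos_fixed(config_eos, tokenizer_eos):
    # Explicit None check: only a genuinely unset id falls back to the tokenizer.
    return config_eos if config_eos is not None else tokenizer_eos


buggy = pick_eos_buggy(0, 2)        # falls through to the tokenizer id
fixed = pick_eos_fixed(0, 2)        # keeps the valid id 0
unset = pick_eos_fixed(None, 2)     # still falls back when the id is unset
```

The same pattern applies to any integer id field where 0 is a legal value.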
- f"WARNING: There is a mismatch between pad token id of model ({self.model.config.pad_token_id}) and tokenizer({self.tokenizer.pad_token_id}). Fixing model pad token id to be same as tokenizer's pad token id"
+ f"WARNING: There is a mismatch between pad token id of model ({text_config.pad_token_id}) and tokenizer({self.tokenizer.pad_token_id}). Fixing model pad token id to be same as tokenizer's pad token id"

- f"WARNING: There is a mismatch between bos token id of model({self.model.config.bos_token_id}) and tokenizer({self.tokenizer.bos_token_id}). Fixing model bos token id to be same as tokenizer's bos token id"
+ f"WARNING: There is a mismatch between bos token id of model({text_config.bos_token_id}) and tokenizer({self.tokenizer.bos_token_id}). Fixing model bos token id to be same as tokenizer's bos token id"

- f"WARNING: There is a mismatch between eos token id of model({self.model.config.eos_token_id}) and tokenizer({self.tokenizer.eos_token_id}). Fixing model eos token id to be same as tokenizer's eos token id"
+ f"WARNING: There is a mismatch between eos token id of model({text_config.eos_token_id}) and tokenizer({self.tokenizer.eos_token_id}). Fixing model eos token id to be same as tokenizer's eos token id"
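The change above reads the special-token ids from `text_config` rather than from the top-level model config. A rough sketch of that idea follows, under stated assumptions: how the commit actually derives `text_config` is not visible in this excerpt, so `resolve_text_config` and `sync_special_tokens` are hypothetical helpers, and `SimpleNamespace` objects stand in for real model and tokenizer objects.

```python
from types import SimpleNamespace


def resolve_text_config(config):
    # Assumption: a VLM config nests its language-model settings under
    # `text_config`; a plain text-only config is used as-is.
    nested = getattr(config, "text_config", None)
    return nested if nested is not None else config


def sync_special_tokens(config, tokenizer):
    """Copy pad/bos/eos ids from the tokenizer onto the text config.

    Returns the list of field names that were changed.
    """
    text_config = resolve_text_config(config)
    changed = []
    for name in ("pad_token_id", "bos_token_id", "eos_token_id"):
        tok_id = getattr(tokenizer, name, None)
        # Compare against None explicitly, since 0 is a valid token id.
        if tok_id is not None and getattr(text_config, name, None) != tok_id:
            print(f"WARNING: {name} mismatch, using tokenizer value {tok_id}")
            setattr(text_config, name, tok_id)
            changed.append(name)
    return changed


# Stand-in objects: a VLM-style config with a nested text config.
vlm_config = SimpleNamespace(
    text_config=SimpleNamespace(pad_token_id=None, bos_token_id=1, eos_token_id=2)
)
tokenizer = SimpleNamespace(pad_token_id=0, bos_token_id=1, eos_token_id=2)
changed = sync_special_tokens(vlm_config, tokenizer)
```

With a text-only config, `resolve_text_config` returns the config itself, so the same call path covers both cases.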