Main themes: - Chat completion support - Data parallel attention support - Multiple model and multiple LoRA support - Repository consolidation