Skip to content

kaofelix/pi-glm-image-summary

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pi-glm-image-summary

A pi extension that intercepts image reads when using non-vision GLM models and sends them to glm-4.6v for detailed analysis.

Why?

GLM text models (glm-4.6, glm-4.7, glm-4.7-flash) have no vision capabilities. GLM-4.6v does. This extension automatically detects when you're using a non-vision GLM model and intercepts image reads, sending them to glm-4.6v for comprehensive analysis.

Features

  • Automatic image interception: When using glm-4.7/glm-4.7-long, image file reads are automatically redirected to glm-4.6v for analysis
  • Comprehensive analysis: Extracts text content, visual elements, technical details, and more
  • Manual analysis command: /analyze-image <path> to manually analyze any image

Installation

Install globally:

pi install npm:pi-glm-image-summary

Or install for a specific project (writes to .pi/settings.json):

pi install -l npm:pi-glm-image-summary

To try it without installing:

pi -e npm:pi-glm-image-summary

Usage

Once installed, the extension loads automatically when you start pi:

pi --provider zai --model glm-4.7

Automatic Mode

When the extension detects:

  1. Current model is glm-4.7 or glm-4.7-long
  2. A file being read is an image (jpg, jpeg, png, gif, webp)

It will automatically spawn a subprocess with glm-4.6v to analyze the image and return a detailed summary.

Manual Analysis

Use the /analyze-image command to analyze any image:

/analyze-image ./screenshot.png

Supported Image Formats

  • JPEG (.jpg, .jpeg)
  • PNG (.png)
  • GIF (.gif)
  • WebP (.webp)

Configuration

The extension uses the ZAI provider for the vision model. Make sure you have proper API credentials configured.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors