Smart Screen

Record a screen demo, understand what happened, and turn it into a polished walkthrough.

Local-first Smart Screen analysis, optional BYOK AI, transcript-aware zooms, captions, click telemetry, and export-ready output.

Pitch

Most screen recorders stop at capture. Most AI demo tools over-edit, over-zoom, or hide the real workflow behind too much automation.

Smart Screen is built for a better path:

record normally
capture real interaction signals
understand narration and clicks together
suggest edits instead of silently forcing them
export a polished demo with captions, zooms, trims, and keystroke overlays

This project is designed to work in three modes:

Offline: local Smart Screen heuristics, no key required
Hybrid: local analysis plus optional BYOK AI refinement
Local AI: Ollama for teams that want to stay on-device
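
As a rough illustration, the three modes can be treated as a function of what the user has configured. The sketch below is not taken from the repository: `Mode`, `ProviderConfig`, `hasApiKey`, and `resolveMode` are hypothetical names chosen for this example.

```typescript
// Hypothetical types: the real configuration shape in the repo may differ.
type Mode = "offline" | "hybrid" | "local-ai";

interface ProviderConfig {
  provider: "openai" | "ollama";
  // Only meaningful for cloud providers; Ollama runs locally and needs no key.
  hasApiKey?: boolean;
}

// Pick a mode from the (optional) provider configuration.
function resolveMode(config?: ProviderConfig): Mode {
  // Nothing configured: local heuristics only.
  if (!config) return "offline";
  // Ollama keeps inference on-device.
  if (config.provider === "ollama") return "local-ai";
  // A cloud provider without a key cannot be called, so fall back to heuristics.
  return config.hasApiKey ? "hybrid" : "offline";
}
```

The point of the fallback is that a missing or incomplete provider setup never blocks the local pipeline.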

What Makes It Useful

Speech-grounded Smart Screen: The app can use transcript + cursor + frames together, so spoken instructions like “click this button” or “look at this chart” become better zooms, focus moments, and step titles.
Native interaction telemetry: On macOS, the app can capture true native click telemetry and global keystrokes for cleaner demo polishing.
Better polish controls: Instead of random AI output, the editor supports selective apply for AI zooms and trims, one-click polish, calm zoom behavior, click emphasis, captions, and keystroke overlays.
Before / after demoability: The editor has an Original vs Polished preview mode so judges can immediately see the transformation.

Core Features

Recording

screen/window recording
microphone recording enabled by default
pause / resume
local recording storage
native click telemetry on macOS
native keystroke telemetry on macOS

Editor

timeline-based editing
zoom regions
trim regions
speed regions
caption/transcript track
crop / padding / wallpaper
before / after preview mode
keyboard shortcuts customization
playback speed control
preset backgrounds and effects

Transcription & Captions

auto-transcription from microphone audio via MacOSTranscriber.app
import transcript from common formats (SRT, VTT, plain text)
transcript review and editing dialog with millisecond-precision timestamps
caption styling: font size, vertical offset, text color, background color
preset caption styles (Dark, Light, None)
SRT / VTT export
caption preview in video player with YouTube-style CC toggle button
burned-in captions for MP4 and GIF exports
dedicated "CC" row in timeline editor; click to seek
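
To make the SRT export and millisecond timestamps concrete, here is a minimal sketch of serializing caption cues to SRT. The `CaptionCue` shape and the function names are assumptions for illustration; the repository's own exporter may be structured differently.

```typescript
// Hypothetical cue shape; timestamps are in milliseconds.
interface CaptionCue {
  startMs: number;
  endMs: number;
  text: string;
}

// Left-pad a non-negative integer with zeros to the given width.
function zeroPad(value: number, width: number): string {
  let out = String(value);
  while (out.length < width) out = "0" + out;
  return out;
}

// Format milliseconds as an SRT timestamp: HH:MM:SS,mmm
function formatSrtTimestamp(ms: number): string {
  const total = Math.max(0, Math.round(ms));
  const hours = Math.floor(total / 3600000);
  const minutes = Math.floor((total % 3600000) / 60000);
  const seconds = Math.floor((total % 60000) / 1000);
  const millis = total % 1000;
  return `${zeroPad(hours, 2)}:${zeroPad(minutes, 2)}:${zeroPad(seconds, 2)},${zeroPad(millis, 3)}`;
}

// Serialize cues as numbered SRT blocks separated by blank lines.
function toSrt(cues: CaptionCue[]): string {
  return cues
    .map(
      (cue, index) =>
        `${index + 1}\n${formatSrtTimestamp(cue.startMs)} --> ${formatSrtTimestamp(cue.endMs)}\n${cue.text}\n`,
    )
    .join("\n");
}
```

A VTT variant would differ mainly in starting the file with a `WEBVTT` header line and using `.` instead of `,` before the milliseconds.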

Smart Screen

local click / typing / navigation / silence analysis
calmer auto-zoom planning
transcript-aware callouts
one-click Polish Demo
Smart Screen panel as slide-over sheet (accessible from top-bar icon)
speech-aware AI refinement
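
One way to picture "calmer" auto-zoom planning is to merge clicks that happen close together into a single zoom region, so the view does not jump on every click. The sketch below is an assumption about how such a heuristic could look, not the repository's algorithm; `ClickEvent`, `ZoomSuggestion`, `planCalmZooms`, and the default thresholds are all hypothetical.

```typescript
// Hypothetical telemetry shape; x and y are normalized to the 0..1 range.
interface ClickEvent {
  timeMs: number;
  x: number;
  y: number;
}

interface ZoomSuggestion {
  startMs: number;
  endMs: number;
  focusX: number;
  focusY: number;
}

// Group clicks separated by at most mergeGapMs, then emit one zoom per group,
// padded by holdMs on both sides and centered on the group's average position.
function planCalmZooms(
  clicks: ClickEvent[],
  mergeGapMs: number = 1500,
  holdMs: number = 1000,
): ZoomSuggestion[] {
  const sorted = clicks.slice().sort((a, b) => a.timeMs - b.timeMs);
  const groups: ClickEvent[][] = [];

  for (const click of sorted) {
    const current = groups[groups.length - 1];
    const previous = current ? current[current.length - 1] : undefined;
    if (current && previous && click.timeMs - previous.timeMs <= mergeGapMs) {
      current.push(click); // close enough in time: extend the current group
    } else {
      groups.push([click]); // otherwise start a new group
    }
  }

  return groups.map((group) => {
    const first = group[0];
    const last = group[group.length - 1];
    const sumX = group.reduce((sum, c) => sum + c.x, 0);
    const sumY = group.reduce((sum, c) => sum + c.y, 0);
    return {
      startMs: Math.max(0, first.timeMs - holdMs),
      endMs: last.timeMs + holdMs,
      focusX: sumX / group.length,
      focusY: sumY / group.length,
    };
  });
}
```

Because the output is a list of suggestions rather than applied edits, it fits the suggest-then-apply workflow: the editor can show each region and let the user accept or discard it.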

AI Assist (Optional BYOK)

BYOK provider settings with secure secret storage
OpenAI support for AI-powered Smart Screen analysis
Ollama support for on-device inference
local Ollama model discovery
model guidance for base vs instruction vs vision-capable models
AI-generated summaries, step titles, zooms, trims, and focus moments

Demo Flow

The strongest demo path is:

1. Record: capture a narrated walkthrough (video + audio + native click/keystroke telemetry).
2. Editor opens: the recording is auto-loaded with:
   - Local Smart Screen analysis (click, typing, navigation, silence detection)
   - Auto-transcription if a .transcription.wav sidecar exists
3. Show Original Preview: demonstrate the raw recording alongside the timeline.
4. Review Transcript: edit captions and timestamps in the transcript review dialog.
5. Smart Screen Panel: open the slide-over sheet to:
   - Tune caption styling (font, size, color, presets)
   - Apply one-click Polish Demo (auto-zooms, trims, silences removed)
6. Show Polished Preview: display the transformed version side-by-side with the Original.
7. Optional AI Refinement: open AI Assist to further refine suggestions (if OpenAI or Ollama is configured).
8. Export: choose MP4 or GIF with burned-in captions, applied effects, and overlays.

For a ready-to-use judging script, see HACKATHON_DEMO.md.

Data Flow

Record screen + audio + native events (clicks, keystrokes)
  |
  → Save as .webm video + .webm.cursor.json + .transcription.wav sidecars
  |
Open editor with recording
  |
  → Load video, cursor telemetry, auto-transcribe if .transcription.wav exists
  |
Local Smart Screen analysis
  |
  → Detect clicks, typing, inactivity, navigation patterns
  → Generate suggested zooms, trims, focus moments
  |
Review & edit transcript (timestamps, captions)
  |
Optional BYOK AI refinement (if OpenAI or Ollama configured)
  |
Apply edits: zooms, trims, captions, speed changes, overlays
  |
Export MP4 or GIF with captions, click emphasis, keystroke display
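
The sidecar step in the flow above can be sketched as a small path helper. Two things here are assumptions, not confirmed by this README: that the cursor file appends `.cursor.json` to the full video filename, and that the transcription audio replaces the `.webm` extension with `.transcription.wav`. The names `RecordingSidecars`, `sidecarPathsFor`, and `shouldAutoTranscribe` are hypothetical.

```typescript
interface RecordingSidecars {
  video: string;
  cursorTelemetry: string;
  transcriptionAudio: string;
}

// Derive sidecar paths from the video path.
// ASSUMPTION: the exact naming scheme is inferred from the suffixes in the
// data flow above and may not match the repository.
function sidecarPathsFor(videoPath: string): RecordingSidecars {
  const stem = videoPath.replace(/\.webm$/i, "");
  return {
    video: videoPath,
    cursorTelemetry: `${videoPath}.cursor.json`,
    transcriptionAudio: `${stem}.transcription.wav`,
  };
}

// Auto-transcription only runs when the audio sidecar is present.
// The existence check is injected so the helper stays free of file-system calls.
function shouldAutoTranscribe(
  videoPath: string,
  exists: (path: string) => boolean,
): boolean {
  return exists(sidecarPathsFor(videoPath).transcriptionAudio);
}
```

In the main process, `exists` could be backed by Node's `fs.existsSync`; in tests it can be a plain lookup.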

For detailed architecture and code module locations, see CLAUDE.md.

Providers And Transcription

AI providers

OpenAI
Ollama

Transcription backends

transcript import
OpenAI transcription
macOS-native transcription path

The app keeps transcription and AI analysis as separate concerns. You can use:

no AI + imported transcript
OpenAI transcription + Ollama analysis
local-only analysis without any provider

Security

provider secrets are kept out of renderer code
AI config is stored in the Electron main process
project files do not store API keys
the app remains usable without cloud AI
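
A common way to keep secrets out of renderer code is to hand the renderer a redacted view of the configuration. The sketch below illustrates that idea only; `StoredAiConfig`, `RendererAiConfig`, and `toRendererView` are hypothetical names, and the repository's actual storage mechanism is not shown here.

```typescript
type AiProvider = "openai" | "ollama";

// Full configuration: in this sketch it would live only in the main process.
interface StoredAiConfig {
  provider: AiProvider;
  model: string;
  apiKey?: string;
}

// What the renderer is allowed to see: no secret, only whether one is set.
interface RendererAiConfig {
  provider: AiProvider;
  model: string;
  hasApiKey: boolean;
}

// Build the redacted view field by field so new secret fields added to
// StoredAiConfig are never copied across by accident.
function toRendererView(config: StoredAiConfig): RendererAiConfig {
  return {
    provider: config.provider,
    model: config.model,
    hasApiKey: Boolean(config.apiKey),
  };
}
```

In an Electron app, a function like this would typically sit behind an `ipcMain.handle` handler, with the key itself encrypted at rest (for example via Electron's `safeStorage` module).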

Local Development

npm install
npm run dev

Useful checks:

npm test
npx tsc --noEmit
npx vite build

Packaging:

npm run build:mac

Current Product Truth

This repo intentionally separates:

local Smart Screen heuristics
optional model-driven AI refinement

Not every “smart” feature is AI. The local pipeline is still valuable on its own, and the AI layer is additive rather than mandatory.

Attribution

Maintained by SaiVaraprasad Medapati

Repository: varaprasadreddy9676/smart-screen
Forked from: siddharthvaddem/openscreen
License: MIT
