Fara Browser Automation Agent

A local browser automation agent based on Microsoft Fara-7B model optimized for LM Studio inference.

Run browser automation locally on a consumer-grade GPU with a variation of quantized models.

Features

✅ 100% local AI browser agent
✅ Quantized models support
✅ Completely self-contained (no external dependencies)
✅ Optimized for LM Studio
✅ Browser automation via Playwright

Setup

1. Install Dependencies

pip install -r requirements.txt
playwright install firefox

2. Setup LM Studio

Download and install LM Studio
Download the Fara-7B model (GGUF format):
- Search for: microsoft_fara-7b
- Recommended: Q5_K_M quantization (6GB)
Load the model in LM Studio
Start the local server (default port: 1234)
In model settings:
- Context Length: 8192+
- Temperature: 0.0
- Top P: 0.9

3. Run the Agent

python run_agent.py --task "Go to wikipedia.org and search for cats" --headful

Optional debug flags (enabled by default in headful mode):

headful: displays a browser window
show_overlay: bottom-right HUD with latest model responses (hidden during screenshots)
show_click_markers: transient markers for clicks/hover/type coordinates (hidden during screenshots)

Configuration

Edit config.json to change:

Model endpoint (default: http://localhost:1234/v1)
Model name
Max rounds
Screenshot settings
Max images to keep in context (max_n_images, default 1)
Downloads folder for saving files
Debug overlay and click markers (show_overlay, show_click_markers)

How It Works

Browser Control: Uses Playwright to control Firefox
Vision: Takes screenshots and sends them to the model
Actions: Model returns tool calls (click, type, scroll, navigate, hover, keypress, wait, memorize facts)
Single-Image Mode: Only sends the latest screenshot to LM Studio (better compatibility)
Loop Guard: Tracks scroll position and warns the model when it oscillates up/down

Limitations

Quantized models have reduced capabilities vs full model
LM Studio has issues with multiple images in the conversation history
Some complex tasks may cause loops (scrolling, navigation)

Troubleshooting

Browser not visible?

Make sure you're using --headful flag

Model not responding?

Check LM Studio server is running on port 1234
Verify model is loaded in LM Studio

Agent looping?

Try reducing the temperature in LM Studio to 0.0
Reduce max_rounds in config.json

License

MIT License - Based on Microsoft Fara-7B

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
agent.py		agent.py
browser.py		browser.py
config.json		config.json
message_types.py		message_types.py
prompts.py		prompts.py
requirements.txt		requirements.txt
run_agent.py		run_agent.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Fara Browser Automation Agent

Features

Setup

1. Install Dependencies

2. Setup LM Studio

3. Run the Agent

Configuration

How It Works

Limitations

Troubleshooting

License

About

Uh oh!

Languages

License

pmbstyle/fara-agent

Folders and files

Latest commit

History

Repository files navigation

Fara Browser Automation Agent

Features

Setup

1. Install Dependencies

2. Setup LM Studio

3. Run the Agent

Configuration

How It Works

Limitations

Troubleshooting

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages