BookmarkMaster

A Python tool that helps organize and optimize your browser bookmarks by analyzing their content and suggesting an improved structure.

Features

Analyzes bookmark content using TF-IDF and clustering
Preserves original folder structure while optimizing organization
Moves frequently used bookmarks to the Bookmarks Bar
Generates browser-compatible HTML output
Supports nested folder structures
Configurable parameters for fine-tuning the optimization process

Setup

Clone the repository:

git clone https://github.com/geochum/bookmark-organizer.git
cd bookmark-organizer

Create the required directory structure:
```
mkdir -p data/input data/output
```
Add your bookmarks file to data/input/
- Export your bookmarks from your browser as HTML
- Place the exported file in the data/input/ directory
- Update the INPUT_FILE path in config.py if your file has a different name
Install required Python packages:
```
pip install -r requirements.txt
```

Configuration

The tool's behavior can be customized by modifying config.py. Key configuration options include:

File paths for input and output files
TF-IDF vectorizer parameters for text analysis
Clustering parameters for bookmark grouping
Domain categories for fallback clustering
Thresholds for folder path and word frequency
Frequently used tool domains and class-related keywords

Example configuration:

# TF-IDF parameters
TFIDF_PARAMS = {
    'stop_words': 'english',
    'max_features': 1000,
    'token_pattern': r'(?u)\b[a-zA-Z][a-zA-Z]+\b',
    'ngram_range': (1, 2),
    'min_df': 2,
    'max_df': 0.8
}

# Clustering parameters
N_CLUSTERS = 10
CLUSTERING_METRIC = 'euclidean'
CLUSTERING_LINKAGE = 'ward'

Usage

Run the main script:
```
python main.py
```
The script will:
- Process your bookmarks from the input file
- Generate optimized organization
- Save the results in data/output/organized_bookmarks.json
- Generate a browser-compatible HTML file in data/output/organized_bookmarks.html
Import the generated HTML file back into your browser to apply the new organization

Project Structure

bookmark-organizer/
├── config.py              # Configuration parameters
├── data/
│   ├── input/            # Place your input bookmarks file here
│   └── output/           # Generated files will be saved here
├── src/
│   ├── extraction/       # Bookmark extraction from HTML
│   │   └── extract_bookmarks.py
│   ├── optimization/     # Core optimization logic
│   │   └── optimize_bookmarks.py
│   └── generation/       # HTML generation
│       └── generate_html.py
├── main.py              # Main entry point
├── requirements.txt     # Python dependencies
└── README.md           # This file

Dependencies

Python 3.x
scikit-learn
numpy
pandas
beautifulsoup4
lxml

License

MIT License

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BookmarkMaster

Features

Setup

Configuration

Usage

Project Structure

Dependencies

License

Contributing

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
src		src
.gitignore		.gitignore
README.md		README.md
config.py		config.py
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

BookmarkMaster

Features

Setup

Configuration

Usage

Project Structure

Dependencies

License

Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages