Contributing to SciGo

Thank you for your interest in contributing to SciGo! We welcome contributions from everyone, whether you're fixing a typo, adding a test, implementing a new algorithm, or improving documentation.

🚀 Quick Start (5 minutes to your first contribution!)

Prerequisites

Go 1.21 or later
Git
Make (optional but recommended)

Your First Contribution

Set up your development environment:

git clone https://github.com/YuminosukeSato/scigo.git
cd scigo
make setup-dev  # Installs all tools and dependencies

Find an issue to work on:
- Look for issues labeled good first issue
- Or fix a typo in documentation
- Or add a missing test

Create your branch:

git checkout -b fix/issue-description
# or
git checkout -b feature/new-algorithm

Make your changes and test:

make test       # Run tests
make lint-full  # Check code style

Submit a Pull Request:
- Push your branch to your fork
- Open a PR with a clear description
- Wait for review and address feedback

That's it! 🎉

📖 Developer's Guide

Project Philosophy

SciGo aims to provide a high-performance, production-ready machine learning library for Go with scikit-learn compatible APIs. We prioritize:

API Compatibility: Following scikit-learn's proven interface patterns
Performance: Leveraging Go's concurrency and efficiency
Reliability: Comprehensive testing and error handling
Simplicity: Clear, idiomatic Go code

Project Structure

scigo/
├── core/           # Core abstractions and utilities
│   ├── model/      # Base estimator and interfaces
│   ├── tensor/     # Tensor operations
│   └── parallel/   # Parallel processing utilities
├── linear/         # Linear models (regression, classification)
├── preprocessing/  # Data preprocessing (scalers, encoders)
├── metrics/        # Evaluation metrics
├── sklearn/        # Advanced scikit-learn compatible models
├── pkg/           # Shared packages
│   ├── errors/    # Error handling utilities
│   └── log/       # Structured logging
└── examples/      # Usage examples

Coding Standards

Go Style

Format your code:

make fmt        # or: go fmt ./...
goimports -w .  # Organize imports

Follow Go conventions:
- Use camelCase for unexported identifiers
- Use PascalCase for exported identifiers
- Keep line length under 100 characters when possible
- Write clear, concise comments

Run linters:

make lint-full  # Runs comprehensive linting

Machine Learning API Conventions

All ML models in SciGo follow the scikit-learn estimator pattern:

// Basic Estimator Pattern
type Estimator interface {
    Fit(X, y mat.Matrix) error
    Predict(X mat.Matrix) (mat.Matrix, error)
    Score(X, y mat.Matrix) (float64, error)
}

// Transformer Pattern
type Transformer interface {
    Fit(X mat.Matrix) error
    Transform(X mat.Matrix) (mat.Matrix, error)
    FitTransform(X mat.Matrix) (mat.Matrix, error)
}

Key Principles:

State Management: Use BaseEstimator for consistent fitted state tracking

type MyModel struct {
    model.BaseEstimator
    // model-specific fields
}

Error Handling: Use structured errors from pkg/errors

if !m.IsFitted() {
    return nil, errors.NewNotFittedError("MyModel", "Predict")
}

Logging: Use structured logging for ML operations

m.LogInfo("Training started",
    log.OperationKey, log.OperationFit,
    log.SamplesKey, nSamples,
)

Numerical Precision: Always use float64 for numerical computations
Matrix Operations: Use gonum.org/v1/gonum/mat for matrix operations

Testing Strategy

Test Requirements

Coverage: Aim for >80% test coverage for new code
Types: Write unit tests, integration tests, and benchmarks
Naming: Use descriptive test names that explain what is being tested

Writing Tests

Unit Tests: Test individual functions/methods

func TestLinearRegression_Fit(t *testing.T) {
    tests := []struct {
        name    string
        X, y    mat.Matrix
        wantErr bool
    }{
        // test cases
    }
    // test implementation
}

Example Tests: Provide usage examples

func ExampleLinearRegression() {
    // Create and train model
    lr := linear.NewLinearRegression()
    _ = lr.Fit(X, y)
    
    // Output: expected output
}

Benchmarks: Measure performance

func BenchmarkLinearRegression_Fit(b *testing.B) {
    // benchmark implementation
}

Running Tests

make test           # Run all tests
make test-short     # Run short tests only
make coverage       # Generate coverage report
make bench          # Run benchmarks

Documentation

Code Documentation

Every exported type, function, and method must have a godoc comment:

// LinearRegression implements ordinary least squares regression.
//
// The model minimizes the residual sum of squares between observed
// targets and predictions made by linear approximation.
//
// Example:
//   lr := linear.NewLinearRegression()
//   err := lr.Fit(X, y)
//   predictions, err := lr.Predict(X_test)
type LinearRegression struct {
    // ...
}

Package Documentation

Each package should have a doc.go or package comment explaining:

Package purpose
Main types and functions
Usage examples
Related packages

Pull Request Process

Before submitting:
- Ensure all tests pass: make test
- Run linters: make lint-full
- Update documentation if needed
- Add tests for new functionality
- Update CHANGELOG.md if applicable
PR Description should include:
- What problem does this solve?
- How does it solve it?
- Any breaking changes?
- Related issues (use "Fixes #123" to auto-close)
Review process:
- CI must pass (tests, linting, coverage)
- At least one maintainer approval required
- Address review feedback promptly
- Squash commits if requested

Development Workflow

Common Tasks

# Set up development environment
make setup-dev

# Run tests
make test

# Check code coverage
make coverage

# Run linters
make lint-full

# Format code
make fmt

# Run benchmarks
make bench

# Clean build artifacts
make clean

# See all available commands
make help

Adding a New Algorithm

Create the implementation:

// mypackage/algorithm.go
package mypackage

type MyAlgorithm struct {
    model.BaseEstimator
    // fields
}

func (m *MyAlgorithm) Fit(X, y mat.Matrix) error {
    // implementation
    m.SetFitted()
    return nil
}

Add comprehensive tests:

// mypackage/algorithm_test.go
func TestMyAlgorithm_Fit(t *testing.T) {
    // test implementation
}

Add an example:

// mypackage/example_test.go
func ExampleMyAlgorithm() {
    // example usage
}

Update documentation:
- Add package documentation if new package
- Update README.md if significant feature

Error Handling

SciGo uses structured errors for better debugging:

// Use predefined error types
errors.NewNotFittedError("ModelName", "Method")
errors.NewDimensionError("Method", expected, got, axis)
errors.NewValueError("Method", "description")

// Wrap errors with context
fmt.Errorf("failed to train model: %w", err)

// Use panic recovery for public APIs
func (m *MyModel) Fit(X, y mat.Matrix) (err error) {
    defer errors.Recover(&err, "MyModel.Fit")
    // implementation
}

Performance Considerations

Memory Efficiency:
- Reuse allocated memory when possible
- Use in-place operations for large matrices
- Clear references to allow garbage collection
Parallelization:
- Use core/parallel utilities for concurrent operations
- Set appropriate thresholds for parallel vs sequential processing
- Benchmark to verify performance improvements
Numerical Stability:
- Use stable algorithms (e.g., QR decomposition over matrix inversion)
- Check for numerical edge cases (division by zero, overflow)
- Use appropriate epsilon values for floating-point comparisons

🐛 Reporting Issues

Before Creating an Issue

Check if the issue already exists
Try with the latest version
Ensure it's not a usage problem (check examples/documentation)

Creating a Good Issue Report

Include:

Go version and OS
Minimal reproducible example
Expected vs actual behavior
Error messages and stack traces
Relevant logs (use log.SetLevel(log.LevelDebug))

💡 Proposing Features

Check existing issues/PRs for similar proposals
Open a discussion for significant changes
Provide use cases and example API
Consider backward compatibility

📜 Code of Conduct

Be respectful and inclusive
Welcome newcomers and help them get started
Focus on constructive criticism
Respect differing viewpoints and experiences

📄 License

By contributing to SciGo, you agree that your contributions will be licensed under the MIT License.

🙏 Recognition

Contributors are recognized in:

Git history
CONTRIBUTORS.md file
Release notes for significant contributions

📚 Resources

❓ Getting Help

Documentation: Check the README and package documentation
Examples: Look at the examples directory
Issues: Search existing issues or create a new one
Discussions: Join GitHub Discussions for questions and ideas

Thank you for contributing to SciGo! 🚀

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contributing to SciGo

🚀 Quick Start (5 minutes to your first contribution!)

Prerequisites

Your First Contribution

📖 Developer's Guide

Project Philosophy

Project Structure

Coding Standards

Go Style

Machine Learning API Conventions

Testing Strategy

Test Requirements

Writing Tests

Running Tests

Documentation

Code Documentation

Package Documentation

Pull Request Process

Development Workflow

Common Tasks

Adding a New Algorithm

Error Handling

Performance Considerations

🐛 Reporting Issues

Before Creating an Issue

Creating a Good Issue Report

💡 Proposing Features

📜 Code of Conduct

📄 License

🙏 Recognition

📚 Resources

❓ Getting Help

FilesExpand file tree

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Contributing to SciGo

🚀 Quick Start (5 minutes to your first contribution!)

Prerequisites

Your First Contribution

📖 Developer's Guide

Project Philosophy

Project Structure

Coding Standards

Go Style

Machine Learning API Conventions

Testing Strategy

Test Requirements

Writing Tests

Running Tests

Documentation

Code Documentation

Package Documentation

Pull Request Process

Development Workflow

Common Tasks

Adding a New Algorithm

Error Handling

Performance Considerations

🐛 Reporting Issues

Before Creating an Issue

Creating a Good Issue Report

💡 Proposing Features

📜 Code of Conduct

📄 License

🙏 Recognition

📚 Resources

❓ Getting Help