🧩 BSQ - 42 Paris Piscine

Fast C CLI that reads maps from files or stdin, computes the largest empty square (dynamic programming), and prints them back with the largest empty square.

🪄 Highlights

Dynamic growth I/O buffer: reduces the number of read system calls
Unified parsing and solving: single-pass computation combining map validation and dp update
Flat memory layouts: reduce data access time, memory usage, and copy overhead
Single-row array dynamic programming layout: improves memory footprint and L1 cache locality.
In-place editing of the original map: avoids full map copies and checks
One-time output: avoids multiple write system calls
Integrated tests with the make test command
Integrated benchmark (100 iterations) with the make bench command

🚀 Performance

A 10,000×10,000 map is processed in less than 80 ms

Measured on macOS (Apple M4) using <time.h> / clock_gettime()

stdout redirected to /dev/null to eliminate potential shell or terminal I/O bottlenecks

Version	Description	10k×10k (ms)
v1.1.0	Baseline (42 Paris Piscine version)	~37,000 ms
v1.2.0	Output optimization → Added output buffer (`char **`)	~3,800 ms
v1.3.0	Output optimization → Switched output to flat buffer (`char *`)	~3,600 ms
v1.4.0	General optimization → Removed initialization loops	~3,400 ms
v2.0.0	Major refactor → Simplified data structures → Unified parsing and solving → Optimized flat I/O buffers → Flattened map and DP arrays	~320 ms
v2.1.0	Output optimization → In-place map editing (no full copy) → Only updates the required characters inside the map	~250 ms
v2.1.1	QoL update → Integrated benchmark mode (10 iterations)	~200 ms
v2.2.0	Input optimization → Reworked `str_grow()` to dereference pointer once before the loop → Reduced redundant memory accesses during buffer reallocation	~190 ms
v2.2.1	Input optimization → Switched to native C types during file read operations → Implemented in-place reading to remove buffer duplication and reduce latency	~180 ms
v2.3.0	Parse optimization → Optimized DP minimum computation to reduce branch-misses	~140 ms
v2.4.0	Parse optimization → Reordered parser condition checks to reduce branch mispredictions → Implemented precomputation of all possible values → Minimized dereferencing in hot loops → Increased integrated benchmark from 10 to 100 iterations	~100 ms
v2.5.0	Code cleanup & build optimization → Removed unused fields, return values and redundant casts → Inlined hot functions → Added `-fomit-frame-pointer` and `-fno-stack-protector` flags → Introduced optional PGO build (`make sfast`)	~100 ms
v3.0.0	*Code cleanup, tests implementation, bug fixes, and branchless* comparison investigation** → Added `make test` and `make bench` commands → Fixed multiple issues → See CHANGELOG.md for more details	~100 ms
v3.1.0	Parse optimization → Implemented `parse_col_0()` to speed up parsing and solving of the first col of each row	~87 ms
v3.2.0	Parse optimization → Implemented single-row array `dp` for faster updates	~77 ms

🎓 Context

The BSQ (Biggest Square) is the final algorithmic project of the 42 Paris Piscine.
Its goal is to parse a text-based map - from file(s) or stdin - and compute the largest possible empty square, following the official 42 C Norm v4.

This project is a deep dive into:

Dynamic programming for 2D optimization problems
Execution time optimization using low-level CPU profiling and micro-optimization techniques
Memory management and I/O efficiency
Strict compliance with the 42 Norm

⚙️ Objective

Read maps from one or more files, or directly from stdin
Detect invalid or corrupted maps (missing lines, inconsistent width, invalid characters, etc.)
Efficiently compute the largest empty square, even on large maps (e.g. 10k×10k)
Output the map with the square marked using the filled character

🧩 Algorithm

The program implements a dynamic programming approach:

Each cell represents the size of the largest square ending at that point.
The recurrence relation:

if (map[row][col] == EMPTY)
	dp[row][col] = 1 + min(dp[row-1][col], dp[row][col-1], dp[row-1][col-1]);
else
	dp[row][col] = 0;

The largest value found indicates the size and position of the biggest square.

For performance reasons, the first row and the first column of each row are computed by dedicated functions: parse_row_0() and parse_col_0().

Additionally, the dp is implemented using a single-row array and a temporary value (representing the up-left cell: dp[row-1][col-1]).

The 2D relation above is shown for clarity, as the actual implementation uses a 1D array for better memory locality.

For experimental branchless versions of the DP computation, see branchless_comparison.md.

This file documents the bitmask-based and XOR-based approaches I tested to minimize branch mispredictions, and explains why it didn't produce the expected results.

🗂️ Repository structure

bsq/
├── README.md				# This README file
├── bsq.subject.en.pdf		# Official 42 subject (English)
├── Makefile				# Build rules and compiler settings
├── includes/				# Header files with type definitions and prototypes
├── srcs/					# Source files (C code)
│   ├── main.c				# Entry point
│   ├── objects/			# Constructor and destructor functions for custom structs
│   ├── utils/				# Utilities
│   ├── read.c				# Reads content from filepath/stdin
│   ├── parse_rules.c		# Parses rules
│   ├── parse_map.c			# Simultaneously parses and solves the map
│   └── result.c			# Prints result
└── tests/					# Sample maps and performance benchmarks

🧰 Build and Run

Compilation

# Compile the standard binary
make

# Compile the optimized (fast) binary
make fast

# Compile, run the binary to collect profiling data, and then recompile it with PGO optimizations
make sfast

Run with one or more files

./bsq file_path_1 file_path_2 ... file_path_n

Run with stdin

⚠️ This feature has been removed in v2.0.0. Coming back soon...

cat tests/basic_test | ./bsq

Run tests

make test

Run automatic benchmark

Automatically runs 100 iterations with 10kx10k map as input
Displays average timings to stderr

stdout is redirected to /dev/null to eliminate potential shell or terminal I/O bottlenecks

make bench

Clean build files

# Only clean objects
make clean

# Clean objects and binary
make fclean

“Think out of the square — then fill it.” 🧠

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

🧩 BSQ - 42 Paris Piscine

🪄 Highlights

🚀 Performance

🎓 Context

⚙️ Objective

🧩 Algorithm

🗂️ Repository structure

🧰 Build and Run

Compilation

Run with one or more files

Run with stdin

Run tests

Run automatic benchmark

Clean build files

About

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 202 Commits
includes		includes
srcs		srcs
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Makefile		Makefile
README.md		README.md
branchless_comparison.md		branchless_comparison.md
bsq.subject.en.pdf		bsq.subject.en.pdf

Uh oh!

Uh oh!

guillaumeast/42_bsq

Folders and files

Latest commit

History

Repository files navigation

🧩 BSQ - 42 Paris Piscine

🪄 Highlights

🚀 Performance

🎓 Context

⚙️ Objective

🧩 Algorithm

🗂️ Repository structure

🧰 Build and Run

Compilation

Run with one or more files

Run with stdin

Run tests

Run automatic benchmark

Clean build files

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages