Updates main README

adomasbaliuka · adomasbaliuka · commit 6e062731f01d · 2024-05-28T10:45:10.000+02:00
diff --git a/README.md b/README.md
@@ -6,9 +6,13 @@
 
 ## Overview
 
-This repository aims to solve the following problem. Suppose Alice and Bob each have a bit-string of length _N_.
-Bobs bit-string is the result of randomly flipping each bit of Alice with probability _p_, i.e., it is the output of a binary symmetric channel with **channel parameter** _p_. 
-(Note: the same considerations apply for soft inputs instead of bit-strings and a Gaussian channel.) The goal is for Alice to send (via a noise-less channel) to Bob a message (called syndrome), such that Bob can recover Alice's bit-string given his bit-string and the syndrome received from Alice. There are two important metrics, which we want to minimize: 
+This repository aims to solve the following problem: Suppose Alice and Bob each have a bit-string of length _N_.
+Bobs bit-string is the result of randomly flipping each bit of Alice with probability _p_.
+I.e., it is the output of a [binary symmetric channel](https://en.wikipedia.org/wiki/Binary_symmetric_channel) with **channel parameter** _p_. 
+(Note: similar considerations apply for soft inputs instead of bit-strings and a [Gaussian channel](https://en.wikipedia.org/wiki/Additive_white_Gaussian_noise).)
+The goal is for Alice to send (via a noise-less channel) to Bob a message (called syndrome), such that Bob can recover Alice's bit-string given his bit-string and the syndrome received from Alice.
+
+There are two important metrics, which we want to minimize: 
 
 - the probability that Bob will fail to recover Alice's bit-string, which is called the frame error rate (FER)
 - the length of the syndrome
@@ -22,17 +26,18 @@ Limits for finite block sizes also exist but are more complicated).
 
 ### Contrast with forward error correction
 
-Contrary to how forward error correction works, generator matrices for the LDPC code are not used at all for distributed source coding. 
-This repository does not provide generator matrices (calculating them from the parity check matrices is straightforward).
+Contrary to how forward error correction works, distributed source coding does not use generator matrices for the LDPC code. 
+This repository does not provide generator matrices (though calculating them from the parity check matrices is straightforward).
+
 Our decoder implementation operates on a bit-string (noisy version of true message) and its correct syndrome. 
-This is slightly different from what is used for forward error correction (as in e.g. [AFF3CT](https://github.com/aff3ct/aff3ct)), where the decoder operates on only the noisy codeword. 
-The noisy codeword is the result of transmitting the codeword (true message encoded using a generator matrix) via a noisy channel.
+This is different from what is used for forward error correction (as in e.g. [AFF3CT](https://github.com/aff3ct/aff3ct)), where the decoder operates on only the noisy codeword.
+(The noisy codeword is the result of transmitting the codeword (true message encoded using a generator matrix) via a noisy channel.)
 
 
 ## How to use
 In order to use all the functionality provided in this repository, install
 - CMake (at least version 3.19)
-- C++ compiler (supporting C++17)
+- C++ compiler (supporting C++20; parts of the project also work with only C++17)
 - Julia (at least version 1.6)
 
 For how to use the provided Julia code, see directory `codes`. No familiarity with the Julia programming language is required.
@@ -48,7 +53,9 @@ All executables will be built inside the `build` folder.
 
 For a demo of how to use the C++ header-only library, see the `examples` directory, which shows a basic example ("demo_error_correction") of how to use the C++ header containing the decoder.
 This example is built by CMake (executable `build/examples/demo_error_correction`.
-Note: the executable produces no output; look at the C++ source code to see how the header can be used).
+Note: the executable produces no output; look at the C++ source code to see how the header can be used.
+
+For more advanced examples, looking at the unit tests may be helpful.
 
 ## How to contribute
 This repository is actively maintained. 
@@ -59,29 +66,49 @@ Let us know if you're having problems with the provided materials, wish to contr
 ## List of contents
 
 ### Data files
-- A number of LDPC codes (a list and the actual LDPC matrices are in the folder `codes`). 
+- A number of LDPC codes (a list and the actual LDPC matrices are in the folder `codes`).
   Their parity check matrices are stored in a custom file format (called `qccsc.json`).
+
 - Simulations results done using [AFF3CT](https://github.com/aff3ct/aff3ct), showing FER of the LDPC matrices at various channel parameters.
+
 - Simulation results done using the decoder in this repository, showing FER of LDPC matrices, their rate adapted versions, and average rate under rate adaption (Work in progress!).
+
 - For each LDPC matrix, a specification of rate adaption. 
   This is a list of pairs of row indices of the matrix that are combined (added mod 2) in each rate adaption step.
 
 ### Julia code
 - Contained in the folder `codes`.
-- Enables loading the LDPC matrices from `CSCMAT` files (using our custom Julia library ``)
-- Enables saving the compressed sparse column (CSC) representation and also exporting to other standard formats, such as `alist`.
+
+- Enables loading the LDPC matrices from files.
+  This uses our custom Julia library [LDPCStorage.jl](https://github.com/XQP-Munich/LDPCStorage.jl)) (installed automatically by the Julia package manager).
+
+- Enables saving the compressed sparse column (CSC) representation (both binary and quasi-cyclic).
+  Also supports exporting to other standard formats, such as `alist`.
 
 ### C++ code
-- Basic LDPC decoder using belief propagation (BP). Contained in a single header file (`src/rate_adaptive_code.hpp`) and easy to use. Can perform syndrome computation as well. 
+- Basic LDPC decoder using belief propagation (BP).
+  Contained in a single header file (`src/rate_adaptive_code.hpp`) and easy to use.
+  Can perform syndrome computation as well. 
   The LDPC code can be embedded into the executable or loaded from a file at program runtime.
-- Utility functions for reading LDPC matrices (from `.cscmat` files) and rate adaption (from `.csv` files). These functions are contained in `src/read_cscmat_format.hpp`.
-  Note that these functions require the fully explicit binary form of the LDPC matrix. The LDPC codes given inside `codes/ldpc` use an even more memory-efficient storage 
-- For applications that only require syndrome computation but no decoding, we provide a separate implementation for multiplication of a sparse binary matrix and a dense binary vector (LDPC syndrome computation). This is also contained in a single header file (`src/encoder.hpp`). 
-  This is a very specific application, which you probably don't care about initially.
-- A LDPC matrix can be stored within the executable. 
-  It is included as a header file defining constant data (for example `tests/fortest_autogen_ldpc_matrix_csc.hpp`). 
-  Julia code (see folder `codes`) is provided to generate such a C++ header file for any LDPC matrix.
 
+- Utility functions for reading LDPC matrices (from `.cscmat` files) and rate adaption (from `.csv` files).
+  These functions are contained in `src/read_cscmat_format.hpp`.
+  Note that these functions require the fully explicit binary form of the LDPC matrix.
+  The LDPC codes given inside `codes/ldpc` use an even more memory-efficient storage.
+
+- LDPC matrices can be stored within the executable.
+  There are two ways:
+  - include as a header file defining constant data (for example `tests/fortest_autogen_ldpc_matrix_csc.hpp`).
+    Julia code (see folder `codes`) is provided to generate such a C++ header file for any LDPC matrix.
+  - C++ headers storing QC-LDPC matrices in terms of their quasi-cyclic exponents.
+    This is very efficient in terms of binary size.
+    See `src/autogen_ldpc_QC.hpp` (partially auto-generated using the Julia code).
+    Note: this part is new and may still change significantly in future versions.
+    It also requires C++20 and `src/encoder_advanced.hpp`.
+
+- For applications that only require syndrome computation but no decoding, we provide a separate implementation for multiplication of a sparse binary matrix and a dense binary vector (LDPC syndrome computation).
+  See `src/encoder.hpp` (old) or `src/encoder_advanced.hpp` (new).
+  **This is a very specific application, which you probably don't care about initially.**
 
 ## Planned features and improvements
 
@@ -90,23 +117,18 @@ If you need some feature for your applications, let us know, e.g. by creating an
 - [ ] Reports and benchmarks on LDPC code quality and decoding performance
   + [x] Simulation programs with command line interfaces
   + [x] Frame error rate simulations for rate adapted codes (including special case of no rate adaption)
-  + [x] Critical rate (codeword-averaged minimum leak rate for successful decoding) computation for rate adapted codes  
-  + [ ] Automatic running of simulations
-  + [ ] Automatic reports with plots
-  + [ ] Simulations on multiple threads and/or multiprocessing
+  + [x] Critical rate (codeword-averaged minimum leak rate for successful decoding) computation for rate adapted codes
+  + [ ] Automatic performance reports with code that generates plots
 - [ ] LDPC codes
   + [x] 3 LDPC codes each (different block sizes) for leak rates 1/2 and 1/3
-  + [ ] More sizes, more rates
-  + [ ] Very high leak rate codes (for very low signal-to-noise ratios)
+  + [ ] More sizes, more rates (coming soon!)
 - [ ] Decoding and decoding algorithms
   + [x] Basic belief propagation (BP) decoder for Slepian-Wolf setting
   + [ ] Decoder performance improvements (look at [AFF3CT](https://github.com/aff3ct/aff3ct) for inspiration), plausibly achieve 2x runtime speedup at same decoding accuracy
   + [ ] Decoding on GPU
   + [x] Encoder that can be used separately from encoder (e.g. for embedded applications)
-  + [ ] Using compressed sparse row format may have advantages over the current compressed sparse column (CSC) format. This needs to be tested.
-  + [ ] Save memory by storing QC-exponents of structured codes, rather than CSC storage
-  + [ ] Encoder speedup may be possible using QC-exponents by densely packed bits and using bit-shifts (not a priority, as syndrome computation, i.e., sparse matrix-vector multiplication, is fast anyway)
-  
+  + [x] Save memory by storing QC-exponents of structured codes, rather than CSC storage
+  + [x] Encoder that directly uses CSC-storage of QC-exponents instead of expanding to CSC storage of binary matrix
 
 ## Attributions