5x5words-asm

An avx512 implementation of the "find 5 5-letter words with all different letters" puzzle. I upload purely for educational reading, and make no guarantee that it still works as it has been a while since I was working on it actively, as I may have left a change half-done somewhere, and I have made untested changes in the last few hours before uploading too, that may or may not have fixed the bug noted below.

These files are a code::blocks project for the 5x5words challenge largely made in nasm assembly, but to build it correctly code::blocks requires some considerable setup to know what to do with nasm files, so I'll breifly explain how to build without an IDE (see Build).

The assembly is written for 64-bit linux execution environment, (mostly) respecting the 64-bit linux C calling convention, although it takes a lot of liberties with the stack. The code itself is OS agnostic though, it just works with memory, so should be modifiable for use on windows pretty easily. The main difference will be that windows has different expectations about register assignments for parameters and what a called function must preserve, but there may be additional issues with the unrestrained use I make of stack space (freely growing worklists etc.)

Build

To build the nasm files separately at the command line use:

nasm -p nasm_defaults.inc file.nasm

On each of the nasm files. this should produce file.o as a PIE elf64. (the nasm_defaults.inc sets up some macros that let me type {k1z} instead of {k1}{z} for avx512 zeroing-mask predicates, and default the data references to PC-relative addressing for relocatable code.)

Build newreport.c with gcc or clang, also to .o file

Then build the c main file (newfind5x5word.c) with gcc or clang and link the nasm generated object files.

Run

When run in a terminal it expects to find the words_alpha.txt file in the execution directory. It should output the results, and timings for the various sections of the process.

There are still some flaws in the code: apart from a couple of sub-optimal memory locality optimisations yet to be done, for some reason although it works perfectly the first time, if the search is re-run in the same execution, then somewhere dirty memory causes it to miss-read things and fail to find all the solutions. I have yet to work out why, since everything is being re-sourced from the original file and overwritten each time. I can only think there's a memory corruption bug somewhere that I have yet to find.

Comments and Notes in the code

When looking through the code you'll often find notes that simply say something like zmm13=... with some hex after it. this was a note to myself on a constant to be loaded into the register at the eginning of the function. The one thing that my not be obvious is that at the end there will e something like (w)(r2l) or (dw)(l2r). w (word) and dw (dword) should be self explanititory but r2l or l2r refer to the memory byte order. it is normal to only list data elements left to right since that is the direction our language flows across the page. But our numbers flow right to left, since the Arabic numerals our math is based on, like the Arabic language they came from, flows right to left. In many cases, especially where different element sizes are mixed in a series of operations I find it it much simpler to work with everything flowing right-to-left so that the order of digits remains the same whether they are to be interpreted as 64 bytes, 32 words, 16 dwords or 8 qwords.

In addition to the extensive line by line notes in the executable code there's also a few other notes files including a pseudocode version of the algorithm where I was trying to imagine how a high-level language might represent the processes written in assembly as a more general set of bulk data handling data types and functions. I guess it's kinda C++-like in its way of doing things but C++ is one language I can say for certain cannot stretch to handle it. Maybe rust's metaprogramming would make it possible to handle all the different methods data types and cases in the most appropriate way, but I wouldn't be confident of anything short of a fully new language designed around making this kind of thing more accessible being able to handle it optimally.

code::blocks setup

code::blocks needs 2 main settings changes to be able to edit and build the project:

first is in the settings menu -> compile... -> other settings tab -> advanced options button. Declair that you really know what you are doing, then add a new source file entry for .nasm files and paste a macro for compiling with nasm. I use:

nasm $file -felf64 -o $object -p ~/path/to/nasm_defaults.inc -g

the second is to update and add an entry for a lexer to read nasm assembly with avx512 instructions and registers (the supplied assembly lexers only include up to SSE2 instructions and registers). I've included my custom lexer definition here as lexer_nasm.xml, and lexer_nasm.sample. Installing is as simple as copying to the correct directory in the code::blocks installation, but if, like me, you installed with flatpak it might be tricky to find. mine is at:

'/var/lib/flatpak/app/org.codeblocks.codeblocks/x86_64/stable/9149915c0623727962f7ea992dd690e5c0b9908f65f1e6fd9d5e0befdfa0a0ce/files/share/codeblocks/lexers'

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
5x5 convert cullduplicates.nasm		5x5 convert cullduplicates.nasm
5x5 countRankConvert v4.nasm		5x5 countRankConvert v4.nasm
5x5 expandsubgroups newer.nasm		5x5 expandsubgroups newer.nasm
5x5 expandsubgroups newerer.nasm		5x5 expandsubgroups newerer.nasm
5x5 find5words chunks.nasm		5x5 find5words chunks.nasm
5x5 find5words new.nasm		5x5 find5words new.nasm
5x5_new.cbp		5x5_new.cbp
5x5_new.depend		5x5_new.depend
5x5_new.layout		5x5_new.layout
README.md		README.md
expandSubgroups neglength.nasm		expandSubgroups neglength.nasm
hashWords_phf_cfd.nasm		hashWords_phf_cfd.nasm
lexer_nasm.sample		lexer_nasm.sample
lexer_nasm.xml		lexer_nasm.xml
linearSearch splitHistory.nasm		linearSearch splitHistory.nasm
nasm_defaults.inc		nasm_defaults.inc
newfind5x5words.c		newfind5x5words.c
newreport.c		newreport.c
pseudocode		pseudocode
sortPop2-Lpop.nasm		sortPop2-Lpop.nasm
words_alpha.txt		words_alpha.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

5x5words-asm

Build

Run

Comments and Notes in the code

code::blocks setup

About

Uh oh!

Releases

Packages

Languages

IAmAThousandTrees/5x5words-asm

Folders and files

Latest commit

History

Repository files navigation

5x5words-asm

Build

Run

Comments and Notes in the code

code::blocks setup

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages