GitHub - csimoes1/LexLLM: LexLLM project to train a base model to interview like Lex Fridman

LexLLM project to train a base model to interview like Lex Fridman

__pycache__
modelfiles
transcripts_jsonl
.DS_Store
.gitattributes
Lex Fridman Podcast - Episodes ALL.html
LexFineTuneAWS.py
LexFineTuneMacBook.py
LexLLM.iml
LexScrape.py
LexTranscriptProcessor.py
LexTranscriptProcessor2.py
LongestLine.py
ModelMerge.py
README
Sandbox.py
TokenCounter.py
initialPrompt.txt
linkedin_scrape.py
sagemaker_test_llama.py
sagemaker_train_llama.py


# To create a model file from the output of LexFineTune
To run the LoRA-trained model in Ollama:
1.) Run ModelMerge.py
2.) Convert the model directory to an unquantized (f16) GGUF file with llama.cpp's convert_hf_to_gguf.py.
    With the macOS paths:
    python ~/Projects/llama/convertToGGUF/convertToGGUF/llama.cpp/convert_hf_to_gguf.py /Users/csimoes/Projects/llama/final_Mar052024 --outfile lex_llama_unquantized.gguf --verbose --outtype f16
    With the Ubuntu paths:
    python /home/ubuntu/projects/llama.cpp/convert_hf_to_gguf.py ~/projects/LexLLM/lex_lora_results_20250307_162307/checkpoint_epoch_5 --outfile lex_llama_unquantized.gguf --verbose --outtype f16
2b.) [Optional] Test the GGUF file manually. llama-cli is a compiled binary, so run it directly rather than through python:
    ~/Projects/llama/convertToGGUF/convertToGGUF/llama.cpp/llama-cli -m lex_llama_unquantized.gguf -p "Tell me about AI."
2c.) [Optional] Quantize to reduce memory needs (the binary is named quantize in older llama.cpp builds):
    llama.cpp/llama-quantize lex_llama_unquantized.gguf lex_llama_q4_k_m.gguf Q4_K_M

3.) Create the Ollama model:
    ollama create [model name] -f [modelfile]
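
The conversion steps above could be scripted roughly as follows. This is only a sketch, not part of the repository: `build_commands`, `modelfile_text`, and their parameters are hypothetical names, the `llama-quantize` default assumes a recent llama.cpp build, and the one-line Modelfile is the smallest form Ollama accepts (the files under modelfiles may well contain more, such as a system prompt or template).

```python
from pathlib import Path


def build_commands(llama_cpp_dir, model_dir, model_name,
                   modelfile="Modelfile", quantize_bin="llama-quantize"):
    """Return steps 2, 2c and 3 as argument lists. Nothing is executed here."""
    llama_cpp = Path(llama_cpp_dir)
    f16_gguf = "lex_llama_unquantized.gguf"   # output name used in step 2
    q4_gguf = "lex_llama_q4_k_m.gguf"         # output name used in step 2c

    # Step 2: convert the Hugging Face model directory to an f16 GGUF file.
    convert = ["python", str(llama_cpp / "convert_hf_to_gguf.py"), str(model_dir),
               "--outfile", f16_gguf, "--verbose", "--outtype", "f16"]
    # Step 2c (optional): quantize to Q4_K_M to reduce memory needs.
    quantize = [str(llama_cpp / quantize_bin), f16_gguf, q4_gguf, "Q4_K_M"]
    # Step 3: register the model with Ollama.
    create = ["ollama", "create", model_name, "-f", modelfile]
    return [convert, quantize, create]


def modelfile_text(gguf_path):
    """Minimal Modelfile body: just point Ollama at the GGUF file (assumption)."""
    return f"FROM {gguf_path}\n"


if __name__ == "__main__":
    import shlex
    # Hypothetical paths and model name, printed for inspection only.
    for cmd in build_commands("llama.cpp", "merged_model", "lex"):
        print(shlex.join(cmd))
    print(modelfile_text("./lex_llama_q4_k_m.gguf"), end="")
```

To actually run the steps, each returned list can be passed to `subprocess.run(cmd, check=True)` so that a failed conversion stops the sequence before `ollama create` is reached.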


# To watch performance of the AWS machine
nvidia-smi  # Check GPU usage
htop        # Check CPU usage (should see 16 threads)

# NVIDIA GPU usage, sampled every 5 seconds:
nvidia-smi --query-gpu=utilization.gpu,memory.used,memory.total --format=csv -l 5
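
If the GPU numbers need to be logged rather than watched, the query output can be parsed in Python. The sketch below assumes the command is run with `--format=csv,noheader,nounits`, so each line is three plain numbers (utilization in percent, memory in MiB). `parse_gpu_csv` is a hypothetical helper and the sample text is made up for illustration, not captured from a real run.

```python
import csv
import io


def parse_gpu_csv(text):
    """Parse lines of 'utilization.gpu, memory.used, memory.total' into dicts.

    Assumes numeric fields only (noheader,nounits); a value such as [N/A]
    would raise ValueError and is not handled in this sketch.
    """
    rows = []
    for fields in csv.reader(io.StringIO(text), skipinitialspace=True):
        if len(fields) != 3:
            continue  # skip blank or unexpected lines
        util, used, total = (float(f) for f in fields)
        rows.append({
            "gpu_util_pct": util,
            "mem_used_mib": used,
            "mem_total_mib": total,
            "mem_used_pct": round(100.0 * used / total, 1),
        })
    return rows


if __name__ == "__main__":
    # Illustrative sample only: one line per GPU.
    sample = "87, 20480, 24576\n12, 1024, 24576\n"
    for row in parse_gpu_csv(sample):
        print(row)
```

On a real machine the text would come from something like `subprocess.run([...], capture_output=True, text=True).stdout`, with the same `--query-gpu` fields as the command above.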