Skip to content

Commit 3783d15

Browse files
committed
add paligemma to projects
1 parent caabce1 commit 3783d15

File tree

7 files changed

+57
-8
lines changed

7 files changed

+57
-8
lines changed

Diff for: config.yaml

+15
Original file line numberDiff line numberDiff line change
@@ -348,6 +348,21 @@ params:
348348
- icon: fas fa-blog
349349
url: /blogs/attention-is-all-you-need
350350

351+
- title: "PaliGemma: A versatile 3B VLM for transfer"
352+
content: Paper implementation of Paligemma inspired by Umar Jamil. Currently a work in progress
353+
image: /projects/paligemma.png
354+
# featured:
355+
# name: Demo # Fixit
356+
# link: https://hugo-profile.netlify.app
357+
badges:
358+
- "VLM"
359+
- "Code Follow Along"
360+
- "Vision Model"
361+
- "Transformers"
362+
links:
363+
- icon: fab fa-github
364+
url: https://github.com/sarthak247/PaliGemma
365+
351366
- title: 🐱 NekoPDF 📖 - Document Interaction with LLM
352367
content: |-
353368
A web app to seamlessly interact with your documents using a Large Language Model over the Langchain framework. Currently support OpenAI and a single PDF file at a time but an option for other LLM backends and multiple PDF files will be added soon.

Diff for: public/blogs/word-embeddings/index.html

+3-6
Original file line numberDiff line numberDiff line change
@@ -547,16 +547,13 @@ <h2 id="tf-idf">TF-IDF</h2>
547547
<li><strong>Term Frequency (TF)</strong>: This component measures the frequency of a term (word) within a document. It rewards words that appear frequently within a document. It can be calculated as the ratio of the word&rsquo;s occurrences in a document to the total word count in that document.
548548
l</li>
549549
</ul>
550-
$$TF(term)=\frac{\text{Number of times term appears in a document}}{\text{Total number of items in the document}}$$
551-
<p>For example, consider our previous example <code>I love coding and coding is fun.</code> Here, TF(coding) is 1/6 as the word <code>and</code> has been ignored.</p>
550+
$$TF(term)=\frac{\text{Number of times term appears in a document}}{\text{Total number of items in the document}}$$<p>For example, consider our previous example <code>I love coding and coding is fun.</code> Here, TF(coding) is 1/6 as the word <code>and</code> has been ignored.</p>
552551
<ul>
553552
<li><strong>Inverse Document Frequeny</strong>: This component measures the rarity or importance of a word across all the documents. It is the log of the inverse of the document frequency where document frequency tells us the number of documents which contain a particular word.</li>
554553
</ul>
555-
$$DF(term)=\frac{\text{Documents containing our term}}{\text{Total number of documents}}$$
556-
<p>Thus, DF tells us about the proportion of documents which contain our word of interest. Thus, we inverse it to make sure that the more common a word is, example stopwords, the less score it gets and a logarithm is taken to dampen or reduce it&rsquo;s effect on the final calculation.</p>
554+
$$DF(term)=\frac{\text{Documents containing our term}}{\text{Total number of documents}}$$<p>Thus, DF tells us about the proportion of documents which contain our word of interest. Thus, we inverse it to make sure that the more common a word is, example stopwords, the less score it gets and a logarithm is taken to dampen or reduce it&rsquo;s effect on the final calculation.</p>
557555
<!-- raw HTML omitted -->
558-
$$IDF(term)=\log{\bigg(\frac{\text{Total number of documents}}{\text{Documents containing our term}}\bigg)}$$
559-
<p>Thus, Inverse Document Frequency (IDF) is a measure of how unique or significant a word is across a collection of documents. It can be computed as the logarithm of the total number of documents divided by the number of documents in which the word occurs, effectively quantifying the word&rsquo;s rarity and importance in the entire document collection.</p>
556+
$$IDF(term)=\log{\bigg(\frac{\text{Total number of documents}}{\text{Documents containing our term}}\bigg)}$$<p>Thus, Inverse Document Frequency (IDF) is a measure of how unique or significant a word is across a collection of documents. It can be computed as the logarithm of the total number of documents divided by the number of documents in which the word occurs, effectively quantifying the word&rsquo;s rarity and importance in the entire document collection.</p>
560557
<h3 id="pros-2">Pros</h3>
561558
<ol>
562559
<li>

Diff for: public/files/resume.pdf

-51.7 KB
Binary file not shown.

Diff for: public/index.html

+38-1
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
<html>
33

44
<head>
5-
<meta name="generator" content="Hugo 0.128.2"><script src="/livereload.js?mindelay=10&amp;v=2&amp;port=1313&amp;path=livereload" data-no-instant defer></script><meta charset="UTF-8">
5+
<meta name="generator" content="Hugo 0.133.1"><script src="/livereload.js?mindelay=10&amp;v=2&amp;port=1313&amp;path=livereload" data-no-instant defer></script><meta charset="UTF-8">
66
<meta name="viewport" content="width=device-width, initial-scale=1.0">
77
<meta http-equiv="X-UA-Compatible" content="ie=edge">
88
<meta http-equiv="Accept-CH" content="DPR, Viewport-Width, Width">
@@ -1111,6 +1111,43 @@ <h5 class="card-title bg-transparent mt-1">Attention Is All You Need</h5>
11111111
</div>
11121112
</div>
11131113

1114+
<div class="col-lg-4 col-md-6 my-3">
1115+
<div class="card my-3 h-100" title="PaliGemma: A versatile 3B VLM for transfer">
1116+
<div class="card-head">
1117+
<img class="card-img-top" src="/projects/paligemma.png" alt="PaliGemma: A versatile 3B VLM for transfer">
1118+
</div>
1119+
<div class="card-body bg-transparent p-3">
1120+
<div class="pb-2 bg-transparent">
1121+
1122+
<span class="badge badge-secondary">VLM</span>
1123+
1124+
<span class="badge badge-secondary">Code Follow Along</span>
1125+
1126+
<span class="badge badge-secondary">Vision Model</span>
1127+
1128+
<span class="badge badge-secondary">Transformers</span>
1129+
1130+
</div>
1131+
<h5 class="card-title bg-transparent mt-1">PaliGemma: A versatile 3B VLM for transfer</h5>
1132+
<div class="card-text bg-transparent secondary-font">
1133+
Paper implementation of Paligemma inspired by Umar Jamil. Currently a work in progress
1134+
</div>
1135+
</div>
1136+
1137+
<div class="card-footer py-3">
1138+
1139+
<span class="m-1 mx-2">
1140+
<a href="https://github.com/sarthak247/PaliGemma" target="_blank">
1141+
<i class="fab fa-github"></i>
1142+
</a>
1143+
</span>
1144+
1145+
1146+
</div>
1147+
1148+
</div>
1149+
</div>
1150+
11141151
<div class="col-lg-4 col-md-6 my-3">
11151152
<div class="card my-3 h-100" title="🐱 NekoPDF 📖 - Document Interaction with LLM">
11161153
<div class="card-head">

Diff for: public/index.json

+1-1
Large diffs are not rendered by default.

Diff for: public/projects/paligemma.png

298 KB
Loading

Diff for: static/projects/paligemma.png

298 KB
Loading

0 commit comments

Comments
 (0)