|
| 1 | +--- |
| 2 | +name: "What Fact Check websites are referenced in Community Notes on X?" |
| 3 | +excerpt: A count of IFCN websites linked in Community Notes |
| 4 | +author: "Aatman Vaidya" |
| 5 | +project: |
| 6 | +date: 2025-04-30 |
| 7 | +tags: devlog |
| 8 | +--- |
| 9 | + |
| 10 | +import TopDomains from "../images/ifcn_blog_top_domains_plot.png" |
| 11 | +import TopIFCNDomains from "../images/ifcn_blog_top_ifcn_plot.png" |
| 12 | +import IFCNIndiaDomains from "../images/ifcn_blog_top_ifcn_pie_india_colored.png" |
| 13 | + |
| 14 | +We wanted to better understand what fact check websites are being referenced in community notes on X and their frequency. The data for community notes and user rating can be downloaded officially from [here](https://x.com/i/communitynotes/download-data) and fields of the data have been documented [here](https://communitynotes.x.com/guide/en/under-the-hood/download-data). The data for notes is available from 28th Jan 2021 to 25th Apr 2025\. The total number of notes in this timeline are approximately 1.85 million. |
| 15 | + |
| 16 | +We started off by finding what all websites/domains are linked in a note? Using basic regex to extract urls from text and then extracting domain names from the urls, below are the top 20 website domains that are referenced in community notes over time. |
| 17 | + |
| 18 | +<img src={TopDomains} alt="TopDomains" width="1200" height="800" /> |
| 19 | + |
| 20 | +We found that **81.23%** of all the community notes had links in them. Majority of the urls linked were from X itself followed by Wikipedia, Youtube, Google, BBC, Reuters, Instagram etc. |
| 21 | + |
| 22 | +Next, we repeated the same analysis to find the number of community notes which have International Fact-Checking Network ([IFCN](https://www.poynter.org/ifcn/)) websites included in them. Since we couldn’t find any available database of IFCN domains, we used the [list from wikipedia](https://en.wikipedia.org/wiki/List_of_fact-checking_websites) and manually created an array with the domains. |
| 23 | + |
| 24 | +<img src={TopIFCNDomains} alt="TopDomains" width="1200" height="800" /> |
| 25 | + |
| 26 | +We would also like to conduct a topic analysis of all notes that include links to IFCN-affiliated websites. The goal is to understand the broader topics, discussions, or contexts in which fact-checking sources are cited. Additionally, we’d like to compare user approval ratings for notes that include IFCN links versus those that don’t. |
| 27 | + |
| 28 | +We then ran a similar analysis as above only for Indian IFCN-afflicated organizations. We found that 10 Indian IFCN domains were present in the **1833** notes, which is approx **0.1%** of all community notes. |
| 29 | + |
| 30 | +<img src={IFCNIndiaDomains} alt="TopDomains" width="1200" height="800" /> |
| 31 | + |
| 32 | +Here is a distribution with counts of India based IFCN websites. |
| 33 | + |
| 34 | +| Domain | Count | |
| 35 | +| :---- | :---- | |
| 36 | +| indiatoday.in | 945 | |
| 37 | +| altnews.in | 270 | |
| 38 | +| factly.in | 255 | |
| 39 | +| newschecker.in | 154 | |
| 40 | +| factcrescendo.com | 103 | |
| 41 | +| youturn.in | 47 | |
| 42 | +| newsmobile.in | 24 | |
| 43 | +| vishvasnews.com | 18 | |
| 44 | +| thip.media | 13 | |
| 45 | +| medicaldialogues.in | 4 | |
| 46 | + |
| 47 | +The Google Colab Notebook with the code of the above analysis can be found here - [](https://colab.research.google.com/drive/1tq6wwBsuq-HFnsD_oBmgg2TAF8WxRt92?usp=sharing). |
| 48 | + |
0 commit comments