@@ -12,6 +12,7 @@ pub_date: 2026-03-03
1212---
1313body:
1414
15+
1516## Quantifying the Commons: The end of an era
1617
1718Dear gentle reader,
@@ -20,17 +21,17 @@ It is the end of an era yet the beginning of my bloom as a young aspiring data
2021professional on a global stage. It feels so surreal to be at the end of this
2122amazing journey with my mentors and to see the quantifying commons become a
2223mature project in the creative commons open source community. Quantifying the
23- commons is also blooming so stay tuned to experience its impact in
24- different teams at Creative Commons.
24+ commons is also blooming so stay tuned to experience its impact in different
25+ teams at Creative Commons.
2526
2627Looking back, I was quite nervous on my first meeting with Timid Robot and
2728Sara. I did not quite understand the automation part of the project, how long
28- the scripts ran? Why? I was fascinated by the whole process of the system, after
29- further explanation by Timid Robot I was really impressed by the design thinking. A
30- lot of details and critical thinking were put into implementing the system. Big
31- kudos to the project lead and previous contributors, I am in love with the
32- foundation being put in place prior to my contribution. It is a firm one and it
33- made my work easier and worthwhile.
29+ the scripts ran? Why? I was fascinated by the whole process of the system,
30+ after further explanation by Timid Robot I was really impressed by the design
31+ thinking. A lot of details and critical thinking were put into implementing the
32+ system. Big kudos to the project lead and previous contributors, I am in love
33+ with the foundation being put in place prior to my contribution. It is a firm
34+ one and it made my work easier and worthwhile.
3435
3536
3637## Day 1 was amazing, Day 90 is growth!
@@ -51,47 +52,54 @@ efficiency of the automation process.
5152### Automating the Smithsonian quarterly report
5253
5354Smithsonian is one of the largest public institutions in the United States. It
54- has a total of 38 units/data sources like museums, zoos and libraries as of when
55- I worked on it. We derived insights on the usage of CC0 license across the media
56- records and records without media. This urged me to add the horizontal stacked
57- barplot to the collection of visualization in the report system. From this, we
58- could get the distribution of the records with CC0 licenses at a glance. Also,
59- we explored the top 10 distribution of units and lowest 10 distribution of
60- units. This meaningfully tells us how common the CC0 license is used in these
61- institutions. After testing the whole workflow a couple of times, I detected
62- that the unit code seems to be updated frequently whether added or removed. I
63- developed a function that keeps track of these changes and gives a warning about
64- changes in the next automation process. This was the best way possible at the
65- moment to handle the sudden unit code, so that our data is quite predictable and
66- updated.
55+ has a total of 38 units/data sources like museums, zoos and libraries as of
56+ when I worked on it. We derived insights on the usage of CC0 license across the
57+ media records and records without media. This urged me to add the horizontal
58+ stacked barplot to the collection of visualization in the report system. From
59+ this, we could get the distribution of the records with CC0 licenses at a
60+ glance. Also, we explored the top 10 distribution of units and lowest 10
61+ distribution of units. This meaningfully tells us how common the CC0 license is
62+ used in these institutions. After testing the whole workflow a couple of times,
63+ I detected that the unit code seems to be updated frequently whether added or
64+ removed. I developed a function that keeps track of these changes and gives a
65+ warning about changes in the next automation process. This was the best way
66+ possible at the moment to handle the sudden unit code, so that our data is
67+ quite predictable and updated.
68+
6769
6870### Automating the arXiv quarterly report
6971
70- Arxiv is a curated research-sharing platform with 5 million monthly active users
71- and hosts 2.6 million research papers. We derived quite interesting insights
72- from this data source. Then expanded the visualization collection in plot.py by
73- adding the function for line plot and vertical stacked barplot. The insights
74- include the count of legal tools on a yearly basis and various comparative
75- analysis of the tools over the years. We also explored the breakdown of these
76- tools usage in different categories.
72+ Arxiv is a curated research-sharing platform with 5 million monthly active
73+ users and hosts 2.6 million research papers. We derived quite interesting
74+ insights from this data source. Then expanded the visualization collection in
75+ plot.py by adding the function for line plot and vertical stacked barplot. The
76+ insights include the count of legal tools on a yearly basis and various
77+ comparative analysis of the tools over the years. We also explored the
78+ breakdown of these tools usage in different categories.
79+
7780
7881## Lessons learned
82+
7983I learnt so much about creating a structure when solving a problem. It is quite
8084easier to debug and it presents a detailed workflow for future contributors to
8185understand what has been done previously. It literally boils down to how you
82- name your variable or how you use it in a function. I also learnt the importance
83- of asking why. Timid Robot encouraged me to always question assumptions and understand
84- the reasoning behind decisions. This was the best thing to do because it made
85- the whole internship fun and puzzling. Things became naturally logical and I
86- could connect the dots quite easily.
86+ name your variable or how you use it in a function. I also learnt the
87+ importance of asking why. Timid Robot encouraged me to always question
88+ assumptions and understand the reasoning behind decisions. This was the best
89+ thing to do because it made the whole internship fun and puzzling. Things
90+ became naturally logical and I could connect the dots quite easily.
91+
8792
8893## What Next!
94+
8995I hope to continue volunteering my time on the project going forward. I am also
9096eager to explore other open-source projects involving research, big data, and
9197automation, and to further align these skill sets with my background in
9298actuarial science.
9399
100+
94101## Goodbye for now
102+
95103I really enjoyed working with my mentors, I will miss our little chit chats
96104about the holidays, the weather and even vacation trips. I look forward to
97105catching up again in the future.
0 commit comments