Skip to content

Commit e6ffc8e

Browse files
authored
Update hot-take-on-open-source-AI.mdx
1 parent 8efea2d commit e6ffc8e

File tree

1 file changed

+11
-10
lines changed

1 file changed

+11
-10
lines changed

src/blog/hot-take-on-open-source-AI.mdx

Lines changed: 11 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -9,14 +9,14 @@ tags: open souce AI, tech, policy
99

1010
Over the last few months I've been in a number of conversations with business executives, media and policy professionals and researchers, who in the sincerest of intentions have said something like "but what even is open source AI" or the scarier version "oh but we don’t have a definition of open source". I think people intend for these statements to be provocations to start a conversation but, in my humble opinion, these become more like confounders.
1111

12-
Let me explain my frustration using the metaphor of a chilli. Yes, that humble ingredient used in most kitchens across the world.
12+
Let me explain my frustration using the metaphor of a chilli. Yes, that ordinary ingredient used in most kitchens across the world.
1313

1414
![Many Hot Chilli Peppers](../images/Madame_Jeanette_and_other_chillies.jpg "Many Hot Chilli Peppers")
15-
By [Takeaway](https://tattle.co.in/blog/hot-take-on-open-source-AI/), licenses under CC BY-SA 4.0 via Wikimedia Commons
15+
By [Takeaway](https://tattle.co.in/blog/hot-take-on-open-source-AI/), licensed under CC BY-SA 4.0 via Wikimedia Commons
1616

17-
Anyone who spends time in a kitchen- most mothers, some fathers, the born-in-pandemic amateur chefs- and botanists know how to identify a chilli. If we want to get technical, it belongs to the [genus Capsicum](https://en.wikipedia.org/wiki/Chili_pepper) and is eaten raw, dried, fried, crushed. It is one versatile little thing that packs a punch in a dish. There are a bunch of dishes- Mexican Chilli, Chilli Chicken, Honey Chilli potato- where chilli, strictly by volume or weight, isn’t the main ingredient. But chilli is added as an adjective because it adds something unique and important to the dish. Chilli is so important that its influence has spilled far outside the culinary stage. [Iconic rock bands](https://en.wikipedia.org/wiki/Red_Hot_Chili_Peppers) are named after it. Restaurants ranging from [American fast food chains](https://en.wikipedia.org/wiki/Chili%27s) to small dhabas in India fight over calling themselves Chilli. Whether their food uses chilli or black pepper as the primary spice is a different matter. It has spun off crazy subcultures of [chilli eating competitions](https://en.wikipedia.org/wiki/Hot_pepper_challenge) that serve no functional purpose. None of this, however, confuses anybody about the organic edible thing called chilli. No one is going into malls buying a plastic toy in the shape of a chilli, mistaking it for the real thing, and adding it to their food.
17+
Anyone who spends time in a kitchen- most mothers, some fathers, the born-in-pandemic amateur chefs and botanists know how to identify a chilli. If we want to get technical, it belongs to the [genus Capsicum](https://en.wikipedia.org/wiki/Chili_pepper) and is eaten raw, dried, fried, crushed. It is one versatile little thing that packs a punch in a dish. There are a bunch of dishes- Mexican Chilli, Chilli Chicken, Honey Chilli potato- where chilli, strictly by volume or weight, isn’t the main ingredient. But chilli is added as an adjective because it adds something unique and important to the dish. Chilli is so important that its influence has spilled far outside the culinary stage. [Iconic rock bands](https://en.wikipedia.org/wiki/Red_Hot_Chili_Peppers) are named after it. Restaurants ranging from [American fast food chains](https://en.wikipedia.org/wiki/Chili%27s) to small dhabas in India fight over calling themselves Chilli. Whether their food uses chilli or black pepper as the primary spice is a different matter. It has spun off crazy subcultures of [chilli eating competitions](https://en.wikipedia.org/wiki/Hot_pepper_challenge) that serve no functional purpose. None of this, however, confuses anybody about the organic edible thing called chilli. No one is going into malls buying a plastic toy in the shape of a chilli, mistaking it for the real thing, and adding it to their food.
1818

19-
There are a lot of other kinds of plants such as white pepper, black pepper and ginger that can be added to spice food. Sometimes the black pepper growers association might feel envious of the popularity of chilli and crash the chilli growers association conference and scream, “But we do exactly what you do. We are also a great spice\!” This doesn’t make black pepper a chilli. Because chilli is a biologically identifiable thing. A plant must contain *capsaicinoids* to be called a chilli. Sorry black pepper, you just ain’t chilli.
19+
There are a lot of other kinds of plants such as white pepper, black pepper and ginger that can be added to spice food. Sometimes the black pepper growers association might feel envious of the popularity of chilli and crash the chilli growers association conference and scream, “But we do exactly what you do. We are also a great spice\!” This doesn’t make black pepper a chilli. A plant must contain capsaicinoids to be called a chilli. Sorry black pepper, you just ain’t chilli.
2020

2121
Now in the crazed decade that is the 2020s, an evil/mad/genius food manufacturer- let’s call it X for simplicity- produces a new range of hot sauces. X pushes these sauces to Michelin star restaurants and grocery stores alike, promising a culinary revolution. The sauce never spoils, it is cheaper than your spices and you can forget your worries about salting your food right. The sauce with its special adaptive properties will figure out exactly how much saltiness to express. So says X.
2222

@@ -31,24 +31,25 @@ Now switch out:
3131
* Sauce for AI
3232

3333
And I think we get a pretty good picture of what is happening with open source and AI.
34-
Here are my four hot takes:
3534

36-
1. The definition of open source software is fairly settled. Any classification exercise, be it of a plant, software or AI, is a heavily social process. There are power dynamics in whose voices count in naming something[^1]. What happens to objects on the boundaries of classes can be arbitrary (remember when Pluto was still a planet?). But time becomes a good test of any classification system and with open source software, we’ve had two decades. The definition is pretty clear. Just as chilli is identifiable by some chemical properties (presence of capsaicin), open source software is identifiable by some legally enforceable properties.
37-
None of us working with FOSS have any doubt about how to identify free and open source software. Even when the black pepper folks (DPGs) say that they are brethren of chilli (open source software), it is easy to factually check this. Even if the success of open source software has inspired domains outside of software, the ‘vibe’ of openness is tied to material practices in software. There is a checklist of the four freedoms- to use, study, modify, distribute software- that software licenses must meet.
35+
After that spicy background, I offer four hot takes:
36+
37+
1. The definition of open source software is fairly settled. Any classification exercise, be it of a plant, software or AI, is a heavily social process. There are power dynamics in whose voices count in naming something[^1]. What happens to objects on the boundaries of classes can be arbitrary (remember when Pluto was still a planet?). But time becomes a good test of any classification system and with open source software, we’ve had two decades. The definition is pretty clear. Just as chilli is identifiable by some chemical properties (presence of capsaicin), open source software is identifiable by legally testable properties.
38+
None of us working with FOSS have any doubt about how to identify free and open source software. Even when the black pepper folks (DPGs) say that they are brethren of chilli (open source software), it is easy to factually check this. Even if the success of open source software has inspired domains outside of software, the ‘vibe’ of openness is tied to material practices in software. There is a checklist of the four freedoms- to use, study, modify, distribute software- that software licenses must meet.
3839

3940
2. This is a relatively minor point- the creation of a derivative product shouldn’t be pulling the legitimacy of an original ingredient into question. Just because DPG/black pepper is gaining popularity (no less because chilli made spices popular), and we now have AI/sauces doesn’t mean we need to be confused about what open source software/chilli is.
4041

41-
3. Just as the creation of sauce involves a bunch of chemical processes (glycation, acidulation) where ingredients bond in a way that makes it challenging to talk about properties coming from constituent ingredients, so it is with AI. AI is a new kind of technical artifact. This whole affair of defining open source AI started because corporations were using ‘open source’ as a marketing push for AI. There was a fair bit of discussion and deliberation that happened in the process of [OSI coming up with OSAID](https://discuss.opensource.org/t/the-open-source-ish-ai-definition-osaid/580/6). That definition might not last but we are far beyond the point of “what even is open source AI.” By and large, everyone recognizes that it won’t be as easy to certify AI products against open source licenses as it is to certify software, because AI is software and data amalgamated in a way that makes it hard to separate the contribution of either. More importantly, arguing about the chilli quotient of a sauce might not be the most productive use of our time, when all other aspects of the other production process are also concerning. The content and quality of other ingredients matters. The process and quality of the facility in which the sauce was made also matters. With AI, all aspects of the production process need investigation. And this is why I think the question of “what even is open source AI?” is a distraction or a confounder. If Meta or Anthropic trains their models on books and papers scraped from Libgen but then opens all the data and weights and code for LLama, they would have met all aspects of their model open. But should we celebrate this as a success?
42+
3. Just as the creation of sauce involves a bunch of chemical processes (glycation, acidulation) where ingredients bond in a way that makes it challenging to talk about properties coming from constituent ingredients, so it is with AI. AI is a new kind of technical artifact. This whole affair of defining open source AI started because corporations were using ‘open source’ as a marketing push for AI. There was a fair bit of discussion and deliberation that happened in the process of [OSI coming up with OSAID](https://discuss.opensource.org/t/the-open-source-ish-ai-definition-osaid/580/6). That definition might not last but we are far beyond the point of “what even is open source AI.” By and large, everyone recognizes that it won’t be as easy to certify AI products against open source licenses as it is to certify software, because AI is software and data amalgamated in a way that makes it hard to separate the contribution of either. More importantly, arguing about the chilli quotient of a sauce might not be the most productive use of our time, when all other aspects of the other production process are also concerning. The content and quality of other ingredients matters. The process and quality of the facility in which the sauce was made also matters. With AI, all aspects of the production process need investigation. This is why I think that the question of “what even is open source AI?” is a distraction or a confounder. If Meta or Anthropic trains their models on books and papers scraped from Libgen but then opens all the data and weights and code for LLama, they would have met all aspects of their model open. But should we celebrate this as a success?
4243
The open source software definition was supposed to be a mental shortcut for four tangible things that a piece of code enables. It is clear that an open source AI definition won’t map onto a similarly simple check list of four freedoms because data and compute are significantly expensive for any person to exercise the four freedoms with AI. I think people are right to watch the licenses that models are released with, with hawk eyes. But, this won't be enough to ensure autonomy to tinker and innovate, which is what the four freedoms enable. FOSS communities need to get more involved in conversations of data governance (beyond open data) and politics of hardware supply chains. All of that, at present, seems more important than splitting hair over what should constitute an Open Source AI definition.
4344

4445
4. And last, after all this confusion, maybe we should stop invoking ‘open source’ as a synonym of ‘good’. People use open source as an adjective for digital IDs, governments, infrastructures, charity... Just as the band Red Hot Chilli Pepper shares little in common with the plant outside of the name, many of these concepts that might be inspired by FOSS share little in common with FOSS software. The material difference between government and software, money and software, IDs and software matters. Just applying the suffix ‘open’ to big concepts doesn’t make them all unanimously good. This also holds for AI.
4546

46-
It is worth stating that the most egregious misuses of the open source language don’t (always) come from FOSS practitioners. But a narrative battle around open source has been underway for a while, and we need to take some responsibility for not tackling it with more force. Perhaps we were so used to fighting like [the underdogs](https://en.wikipedia.org/wiki/Microsoft_and_open_source) that we took every adaptation of the language, however loose, to be a win. Or perhaps if we had taken more effort to build public understanding of OSS, a narrative spin would have been less successful. To go back to the chilli metaphor, imagine an alternate world where only some people could distinguish a chilli from sichuan pepper, but the average population understanding stayed at the level of spice. And so when black pepper or sichuan pepper sellers said they are the same as chilli, people had no reason to doubt it. After all, the chilli sellers hadn’t said anything earlier.
47+
It is worth stating that the most egregious misuses of the open source language don’t (usually) come from FOSS practitioners. But a narrative battle around open source has been underway for a while, and we need to take some responsibility for not tackling it with more force. Perhaps we were so used to fighting like [the underdogs](https://en.wikipedia.org/wiki/Microsoft_and_open_source) that we took every adaptation of the language, however loose, to be a win. Or perhaps if we had taken more effort to build public understanding of OSS, a narrative spin would have been less successful. To go back to the chilli metaphor, imagine an alternate world where only some people could distinguish a chilli from sichuan pepper, but the average population understanding stayed at the level of spice. And so when black pepper or sichuan pepper sellers said they are the same as chilli, people had no reason to doubt it. After all, the chilli sellers hadn’t said anything earlier.
4748

4849
FOSS practitioners (and the general public) are caught in a maelstrom of jargon. The one thing we need is to defend and steady the turf of open source software- the definition of open source software is not contested and we need to make this point as loudly and as frequently as possible, and in forums outside our FOSS cliques. We need to up our game as public communicators to bring back the precision of language used to describe technical objects. In chasing that precision we will automatically find ourselves beyond the boundary of open source AI, examining the properties of all the constituents that come together to become AI.
4950

5051
There is also a provocation for the ‘tech for good’ and policy folks (in the odd chance that anyone gets this far)- can we challenge ourselves to write and talk about national infrastructures and digital sovereignty in the age of AI without using the terms Digital Public Goods and Open Source AI? This one little change alone could do a lot to improve the quality of the conversation.
5152

52-
_A special thanks to Denny George and Srravya Chandramowli for revieweing an earlier, less hinged version of this draft._
53+
_A special thanks to Denny George and Srravya Chandramowli for revieweing an earlier, less hinged version of this blog._
5354

5455
[^1]: In this context, Srravya Chandramowli recommended the book: [Sorting Things Out](https://mitpress.mit.edu/9780262522953/sorting-things-out/)

0 commit comments

Comments
 (0)