You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
NeuralNoise is an AI-powered podcast studio that uses multiple AI agents working together. These agents collaborate to analyze content, write scripts, and generate audio, creating high-quality podcast content with minimal human input. The team generates a script that the cast team (using a TTS tool of your choice) will then record.
@@ -72,7 +78,7 @@ To run the NeuralNoise application first make sure that you create a configurati
72
78
Then you can run the application with:
73
79
74
80
```
75
-
nn new <url|filepath> --name <name> --config-path <config>
81
+
nn generate --name <name> <url|file> [<url|file>...]
76
82
```
77
83
78
84
## Want to edit the generated script?
@@ -81,17 +87,18 @@ The generated script and audio segments are saved in the `output/<name>` folder.
81
87
82
88
1. Locate the JSON file in this folder containing all script segments and their text content.
83
89
2. Make your desired changes to specific segments in the JSON file. Locate the "sections" and "segments" content in this file that you want to change, then feel free to edit the content of the segments you want to change.
84
-
3. Run the same command as before (same name) to regenerate the podcast.
90
+
3. Run the same command as before with the same name (`nn generate --name <name>`) to regenerate the podcast.
85
91
86
92
The application will regenerate the podcast, preserving unmodified segments and only processing the changed ones. This approach allows for efficient editing without regenerating the entire podcast from scratch.
87
93
88
94
## Roadmap
89
95
90
-
- [ ] Better PDF and articles content extraction.
96
+
- [x] Better PDF and articles content extraction.
97
+
- [ ] Add interactive ways of using NeuralNoise (Gradio/Colab/etc)
91
98
- [ ] Add local LLM provider. More generic LLM configuration. Leverage AutoGen for this.
92
99
- [ ] Add local TTS provider
93
100
- [ ] Add podcast generation format options: interview, narrative, etc.
94
-
- [] Add podcast generation from multiple source files
101
+
- [x] Add podcast generation from multiple source files
95
102
- [ ] Add more agent roles to the studio. For example, a "Content Curator" or "Content Researcher" that uses tools to find and curate content before being analyzed. Or a "Sponsor" agent that adds segways to ads in the podcast script ([à la LTT](https://www.youtube.com/live/EefvOLKoXdg?si=G1714t2jK4ZIvao0&t=5307)).
96
103
- [ ] Add music and sound effects options
97
104
- [ ] Real-time podcast generation with human and AI collaboration (🤔)
0 commit comments