View Farm

Prerequisites

PowerShell 5.1 or later
Python 3.8 or later
Docker installed

Cloning the Repository

Open PowerShell or your preferred terminal.

Clone the repository by running the following command:

git clone https://github.com/oliverkristianfritsche/ViewFarm_v2

Navigate to the project directory:
```
cd ViewFarm_v2
```

Configuring the `config.json` File

Create a new file named config.json in the root directory of the project.

Add the following structure to the file:

{
  "botid": "your_bot_id",
  "YOUTUBE_API_KEY": "your_youtube_api_key",
  "youtubescrapper": {
    "searchBy": "hashtag",
    "searchValue": "your_search_value",
    "maxResults": 50,
    "order": "viewCount",
    "shorts_category_id": "0",
    "max_short_length": 60
  },
  "CLIENT_ID": "your_client_id",
  "CLIENT_SECRET": "your_client_secret",
  "USER_AGENT": "your_user_agent",
  "subreddits": "your_subreddits",
  "num_posts": 11,
  "target_language": ["en", "es", "hi", "pt", "ar"],
  "speaker_file": "/root/speakers/speaker_americanpyscho.wav",
  "audio_speed": 2.5,
  "video_speed": 1.3,
  "max_video_length": 60
}

Replace the placeholder values with your own:
- botid: A unique identifier for your bot.
- YOUTUBE_API_KEY: Your YouTube API key used to access YouTube data.
- youtubescrapper:
  - searchBy: Criteria for searching on YouTube. Options are hashtag, trending, or account.
  - searchValue: The specific value to search for based on the searchBy parameter (e.g., the actual hashtag or account name).
  - maxResults: The maximum number of search results to return.
  - order: The order in which to sort results (e.g., viewCount).
  - shorts_category_id: ID for the YouTube shorts category.
  - max_short_length: The maximum length (in seconds) for short-form videos.
- CLIENT_ID: Your Reddit client ID used to access the Reddit API.
- CLIENT_SECRET: Your Reddit client secret used for API authentication.
- USER_AGENT: The user agent string for Reddit API requests.
- subreddits: A comma-separated list of subreddits from which to scrape stories.
- num_posts: The number of posts to retrieve from each subreddit.
- target_language: A list of target languages for translation (e.g., ["en", "es", "hi", "pt", "ar"]).
- speaker_file: The path to the audio file used for TTS (text-to-speech) synthesis.
- audio_speed: The speed multiplier for the generated audio.
- video_speed: The speed multiplier for the video playback.
- max_video_length: The maximum length (in seconds) for the final video.

Running the `build.ps1` Script

Open PowerShell and navigate to the root directory of the project.
Run the build.ps1 script by typing:
```
.\build.ps1
```
The script will install the required dependencies and build the project.

Running the `run.ps1` Script

Ensure that you have correctly configured the config.json file with your own values.
Open PowerShell and navigate to the root directory of the project.
Run the run.ps1 script by typing:
```
.\run.ps1
```
The script will start the multilanguage Reddit pipeline.

Note: This script automatically mounts your Google Drive. If the specified paths do not exist, you must change them in the ./run.ps1 script. Videos will be outputted to ./reprocessio/[language type].

Pipeline Overview

This project showcases a sophisticated and automated content creation pipeline, designed to capitalize on the virality of short-form content, particularly on platforms like YouTube and Reddit.

YouTube Scraping: The process begins by using the YouTube API to scrape video analysts and links based on criteria like hashtags, trending locations, or specific accounts.
Video Download: Once the desired videos are identified, they are downloaded using youtube-dl.
Reddit Story Collection: The Reddit API is then utilized to gather popular stories from selected subreddits, ensuring the content is engaging and relevant.
Translation: Using deep_translate, the stories are translated into the target languages specified in the config.json file.
Video Overlay: The downloaded videos and translated Reddit stories are then combined using MoviePy to create compelling videos that are primed for short-form content platforms.
Text-to-Speech: The translated text is converted to speech using the GTTS package.
Content Popularity: This method is particularly effective for generating viral content, with some pages achieving millions of views across all videos.
Google Drive Integration: Once the videos are created, they are automatically saved to a Google Drive account in language-specific folders for easy organization and access.
Autoposting: The final step involves using the Repurpose.io website to automate the posting of these videos to various social media accounts, ensuring consistent and timely content distribution.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
.github/workflows		.github/workflows
databases		databases
downloaders		downloaders
scrappers		scrappers
tests		tests
tts		tts
.gitconfig		.gitconfig
.gitignore		.gitignore
README.md		README.md
build.ps1		build.ps1
dockerfile		dockerfile
main.py		main.py
requirements.txt		requirements.txt
run.ps1		run.ps1
schedule_container.ps1		schedule_container.ps1
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

View Farm

Table of Contents

Prerequisites

Cloning the Repository

Configuring the `config.json` File

Running the `build.ps1` Script

Running the `run.ps1` Script

Pipeline Overview

About

Uh oh!

Releases

Packages

Languages

oliverkristianfritsche/ViewFarm_v2

Folders and files

Latest commit

History

Repository files navigation

View Farm

Table of Contents

Prerequisites

Cloning the Repository

Configuring the config.json File

Running the build.ps1 Script

Running the run.ps1 Script

Pipeline Overview

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Configuring the `config.json` File

Running the `build.ps1` Script

Running the `run.ps1` Script

Packages