V2PH Downloader
📦 Plug-and-play: No extra dependencies required
🌐 Cross-platform: Supports all platforms
🔄 Dual engines: Supports both DrissionPage and Selenium automation options
🛠️ Convenient: Supports multiple accounts for auto-login and switching, supports cookies/password login
⚡️ Fast: High-efficiency download with asynchronous event loop
🧩 Customizable: Offers many configuration options
🔑 Secure: Uses PyNaCL as encryption backend
Requirements:
- Chrome browser installed
- Python version > 3.10
- Install via pip
pip install v2dlOn first run, login to V2PH with one of the two methods:
- Account Management Interface
Usev2dl -ato enter the account management interface.
v2dl -a- Cookies Login
Logs in using cookies by specifying a cookies file. If the path is a directory, it will search for all .txt files containing "cookies" in their filename. This method adds the account to the login candidate list.
v2dl -c /PATH/to/cookies- Manual Login
Due to strict bot detection on login pages, you can trigger the login page by randomly downloading an album, then manually log in if errors occur.
v2dl supports various download methods, including downloading a single album, a list of albums, starting from a specific album, or reading all pages from a text file.
# Download a single album
v2dl "https://www.v2ph.com/album/Weekly-Young-Jump-2015-No15"
# Download all albums in an album list
v2dl "https://www.v2ph.com/category/nogizaka46"
# Download all pages in a text file
v2dl -i "/path/to/urls.txt"Cookies login is often more successful than using username/password.
Use an extension (e.g., Cookie-Editor) to export cookies in Netscape format, and input the exported cookie file path in the account manager tool.
Note
Exported cookies must include frontend-rmt/frontend-rmu.
Note
Cookies are sensitive information; use high-quality extensions and remove or restrict access after exporting.
- url: URL of the target to download.
- -i: URL list in a text file, one URL per line.
- -a: Enter the account management tool.
- -c: Specify the cookies file to be used for this execution. If the provided path is a folder, it will automatically search for all .txt files containing "cookies" in their names within that folder. This is especially useful for users who prefer not to use account management.
- -d: Configure the base download directory.
- --force: Force download without skipping.
- --range: Specifies the download range, following the same usage as
--rangein gallery-dl. - --bot: Select automation tool; Drission is less likely to be blocked by bots.
--chrome-args: Override the arguments used to launch Chrome. This is useful when the browser is being blocked or detected by bots. Usage:--chrome-args "window-size=800,600//guest". List of all available arguments.- --user-agent: Override the user-agent, useful for bot-blocked scenarios.
- --terminate: Whether to close Chrome after the program ends.
- -q: Quiet mode.
- -v: Debug mode.
The program will search for a config.yaml file in the system configuration directory and automatically load it. Please refer to the example for the correct format. The configuration file can be located in the following directories based on your operating system:
- Windows:
C:\Users\xxx\AppData\Roaming\v2dl\config.yaml - Linux, macOS:
/Users/xxx/.config/v2dl/config.yaml
In this file, you can modify settings like headers, language, scroll length, scroll step, and rate limits:
- headers: If blocked, you can customize the headers. Note that after modifying, you must restart the browser opened by v2dl to refresh.
- language: Used to set the name of the download directory, since I found some titles are from Google Translate, it is better to keep the original.
- use_default_chrome_profile: Use your personal Chrome profile, which theoretically makes it harder to be blocked. However, the browser cannot be interacted with during the download process.
- download_dir: Set the download location; defaults to the system download folder.
- download_log_path: Logs the URLs of downloaded album pages, skipped if duplicated. The default location is the system configuration directory.
- system_log_path: Location for program logs. The default location is the system configuration directory.
- rate_limit: Download speed limit, default is 400, which is sufficient and prevents being blocked.
- chrome/exec_path: Path to the system's Chrome executable.
- encryption_config: Adjust encryption-related settings. Higher configurations require longer decryption times, and the default value already meets the minimum performance requirements.
- Don't want to use a password manager
Use the -c flag to specify the path to your cookies. If a folder is provided, all *cookies*.txt files within it will be parsed. To avoid entering it every time, set the cookies_path in the config file, V2DL will load it automatically.
- Browser connection blocked
There are three common causes: a dirty IP, an improperly configured user agent, or anti-bot targeting the background tool DrissionPage. For the first, switch to a clean IP or disable your VPN. For the second, use --user-agent to match your environment (what's my user agent); be sure to restart the browser afterward. For the third, only the developer can resolve it.
- Download blocked
Create a config.yaml and set your custom headers.
- Incomplete downloads
First, check whether the number of images matches what's expected. Site counters can occasionally be wrong, and some pages may include VIP-only content. If everything looks fine, please file an issue with full logs.
- Architecture
The tool uses DrissionPage to bypass Cloudflare checks and httpx for downloading. DrissionPage (in d mode) only allows setting a user agent, while httpx accepts headers—meaning the two settings don't conflict.
For fun, I included some seemingly unnecessary features like this security architecture. I mostly just glanced at the documentation, and this section was written while researching. I selected a lightweight 4MB package (while cryptography is 25MB).
Password storage uses PyNaCL, an encryption suite based on modern cryptography Networking and Cryptography (NaCl). The system uses a three-layer key architecture for defense in depth:
-
The first layer uses the operating system's secure random source
os.urandomto generate a 32-bitencryption_keyandsaltfor key derivation using the advanced Argon2id algorithm, which combines Argon2i and Argon2d to defend against side-channel attacks and GPU brute-force cracking. -
The middle layer protects asymmetric key pairs with a master key using XSalsa20-Poly1305 with a 24-byte nonce to prevent password collisions. XSalsa20 enhances Salsa20 with greater security without hardware acceleration. Poly1305 ensures data integrity, preventing tampering during transmission.
-
The outer layer implements encryption with SealBox, using Curve25519 for perfect forward secrecy, offering RSA-level security with shorter keys.
The keys are stored in a secure folder with access control, and encryption materials are stored separately in a .env file.
You can also extend V2DL. An example code below demonstrates how to use custom default config and replace your own the web automation script.
import asyncio
from v2dl import V2DLApp
custom_defaults = {
"static_config": {
"min_scroll_distance": 1000,
"max_scroll_distance": 2000,
# ...
}
}
class CustomBot:
def __init__(self, config_instance):
self.config = config_instance
async def auto_page_scroll(self, full_url, page_sleep=0) -> str:
# this function should return the html content for each album page
print("Custom bot in action!")
return """
<!doctype html>
<html>
<head>
<title>Example Domain</title>
<meta charset="utf-8" />
<meta http-equiv="Content-type" content="text/html; charset=utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1" />
</head>
<body>
<div>
<h1>Example Domain</h1>
<p>This domain is for use in illustrative examples in documents. You may use this
domain in literature without prior coordination or asking for permission.</p>
<p><a href="https://www.iana.org/domains/example">More information...</a></p>
</div>
</body>
</html>
"""
class ExtendedV2DL(V2DLApp):
def _setup_runtime_config(self, config_manager, args):
super()._setup_runtime_config(config_manager, args)
config_manager.set("runtime_config", "user_agent", "my_custom_ua")
print("Custom config in action!")
bot_name = "my awesome bot"
command_line_args = ["--url", "https://www.v2ph.com/album/foo", "--force"]
app = ExtendedV2DL()
app.register_bot(bot_name, CustomBot)
app.set_bot(bot_name)
asyncio.run(app.run(command_line_args))- Rapid page switching or fast downloads may trigger blocks. Current settings balance speed and block prevention.
- Block likelihood depends on network environment. Avoid using VPN for safer downloads.
- Use cautiously to avoid overloading the website's resources.
This software (hereinafter referred to as the "Software") is released under the terms of the GNU General Public License v3.0 ("GPLv3") and is intended solely for lawful academic research, education, and technical development purposes. By downloading, installing, executing, or otherwise using the Software, you acknowledge that you have read, understood, and agreed to comply with the GPLv3 license and this Disclaimer.
The Software is provided "as is" and "as available", without any express or implied warranties of any kind. The developer makes no representations or guarantees, including but not limited to warranties of merchantability, fitness for a particular purpose, non-infringement, accuracy, reliability, data integrity, system compatibility, or uninterrupted operation. The developer does not warrant that the Software will be error-free, secure, or compliant with any specific jurisdiction's legal or technical requirements.
You are solely responsible for ensuring that your use of the Software complies with all applicable laws, regulations, directives, and policies of your jurisdiction(s), including but not limited to:
- Computer crime and unauthorized access laws
- Network and communications monitoring regulations
- Intellectual property laws (e.g., copyright, trademark, database rights)
- Privacy and personal data protection laws (e.g., GDPR, CCPA, Taiwan PDPA)
- Terms of service and API usage policies of any third-party platform or website
The functionality, scripts, examples, and documentation provided by the Software do not constitute legal, technical, security, or operational advice. The developer does not review or validate your specific use case and provides no endorsement of legality or compliance for any particular application.
You explicitly agree that you assume full responsibility and risk for any consequences resulting from your use of the Software, including but not limited to platform bans, account suspension, legal claims, regulatory action, damages, or reputational harm. The developer shall not be liable for any direct, indirect, incidental, special, punitive, or consequential damages arising from the use, misuse, or inability to use the Software, whether under contract, tort (including negligence), strict liability, or any other legal theory, even if advised of the possibility of such damages.
You must not use the Software in any manner that violates applicable law, infringes on third-party rights, breaches service agreements, or accesses services or data without proper authorization. The developer shall bear no responsibility for any consequences resulting from unlawful or unauthorized use of the Software.
The developer reserves the right to modify, suspend, or discontinue the Software, in whole or in part, at any time without prior notice, and shall not be liable for any resulting impact on users or third parties.
If any provision of this Disclaimer is held to be invalid or unenforceable by a court of competent jurisdiction, all other provisions shall remain in full force and effect.