pytube logo

pypi pypi

### 24 July 2020 Actively soliciting contributers! Ping @ronncc if you would like to help out! # pytube *pytube* is a very serious, lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos. ## Installation To install from pypi with pip: ```bash $ python -m pip install pytube ``` Sometime, the pypi release becomes slightly outdated. To install from the source with pip: ```bash $ python -m pip install git+https://github.com/pytube/pytube ``` ## Description YouTube is the most popular video-sharing platform in the world and as a hacker you may encounter a situation where you want to script something to download videos. For this I present to you *pytube*. *pytube* is a lightweight library written in Python. It has no third party dependencies and aims to be highly reliable. *pytube* also makes pipelining easy, allowing you to specify callback functions for different download events, such as ``on progress`` or ``on complete``. Finally *pytube* also includes a command-line utility, allowing you to quickly download videos right from terminal. ### Behold, a perfect balance of simplicity versus flexibility: ```python >>> YouTube('https://youtu.be/2lAe1cqCOXo').streams.first().download() >>> yt = YouTube('http://youtube.com/watch?v=2lAe1cqCOXo') >>> yt.streams ... .filter(progressive=True, file_extension='mp4') ... .order_by('resolution') ... .desc() ... .first() ... .download() ``` ## Features - Support for Both Progressive & DASH Streams - Support for downloading complete playlist - Easily Register ``on_download_progress`` & ``on_download_complete`` callbacks - Command-line Interfaced Included - Caption Track Support - Outputs Caption Tracks to .srt format (SubRip Subtitle) - Ability to Capture Thumbnail URL. - Extensively Documented Source Code - No Third-Party Dependencies ## Getting started Let's begin with showing how easy it is to download a video with pytube: ```python >>> from pytube import YouTube >>> YouTube('https://youtube.com/watch?v=2lAe1cqCOXo').streams.first().download() ``` This example will download the highest quality progressive download stream available. Next, let's explore how we would view what video streams are available: ```python >>> yt = YouTube('https://youtube.com/watch?v=2lAe1cqCOXo') >>> yt.streams [, , , , , , , , , , , , , , , , , , , , , , , ] ``` You may notice that some streams listed have both a video codec and audio codec, while others have just video or just audio, this is a result of YouTube supporting a streaming technique called Dynamic Adaptive Streaming over HTTP (DASH). In the context of pytube, the implications are for the highest quality streams; you now need to download both the audio and video tracks and then post-process them with software like FFmpeg to merge them. The legacy streams that contain the audio and video in a single file (referred to as "progressive download") are still available, but only for resolutions 720p and below. To only view these progressive download streams: ```python >>> yt.streams.filter(progressive=True) [, ] ``` Conversely, if you only want to see the DASH streams (also referred to as "adaptive") you can do: ```python >>> yt.streams.filter(adaptive=True) [, , , , , , , , , , , , , , , , , , , , , ] ``` You can also interact with Youtube playlists: ```python >>> from pytube import Playlist >>> pl = Playlist("https://www.youtube.com/watch?v=Edpy1szoG80&list=PL153hDY-y1E00uQtCVCVC8xJ25TYX8yPU") >>> for video in pl.videos: >>> video.streams.first().download() ``` Pytube allows you to filter on every property available (see the documentation for the complete list), let's take a look at some of the most useful ones. To list the audio only streams: ```python >>> yt.streams.filter(only_audio=True) [, , , ] ``` To list only ``mp4`` streams: ```python >>> yt.streams.filter(subtype='mp4').all() [, , , , , , , , , , , , , , ] ``` Multiple filters can also be specified: ```python >>> yt.streams.filter(subtype='mp4', progressive=True).all() >>> # this can also be expressed as: >>> yt.streams.filter(subtype='mp4').filter(progressive=True).all() [, ] ``` You also have an interface to select streams by their itag, without needing to filter: ```python >>> yt.streams.get_by_itag(22) ``` If you need to optimize for a specific feature, such as the "highest resolution" or "lowest average bitrate": ```python >>> yt.streams.filter(progressive=True).order_by('resolution').desc().all() [, ] ``` Note that ``order_by`` cannot be used if your attribute is undefined in any of the Stream instances, so be sure to apply a filter to remove those before calling it. If your application requires post-processing logic, pytube allows you to specify an "on download complete" callback function: ```python >>> def convert_to_aac(stream, file_handle): return # do work >>> yt.register_on_complete_callback(convert_to_aac) ``` Similarly, if your application requires on-download progress logic, pytube exposes a callback for this as well: ```python >>> def show_progress_bar(stream, chunk, bytes_remaining): return # do work >>> yt.register_on_progress_callback(show_progress_bar) ``` You can also download videos to a specific directory with specific filename: ```python >>> yt = YouTube('https://youtube.com/watch?v=2lAe1cqCOXo') >>> yt.streams.first().download(output_path="/tmp" ,filename='output') ``` ## Command-line interface (CLI) Pytube also ships with a tiny CLI for interacting with videos and playlists. To download the highest resolution progressive stream: ```bash $ pytube https://www.youtube.com/watch?v=2lAe1cqCOXo ``` To view available streams: ```bash $ pytube https://www.youtube.com/watch?v=2lAe1cqCOXo --list ``` To download a specific stream, use the itag ```bash $ pytube https://www.youtube.com/watch?v=2lAe1cqCOXo --itag=22 ``` To get a list of all subtitles (caption codes) ```bash $ pytube https://www.youtube.com/watch?v=2lAe1cqCOXo --list-captions ``` To download a specific subtitle (caption code) - in this case the english subtitles (in srt format) - use: ```bash $ pytube https://www.youtube.com/watch?v=2lAe1cqCOXo -c en ``` It is also possible to just download the audio stream (default AAC/mp4): ```bash $ pytube https://www.youtube.com/watch?v=2lAe1cqCOXo -a ``` Finally, if you're filing a bug report, the cli contains a switch called ``--build-playback-report``, which bundles up the state, allowing others to easily replay your issue.