Skip to main content

βš™οΈ Settings and Configuration

Detailed description of all functions and parameters of YouTube-Parser for effective data extraction from YouTube.


πŸ” Search Results Parsing​

The main function for collecting data based on search queries. Allows finding videos, channels, and playlists by keywords.

Search Results Parsing

YouTube API Method​

🎯 Input Data​

  • Type: Dropdown list
  • Value: Search query
  • Description: Keywords are specified line by line in the 1_Search_Keywords.txt file

πŸ“Š Type of Data to Collect​

  • Options: Video | Playlist | Channel
  • Default: Video
  • Purpose: Determines what type of content will be found by the query

πŸ“‹ Results Order​

  • Options:
    • By relevance (recommended)
    • By upload date
    • By view count
    • By rating
    • By title
    • By number of uploaded videos
  • Application: Affects the priority of results in search output

πŸ“… Date Filters​

  • Published after: Format DD.MM.YYYY
  • Published before: Format DD.MM.YYYY
  • Note: Both fields must be filled for the filter to work

🎬 Additional Filters​

  • Topic: Music, Games, Sports, Technology, etc.
  • Video category: Entertainment, Education, News, etc.
  • Resolution: HD 1080 | Standard
  • Duration: Long | Medium | Short
  • Subtitles: Only with subtitles | Without subtitles

InnerTube API Method​

🏷️ Hashtag Parsing​

  • Type: Toggle
  • Purpose: Search by hashtags like #keyword
  • Feature: Filters are not applied in this mode

⏰ Upload Date Filter​

  • Options: Last hour | Today | This week | This month | This year

πŸ“Ί Video Metadata Parsing​

Collection of detailed information about videos: views, likes, comments, description, and other characteristics.

Video Parsing

🎯 Input Data​

  • Options: Video | Channel
  • Video: Links in the 3_Video_Urls.txt file
  • Channel: Links in the 2_Channels_Urls.txt file

πŸ”§ Filtering Parameters​

πŸ“Š Latest Videos from Channel​

  • Type: Number
  • Default: 10
  • Purpose: Limits the number of latest videos for analysis

πŸ‘οΈ View Count [from]​

  • Type: Number
  • Default: 10000
  • Application: Selects videos with a minimum number of views

πŸ“… Publication Date [from]​

  • Format: DD.MM.YYYY
  • Example: 01.01.2025
  • Purpose: Analysis of only fresh content

⏱️ Video Length [from]-[to]​

  • Format: seconds
  • Examples:
    • 120-500 (from 2 to 8 minutes)
    • 120- (from 2 minutes)
    • -500 (up to 8 minutes)

πŸŽͺ Channel Metadata Parsing​

Extraction of information about channels: subscribers, description, contacts, statistics.

Channel Parsing

πŸ“§ Contact Extraction​

βœ‰οΈ Collect Email Addresses from Description​

  • Type: Toggle
  • Purpose: Automatic search for emails in channel description
  • Application: Useful for finding contacts for collaboration

🎯 Save Only Channels with Emails​

  • Type: Toggle
  • Effect: Filters results, keeping only channels with found contacts

🚫 Exclude Previously Collected Emails​

  • Type: Toggle
  • Function: Uses a blacklist to avoid duplicates

πŸ” Channel Filtering​

πŸ‘₯ Subscriber Count [from]​

  • Default: 1000
  • Purpose: Selection of channels with a specific audience
  • Note: Hidden data is excluded

πŸ“Š View Count [from]​

  • Default: 10000
  • Application: Evaluation of channel popularity

πŸ“… Creation Year [up to]​

  • Default: 2012
  • Goal: Search for experienced or new channels

🌍 Countries [comma-separated]​

  • Format: ISO 3166-1 alpha-2
  • Example: US, GB, CA, DE, FR, AU
  • Purpose: Geographic filtering

πŸ“‹ Playlist Parsing​

Analysis of playlists and videos in them for studying thematic content.

🎯 Data Sources​

  • Playlist: File 4_Playlists_Urls.txt
  • Channel: File 2_Channels_Urls.txt (all channel playlists)

πŸ“Š Data Obtained​

  • Playlist title and description
  • Number of videos
  • Creation date
  • List of all videos in the playlist

πŸ’¬ Comment Parsing​

Extraction of comments for audience analysis and feedback.

🎯 Sources​

  • For videos: Files 3_Video_Urls.txt or 2_Channels_Urls.txt
  • For posts: Files 5_Posts_Urls.txt or 2_Channels_Urls.txt

πŸ” Search Parameters​

πŸ”Ž Comment Search [contains]​

  • Type: Text
  • Support: Regex expressions
  • Application: Search for brand mentions, keywords

πŸ‘ Like Count [from]​

  • Default: 5
  • Purpose: Selection of popular comments

πŸ“… Publication Date [from]​

  • Format: DD.MM.YYYY HH:MM
  • Application: Analysis of fresh discussions

🚫 Exclude Replies​

  • Type: Toggle
  • Effect: Collect only main comments

πŸ’Ύ Content Downloading​

Download videos, audio, subtitles, and thumbnails for offline use.

Downloading

🎬 Video Downloading​

🎯 Data Sources​

  • Video: File 3_Video_Urls.txt
  • Playlist: File 4_Playlists_Urls.txt
  • Channel: File 2_Channels_Urls.txt

πŸ“Ή Video Type​

  • Options: Any | Videos | Livestreams | Shorts
  • Application: For channels - selection of content type for downloading

🎞️ Stream Type​

  • Video: Video + audio
  • Audio: Audio track only

πŸ”§ Video Quality​

  • 360p/720p Muxed: Fast downloading without merging
  • 1080p/1440p/2160p: High quality with subsequent merging
  • Max: Maximum available quality

πŸ–ΌοΈ Additional Options​

  • Download thumbnails: Save video previews
  • Number of videos: Limit for mass downloading

πŸ“ Subtitle Downloading​

🌐 Subtitle Language​

  • Format: ISO 639-1 (two-letter code)
  • Example: en for English, ru for Russian

πŸ“„ Subtitle Type​

  • Uploaded by author: High-quality, manually created
  • Auto-generated: YouTube machine translation
  • Any: Priority to author subtitles

✏️ Formatting​

  • Type: Toggle
  • Effect: Conversion to plain text without timestamps

🌐 Additional Settings​

Additional Settings

πŸ”‘ API Management​

πŸ”„ Shuffle API Keys​

  • Purpose: Even distribution of load
  • Effect: Increased parsing speed

🌐 Proxy Settings​

πŸ”’ Use Proxy​

  • Application: Required for InnerTube API
  • Function: Bypassing YouTube restrictions

πŸ”„ Proxy Rotation​

  • Mode: Cyclic use
  • Warning: Possible hanging when limits are exhausted

πŸ›‘οΈ Protection Against Blocking​

⏱️ Delay Between Requests​

  • Time: 1 second
  • Purpose: Prevention of captcha when working without proxy

🌍 Regional Settings​

🌎 Country​

  • Format: ISO 3166-1 alpha-2
  • Default: US
  • Influence: Emulation of regional results

πŸ—£οΈ Language​

  • Format: ISO 639-1
  • Default: en
  • Effect: Localization of results

πŸ’‘ Usage Recommendations​

πŸš€ For Beginners​

  1. Start with YouTube API - more stable and fast
  2. Use simple search queries without complex filtering
  3. Test on small volumes of data

⚑ For Experienced Users​

  1. Combine API methods to get the maximum amount of data
  2. Configure filtering for precise targeting
  3. Use proxies for large-scale parsing via InnerTube

⚠️ Important Limitations​

  • YouTube API: Maximum 600 results per query
  • Filtering: Applied after data retrieval
  • Regionality: Results depend on the selected country and language