PYTOMO() PYTOMO()
NAME
PYTOMO -
Pytomo is a YouTube crawler designed to extract network information
from YouTube video downloads.
USAGE
pytomo [-r max_rounds] [-u max_crawled_url] [-p max_per_url] [-P
max_per_page] [-t time_frame] [-n ping_packets] [-D download_time] [-B
buffering_video_duration] [-M min_playout_buffer_size] [-x] [-L
log_level]:
Options:
-h, --help show this help message and exit
-r MAX_ROUNDS Max number of rounds to perform (default 50)
-u MAX_CRAWLED_URL Max number of URLs to visit (default 10000)
-p MAX_PER_URL Max number of related URLs from each page (default 2)
-P MAX_PER_PAGE Max number of related videos from each page (default
30)
-t TIME_FRAME Time frame of the most popular videos to fetch at
the start of the crawl: 'today', 'week', 'month' or
'all_time' (default 'week')
-n PING_PACKETS Number of packets to be sent for each ping (default 3)
-D DOWNLOAD_TIME Download time for the video (default 30.000000)
-B BUFFERING_VIDEO_DURATION
Buffering video duration (default 3.000000)
-M MIN_PLAYOUT_BUFFER_SIZE
Minimum Playout Buffer Size (default 1.000000)
-x Do NOT store public IP address of the machine in the
logs
-L LOG_LEVEL The log level setting for the Logging module. Choose
from: 'DEBUG', 'INFO', 'WARNING', 'ERROR' and
'CRITICAL' (default 'DEBUG')
--http-proxy=PROXIES HTTP proxy to use to reach the Internet (default None)
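The option list above can be summarised as a small parser. The following is
only an illustrative sketch, written with Python's argparse module, of how
the documented flags and defaults fit together. It is not Pytomo's own
option-handling code, and the attribute names (max_rounds, hide_public_ip,
proxies, and so on) are assumptions chosen for readability.

```python
import argparse

LOG_LEVELS = ["DEBUG", "INFO", "WARNING", "ERROR", "CRITICAL"]
TIME_FRAMES = ["today", "week", "month", "all_time"]


def build_parser():
    """Illustrative parser mirroring the documented options (not Pytomo's code)."""
    parser = argparse.ArgumentParser(prog="pytomo")
    parser.add_argument("-r", dest="max_rounds", type=int, default=50)
    parser.add_argument("-u", dest="max_crawled_url", type=int, default=10000)
    parser.add_argument("-p", dest="max_per_url", type=int, default=2)
    parser.add_argument("-P", dest="max_per_page", type=int, default=30)
    parser.add_argument("-t", dest="time_frame", choices=TIME_FRAMES, default="week")
    parser.add_argument("-n", dest="ping_packets", type=int, default=3)
    parser.add_argument("-D", dest="download_time", type=float, default=30.0)
    parser.add_argument("-B", dest="buffering_video_duration", type=float, default=3.0)
    parser.add_argument("-M", dest="min_playout_buffer_size", type=float, default=1.0)
    # -x is a plain switch: when given, the public IP address is not logged.
    parser.add_argument("-x", dest="hide_public_ip", action="store_true")
    parser.add_argument("-L", dest="log_level", choices=LOG_LEVELS, default="DEBUG")
    parser.add_argument("--http-proxy", dest="proxies", default=None)
    return parser


# Example: a short crawl over today's most popular videos, without IP logging.
opts = build_parser().parse_args(["-r", "5", "-t", "today", "-x"])
print(opts.max_rounds, opts.time_frame, opts.hide_public_ip)
```

Any option left out of the command line keeps the default listed above, so
the example changes only the number of rounds, the time frame and the IP
logging switch.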
INSTALLATION-FREE
To offer an installation-free package, we provide binary executables
for Linux (32- and 64-bit), Windows, and Mac OS X. The binary files
were generated with Pyinstaller (version 1.5RC1).
If you have Python installed, you can directly run the start_crawl.py
script in the root directory or the pytomo script in the bin directory.
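As a rough sketch of running the script directly, the helper below builds
the command line for start_crawl.py using the interpreter that is currently
running. It assumes that start_crawl.py accepts the same options as the
pytomo command, which this page does not state explicitly. The function
name crawl_command and its parameters are hypothetical.

```python
import sys


def crawl_command(max_rounds=5, time_frame="today", hide_public_ip=True):
    """Build an argument list for start_crawl.py (assumed to take pytomo's options)."""
    cmd = [sys.executable, "start_crawl.py", "-r", str(max_rounds), "-t", time_frame]
    if hide_public_ip:
        cmd.append("-x")  # do not store the public IP address in the logs
    return cmd


# Show the command. Passing this list to subprocess.call() from the directory
# that contains start_crawl.py would launch the crawl.
print(" ".join(crawl_command()))
```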
EXTERNAL RESOURCES
We based lib_youtube_download on the YouTube Download script: we
simplified it as much as possible and included only the classes we
needed (and only YouTube video retrieval).
The dns module is taken from the DNS Python Package: we only modified
rdata so that Pyinstaller includes all needed modules.
The extraction of metadata from video files is adapted from the Kaa
Metadata Python Package: it has been modified to be independent of
Kaa-base (and thus pure Python and portable).
External Links
1. YouTube Download
2. Kaa Metadata Python Package
3. DNS Python Package
4. Pyinstaller