All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Unreleased
- 0.5.7-alpha - 2020-05-06
- 0.5.6-alpha - 2020-03-20
- 0.5.5-alpha - 2020-03-04
- 0.5.4-alpha - 2020-01-31
- 0.5.3-alpha - 2019-12-30
- 0.5.2-alpha - 2019-11-05
- 0.5.1-alpha - 2019-08-02
- 0.5.0-alpha - 2019-07-27
- 0.4.1-alpha - 2019-07-11
- 0.4.0-alpha - 2019-02-17
Click up arrow to go back to TOC.
- Add method convert_wav in cmdline_utils.py.
- Add the manual translation method.
- Add method sub_to_file in cmdline_utils.py.
- Add complex join-events functions.
- Add Auditok options optimization.
- Add send2trash instead of os.remove in some cases.
- Add word check in method man_get_vtt_words_index.
- Add speed limit in trim in join-events.
- Add method split_dst_lf_src_assfile to split bilingual subtitles file's events.
- Add current working directory support in file renaming.
- Add join-events support for youtube automatic caption in json format.
- Add support for googletrans v4.0.0rc1 in class ManualTranslator.
- Add more subtitles format support in method sub_processing to join and split the events of the subtitles.
- Add leading and trailing whitespace stripping in class method from_json_file in class YTBWebVTT.
- Add keep_events in --join-control to keep the indexes from subtitles events.
- Change the default value for
-et
option into 50. - Change the control flow in method audio_or_video_prcs by using args.output_files to control.
- Change ci to github actions by @jitingcn.
- Refactor method sub_conversion.
- Fix the issue when not input the
-D
option. - Fix args.audio_split_cmd replacement issue when using option
-ap
. issue #122 - Fix default value issue with option
--user-agent
. issue #127 - Fix last word in line not reading in method YTBWebVTT.from_file.
- Fix wrong return value in method list_to_googletrans. issue #136
- Fix youtube vtt multiple words using one timestamp issue.
- Fix output file path in method str_to_file by @jitingcn.
- Fix out of range when output bilingual subtitles with -der option by @jitingcn.
- Fix Auditok v0.1.8+ compatibility issue.
- Fix style_name issue in method split_dst_lf_src_assfile.
- Fix unable to type into the terminal after running by @9mido.
- Fix to avoid splitext on args.ext_regions when it's None in method sub_processing.
- Fix to avoid access on vtt_sub class members when it's None in method sub_processing.
- Fix to avoid access on src_sub class members when it's None in method sub_processing.
0.5.7-alpha - 2020-05-06
- Add support for Xun Fei Yun Speech-to-Text WebSocket API.
- Add support for Baidu Automatic Speech Recognition API. issue #68
- Add chars filter for the transcript result in XfyunWebSocketAPI.
- Add delete_chars in method list_to_googletrans.
- Add merge_src_assfile, merge_bilingual_assfile methods.
- Add stop words to split events in merge_src_assfile method.
- Add punctuations split in merge_src_assfile method.
- Add limitation in SplitIntoAudioPiece with an audio length of at least 4 bytes.
- Add support for src language auto-detection when not input
-SRC
language.
- Change the replacement condition of the audio_split_cmd only when the user doesn't modify it.
- Change the MAX_REGION_SIZE_LIMIT into 60 seconds.
- Change all text file input decoding into "utf-8".
- Change the default style selection in subtitles translation.
- Change the loglevel in ffmpeg commands into
-loglevel error
. - Change DEFAULT_MIN_REGION_SIZE to 0.5.
- Change langcodes into the optional dependency.
- Change LICENSE into GPLv2.
- Fix the size count bug when the last line been split in list_to_googletrans.
- Fix delete_chars issue when using
-of full-src
. - Fix os.remove() PermissionError in api_xfyun.py.
- Fix DEFAULT_AUDIO_PRCS_CMDS and DEFAULT_CHECK_CMD.
- Fix the issue with path checking in dependency finding.
- Fix Baidu's strange error code handling. issue #114
- Remove Python 2.7 support.
0.5.6-alpha - 2020-03-20
- Add extra environment variables check when finding dependencies. issue #91
- Add arguments parser input when open it without arguments. issue #92
- Add subtitles processing when not input
-SRC
.
- Change option
-sml
into-nsml
. - Change defaults for Auditok.
- Fix Google Speech-to-Text API empty result response bug. issue #89
- Deprecate Python 2.7 support.
0.5.5-alpha - 2020-03-04
- Add Google Cloud Speech-to-Text API Recognition Config json file input support.
- Add Speech-to-Text API recognition result json file output support. issue #76
- Add exception when receiving error from Google-Speech-v2.
- Add Nuitka compatibility codes to stop program compiled by Nuitka when using Google Cloud service account credentials. Nuitka pkg_resources.DistributionNotFound error
- Fix high memory/RAM usage partially by terminating child processes, lowering default multiprocessing counts and using
gc.collect(0)
. issue #67, issue #74 - Fix dependency finding issue. issue #82
- Fix ass override tags translation issue. issue #79
0.5.4-alpha - 2020-01-31
- Add basic Google Cloud Speech-to-Text support. issue #10
- Add more bilingual subtitles formats output support. issue #72
- Remove gtransv2 support.
- Fix output format limits when input is a subtitles file.
0.5.3-alpha - 2019-12-30
- Fix excessive transcoding time issue. pull request #66
- Fix Auditok option issues. issue #70
- Fix output option issue. issue #73
0.5.2-alpha - 2019-11-05
- Add issue templates.
- Fix last row of empty translation text missing issue. issue #62
- Fix executable file detection problem in the current directory.
0.5.1-alpha - 2019-08-02
- Add translation source lang code auto match.
- Fix method list_to_googletrans index error bug. issue #48
- Fix unix subprocess.check_output compatibility. issue #47
- Fix googletrans full-wide chars length too long issue. issue #49
0.5.0-alpha - 2019-07-27
- Add arguments for min and max region size. issue #3
- Add metadata.py. issue #5
- Add output file name detection to avoid any file overwritting.
- Add new dev branch for latest dev codes to push.
- Add more output format(ass, ssa, sub, mpl2, tmp). issue #20
- Add arguments for auditok.StreamTokenizer and energy_threshold. issue #30
- Add overwrite option
-y
for output overwrite and no input pause. issue #29 - Add specific .ass style when output format is .ass. issue #21
- Add timings generating function instead of using speech-to-text api. issue #14
- Add arguments for confidence control. issue #6
- Add arguments for dropping empty lines from speech-to-text results.
- Add free api to use by importing the googletrans. issue #25
- Add bilingual subtitle output. issue #16
- Add multi-types subtitles files output at the same time (regions/source language/destination language/bilingual subtitles) when using
--output-files
option. - Add exception to stop the workflow in main(). issue #35
- Add bilingual subtitle styles input. issue #32
- Add subtitles translate. issue #38
- Add function to auto-replace
’
to'
in the translation result. - Add py-googletrans control args. issue #36
- Add lang codes support.(Depend on langcodes package) issue #34
- Add complex ass json output. issue #39
- Add audio preprocessing. issue #7
- Add options to control every ffmpeg command. issue #43
- Add temp file save function. issue #22
- Add only audio fragments output. issue #44
- Add subtitles(first line) language detection powered by googletrans. issue #40
- Add http, https proxy support.(Set environment variables) issue #17
- Add i18n support. issue #9
- Add i18n language choice. issue #45
- issue #5.
- Rewrite help messages.
- Refactor argparse.
- Refactor constaints.
- Change dev branch into origin branch.
- Use alpha branch for alpha releases.
- Change docs.
- Change audio conversion workflow to get a better audio quality to process. Currently will create two files from the original source file separately. 48kHz/16bit/mono .wav for local speech regions finding. 44.1kHz/24bit/mono .flac for google speech v2 api upload or in other words, speech recognition. Need to point out that Google-Speech-v2 is wrong on the supported .flac audio channel number. According to my test the api doesn't support the 2-channel .flac file. agermanidis/autosub issue #155
- Refactor internal regions unit to millisecond. issue #23
- Refactor speech regions detection by using auditok. issue #27
- Refactor generate_subtitles into 3 parts. issue #24
- Refactor api_gen_text to 2 parts. One is speech_to_text. Another is text_translation. issue #33
- Refactor txt output. Now txt can output regions.
- Refactor list_to_sub_str. issue #37
- Fix vtt output replacing all commas to dots issues.
- issue #8
- Fix python3 compatibility issues.
- Fix Nuitka build after updating Nuitka to 0.6.4(Environment Anaconda2 python3.5).
0.4.1-alpha - 2019-07-11
- Add https speech-to-text api url and url choice argument. agermanidis/autosub pull request #135
- Add external speech-to-text regions control from external subtitle files. agermanidis/autosub pull request #159
- Add scripts to build, release and etc.
- Change docs.
- Fix vague language codes caused wrong recognition result. agermanidis/autosub pull request #136
0.4.0-alpha - 2019-02-17
- Fix several issues. agermanidis/autosub pull request #128 by @iWangJiaxiang
- Fix "ffmpeg.exe" causes "Dependency not found: ffmpeg" on Windows.
- Fix "ValueError" when the response data of "SpeechRecognizer" couldn't be parsed to JSON Object.
- Fix Temp Folder Permissions Denied on Windows 10. agermanidis/autosub issue #15
- Fix JSONDecodeError caused crash. agermanidis/autosub pull request #131 by @raryelcostasouza