Generic: use compat_urllib_parse_unquote to prevent UTF-8 mangling
of the entire page in Python 2.

Requires the fixed compat_urllib_parse_unquote.

Example: the following command saves with a mangled playlist title
instead of the kanji for 'tsunami'. The same problem affects all
UTF-8-encoded URLs.

youtube-dl -f18 -o '%(playlist_title)s-%(title)s.%(ext)s' \
61c14c1e3a/tsunami.html
parent e37c932fca
commit 45eedbe58c
1 changed file with 1 addition and 1 deletion
@@ -1115,7 +1115,7 @@ class GenericIE(InfoExtractor):
 # Sometimes embedded video player is hidden behind percent encoding
 # (e.g. https://github.com/rg3/youtube-dl/issues/2448)
 # Unescaping the whole page allows to handle those cases in a generic way
-webpage = compat_urllib_parse.unquote(webpage)
+webpage = compat_urllib_parse_unquote(webpage)
 
 # it's tempting to parse this further, but you would
 # have to take into account all the variations like
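
The mangling this change avoids can be reproduced in a small sketch. It runs on Python 3 and only approximates the Python 2 behaviour: as far as I understand it, Python 2's urllib.unquote turns each %XX escape in a unicode string into the single code point XX (Latin-1), so a multi-byte UTF-8 sequence becomes several unrelated characters. The helper latin1_unquote and the sample page string are hypothetical and are not taken from youtube-dl or from the page in the commit message.

```python
import re
from urllib.parse import unquote

# Hypothetical page fragment: the title is percent-encoded UTF-8
# for the kanji "津波" (tsunami).
page = "<title>%E6%B4%A5%E6%B3%A2</title>"


def latin1_unquote(text):
    """Assumed approximation of the old behaviour: every %XX escape
    becomes the single code point XX, i.e. a Latin-1 decode."""
    return re.sub(
        r"%([0-9A-Fa-f]{2})",
        lambda match: chr(int(match.group(1), 16)),
        text,
    )


# Six Latin-1 characters instead of two kanji: the title is mangled.
mangled = latin1_unquote(page)

# urllib.parse.unquote decodes escape sequences as UTF-8 by default,
# so the two kanji are recovered.
fixed = unquote(page)

print(repr(mangled))
print(repr(fixed))
```

The mangled string still carries the original bytes, so re-encoding it as Latin-1 and decoding as UTF-8 recovers the title. Decoding the escapes as UTF-8 in the first place, as the replacement call is meant to do, avoids the round trip.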