See for instance https://commons.wikimedia.org/w/index.php?title=File:Mésange_Charbonnière_(88205567).jpeg&oldid=313653974
Maybe it's a UTF-8 issue?
The metadata we received already came with all non-ASCII characters replaced by question marks. I use the URL to reconstruct the filename and author name, but the rest is basically lost. To me it doesn't seem worth spending much time re-scraping this data; most of it can be recovered from context or, if necessary, from the original 500px link.
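For anyone curious, here is a rough sketch of how a filename can be recovered from a URL. It is not the actual import code: the function name `title_from_commons_url` is made up for illustration, and it assumes a Commons-style URL (either `index.php?title=...` or `/wiki/...`) whose title is percent-encoded UTF-8. Recovering the author name would need a similar step on whichever URL carries it, which is not shown here.

```python
from urllib.parse import urlsplit, parse_qs, unquote


def title_from_commons_url(url: str) -> str:
    """Hypothetical helper: pull the page title out of a Commons-style URL."""
    parts = urlsplit(url)
    query = parse_qs(parts.query)
    if "title" in query:
        # index.php?title=... form; parse_qs percent-decodes values as UTF-8
        title = query["title"][0]
    else:
        # /wiki/<title> form; decode the path segment ourselves
        title = unquote(parts.path.split("/wiki/", 1)[-1])
    # MediaWiki URLs use underscores where the title has spaces
    return title.replace("_", " ")


print(title_from_commons_url(
    "https://commons.wikimedia.org/w/index.php"
    "?title=File:M%C3%A9sange_Charbonni%C3%A8re_(88205567).jpeg&oldid=313653974"
))
```

Because the URL keeps the accented characters (as percent-encoded bytes), the title survives even when the rest of the metadata has been reduced to question marks.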
This is an even bigger problem than I thought: https://commons.wikimedia.org/wiki/File:Myshkin_(221178833).jpeg