GitHub - dmascialino/cuentos_verano12: Project to download the collection of short stories published by Pagina12 and convert it to reStructuredText

Description

Project to download the collection of short stories published in Pagina12 and convert them to reStructuredText. An EPUB can then be generated from the result with rst2epub2.

The commands to use it:

scrapy crawl cuentos -o cuentos.json -L INFO
python items_to_rst.py
PYTHONPATH=../rst2epub2-master python ../rst2epub2-master/rst2epub.py verano12.rst verano12.epub --traceback --stylesheet verano12.css
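
The second command turns the scraped items into a single reStructuredText file. The following is a minimal sketch of that kind of conversion, not the project's actual items_to_rst.py. It assumes each item has `title`, `author`, and `text` fields; these names are hypothetical and the real schema produced by the spider may differ.

```python
def item_to_rst(item):
    """Render one story as an RST section (field names are assumptions)."""
    title = item["title"].strip()
    # An RST section title needs an underline at least as long as the title.
    underline = "=" * len(title)
    author = item.get("author", "").strip()
    lines = [title, underline, ""]
    if author:
        # Emphasize the author name on its own paragraph.
        lines += ["*" + author + "*", ""]
    # RST paragraphs are separated by blank lines.
    paragraphs = [p.strip() for p in item["text"].split("\n") if p.strip()]
    for paragraph in paragraphs:
        lines += [paragraph, ""]
    return "\n".join(lines)


def items_to_rst(items):
    """Concatenate all stories into one RST document."""
    return "\n".join(item_to_rst(item) for item in items)
```

Scrapy's JSON feed export writes a single array, so loading cuentos.json with `json.load` gives a list that could be passed to `items_to_rst` and written out as verano12.rst with UTF-8 encoding.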

This project was built during PyCamp 2017, organized by PyAr.

Authors

Mario Chacon <[email protected]>
Laureano Silva <[email protected]>
Diego <[email protected]>

Note

rst2epub2 has a bug when the table of contents (which is built from the story titles) contains Unicode characters. If you get the error:

UnicodeEncodeError: 'ascii' codec can't encode character ...

The specified output encoding (utf-8) cannot
handle all of the output.
Try setting "--output-encoding-error-handler" to

* "xmlcharrefreplace" (for HTML & XML output);
  the output will contain "&#237;" and should be usable.
* "backslashreplace" (for other output formats);
  look for "\xed" in the output.
* "replace"; look for "?" in the output.

"--output-encoding-error-handler" is currently set to "xmlcharrefreplace".

It can be fixed by editing rst2epub2-master/epublib/epub.py and replacing the function _write_toc_ncx with:

def _write_toc_ncx(self):
    self.toc_map_root.assign_play_order()
    tmpl = self.loader.load('toc.ncx')
    stream = tmpl.generate(book=self)
    # Encode explicitly as UTF-8 so non-ASCII story titles are written correctly.
    with open(os.path.join(self.root_dir, 'OEBPS', 'toc.ncx'), 'wb') as fout:
        fout.write(stream.render('xml').encode('utf-8'))
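
The replacement function above encodes the rendered table of contents as UTF-8 before writing it to a file opened in binary mode. A small sketch of why that matters, assuming the failure comes from text being encoded with the ASCII codec (the original function is not shown here, so this is an inference from the error message); the sample title and the in-memory buffer are illustrative only:

```python
import io

title = "Índice de cuentos"  # non-ASCII text, like a story title in the index

# Encoding with the ASCII codec fails on the accented character.
try:
    title.encode("ascii")
    ascii_ok = True
except UnicodeEncodeError:
    ascii_ok = False

# Encoding explicitly as UTF-8 yields bytes that a binary-mode file accepts.
buf = io.BytesIO()  # stands in for open(..., 'wb')
buf.write(title.encode("utf-8"))
data = buf.getvalue()
```

Here `ascii_ok` ends up `False`, while `data` holds the UTF-8 bytes and decodes back to the original title.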

