The DOCX-HTML Converter is a desktop application built with Tkinter that converts formatted blog text for specific websites, stored in DOCX files, to HTML. IMAGE CONVERSION IS NOT SUPPORTED.
- Drag-and-Drop Interface: Easily convert DOCX files by dragging and dropping them onto the application window.
- File Dialogs: Double-click the left text box to open the file selector and choose files to convert, or double-click the right text box to change the save directory.
- Conversion Info: After conversion finishes, a message box lists which files were converted successfully and which were not.
- Task Removal: The application identifies and removes tasks stored before and inside the first `</ul>` tag in the HTML content. You can change it by pressing `CTRL Z`. It is enabled by default.
- Strong Tag Removal: The application identifies and removes `<strong>` tags without removing HTML content inside this tag. You can change it by pressing `CTRL X`. It is enabled by default.
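The two cleanup options above can be illustrated with a minimal sketch. This is not the application's actual code: the function names (`remove_tasks`, `remove_strong_tags`, `clean_html`) and the `strip_tasks` / `strip_strong` flags are hypothetical, and the sketch assumes that "task removal" means dropping everything up to and including the first `</ul>` closing tag.

```python
import re


def remove_tasks(html: str) -> str:
    """Drop everything up to and including the first closing </ul> tag."""
    marker = "</ul>"
    index = html.find(marker)
    if index == -1:
        return html  # no list found, leave the content untouched
    return html[index + len(marker):].lstrip()


def remove_strong_tags(html: str) -> str:
    """Strip <strong> and </strong> tags but keep the content inside them."""
    return re.sub(r"</?strong\b[^>]*>", "", html, flags=re.IGNORECASE)


def clean_html(html: str, strip_tasks: bool = True, strip_strong: bool = True) -> str:
    """Apply both cleanups; each one is enabled by default (hypothetical flags)."""
    if strip_tasks:
        html = remove_tasks(html)
    if strip_strong:
        html = remove_strong_tags(html)
    return html


sample = "<p>Tasks</p><ul><li>todo</li></ul>\n<p>Hello <strong>world</strong></p>"
print(clean_html(sample))  # prints <p>Hello world</p>
```

Passing `strip_tasks=False` or `strip_strong=False` mirrors switching the corresponding option off in the application.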
- Install Python (tested only with Python 3.11).
- Install Pandoc:
  - For Linux: `sudo apt-get install pandoc`
  - For Windows: download and install the latest `*.msi` file.
- Then execute this command in the project folder: `pip install -r requirements.txt`
- Run the program: `python3 main.py` OR `python main.py`
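Before running the program, it can help to confirm that Pandoc is reachable on your `PATH`. The sketch below is an assumption about how a DOCX-to-HTML conversion could be driven from Python through the Pandoc command line; it is not taken from `main.py`, and the helper names (`pandoc_available`, `build_pandoc_command`, `convert`) are hypothetical. It uses only Pandoc's standard `--from`, `--to`, and `--output` options.

```python
import shutil
import subprocess
from pathlib import Path


def pandoc_available() -> bool:
    """Return True if a `pandoc` executable can be found on PATH."""
    return shutil.which("pandoc") is not None


def build_pandoc_command(docx_path: str, output_dir: str) -> list[str]:
    """Build the Pandoc command that converts one DOCX file to HTML."""
    source = Path(docx_path)
    target = Path(output_dir) / (source.stem + ".html")
    return [
        "pandoc", str(source),
        "--from", "docx",
        "--to", "html",
        "--output", str(target),
    ]


def convert(docx_path: str, output_dir: str) -> None:
    """Run Pandoc; raises CalledProcessError if the conversion fails."""
    subprocess.run(build_pandoc_command(docx_path, output_dir), check=True)


print("Pandoc found:", pandoc_available())
print(build_pandoc_command("post.docx", "out"))
```

Calling `convert("post.docx", "out")` would write `out/post.html`, provided Pandoc is installed and the `out` directory exists.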
Build a standalone executable with PyInstaller:
- With the Linux `tkdnd` library bundled: `pyinstaller -F -w --add-data "venv/lib/python3.<SPECIFIC_VERSION>/site-packages/tkinterdnd2/tkdnd/linux64/libtkdnd2.9.2.so:tkinterdnd2/tkdnd/linux64/" --name "DOCX-HTML Converter" main.py --additional-hooks-dir=.`
- Without it: `pyinstaller -F -w --name "DOCX-HTML Converter" main.py --additional-hooks-dir=.`