PDF Table Extractor - repository to hold revisable version of code from https://www.cvast.tuwien.ac.at/projects/pdf2table by Burcu Yildiz
Updated Mar 15, 2024 - Java
PDF Analysis: Extracts words and their frequencies from PDF files to prepare text data for topic analysis of annual reports of German car manufacturers, e.g. Volkswagen, Porsche and Audi. Please note that words are only extracted; stemming is not applied. In order to improve this, use nltk.stem.snowball.Snowba…
A secure, AI-enhanced file scanning tool built on Flask, strengthened with ClamAV and PDF analysis, designed to vigilantly detect digital threats and potential vulnerabilities.
A RAG project. Chat PDF
An extremely fast and user-friendly PDF page counter app for multiple PDF files.
This project focuses on automating the analysis and reporting of bibliometric data, specifically targeting the annual production of academic articles. The primary goal is to understand trends, anomalies, and patterns in bibliometric data through a combination of statistical modeling and exploratory data analysis.
Arch Linux packaged version of the Kali Linux PDF analysis tool pdfid. The original author is DidierStevensSuite; his license applies.
This project uses Google's Generative AI to analyze and answer questions about PDF content. It provides a user-friendly interface to upload PDFs and receive insightful answers generated by the Gemini AI model.
PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis.