python (12.9k questions)
javascript (9.2k questions)
reactjs (4.7k questions)
java (4.2k questions)
java (4.2k questions)
c# (3.5k questions)
c# (3.5k questions)
html (3.3k questions)
Tab separated data is confused to tables when parsing pdf to text
I am using pdfMiner to convert pdf to txt. When there are tabs, the data is read column wise instead of row wise. For example, the below snippet in a PDF:
titel1 : text1
title2: text2
titl...
A_Matar
Votes: 0
Answers: 1
Extracting email address, first name and last name from multiple PDF files within a folder
I am trying to extract the following information from all PDF files within a folder, the PDF files are CV's: Email Address, First Name, Last Name for a work project.
I have successfully managed to ext...
Berci Vagyok
Votes: 0
Answers: 1