Find Jobs
Hire Freelancers

Converting Pdf Into Word

₹12500-37500 INR

Zavřený
Zveřejněno přibližně před 3 roky

₹12500-37500 INR

Zaplaceno při doručení
As we see, the pages of the PDF were converted to images. Then the images were read, and the content was written into a text file. Advantages of this method include: Avoiding text-based conversion because of encoding scheme resulting in loss of data. Even handwritten content in PDF can be recognized due to the usage of OCR. Recognizing only particular pages of the PDF is also possible. Getting the text as a variable so that any amount of required pre-processing can be done. Disadvantages of this method include: Disk storage is used to store the images in the local system. Although these images are tiny in size. Using OCR cannot guarantee 100% accuracy. Given a computer typed PDF document results in very high accuracy. Handwritten PDFs are still recognized, but the accuracy depends on various factors like handwriting, page color, etc. Part #1 deals with converting the PDF into image files. Each page of the PDF is stored as an image file. The names of the images stored are: PDF page 1 -> [login to view URL] PDF page 2 -> [login to view URL] PDF page 3 -> [login to view URL] …. PDF page n -> [login to view URL] Part #2 deals with recognizing text from the image files and storing it into a text file. Here, we process the images and convert it into text. Once we have the text as a string variable, we can do any processing on the text. For example, in many PDFs, when a line is completed, but a particular word cannot be written entirely in the same line, a hyphen (‘-‘) is added, and the word is continued on the next line. For example – This is some sample text but this parti- cular word could not be written in the same line. Now for such words, a fundamental pre-processing is done to convert the hyphen and the new line into a full word. After all the pre-processing is done, this text is stored in a separate text file. To get the input PDF files used in the code, click [login to view URL] Below is the implementation: filter_none brightness_4 # Import libraries from PIL import Image import pytesseract import sys from pdf2image import convert_from_path import os # Path of the pdf PDF_file = "[login to view URL]" ''' Part #1 : Converting PDF to images ''' # Store all the pages of the PDF in a variable pages = convert_from_path(PDF_file, 500) # Counter to store images of each page of PDF to image image_counter = 1 # Iterate through all the pages stored above for page in pages: # Declaring filename for each page of PDF as JPG # For each page, filename will be: # PDF page 1 -> [login to view URL] # PDF page 2 -> [login to view URL] # PDF page 3 -> [login to view URL] # .... # PDF page n -> [login to view URL] filename = "page_"+str(image_counter)+".jpg" # Save the image of the page in system [login to view URL](filename, 'JPEG') # Increment the counter to update filename image_counter = image_counter + 1 ''' Part #2 - Recognizing text from the images using OCR ''' 3 # Variable to get count of total number of pages filelimit = image_counter-1 # Creating a text file to write the output outfile = "[login to view URL]" # Open the file in append mode so that # All contents of all images are added to the same file f = open(outfile, "a") # Iterate from 1 to total number of pages for i in range(1, filelimit + 1): # Set filename to recognize text from # Again, these files will be: # [login to view URL] # [login to view URL] # .... # [login to view URL] filename = "page_"+str(i)+".jpg" # Recognize the text as string in image using pytesserct text = str(((pytesseract.image_to_string([login to view URL](filename))))) # The recognized text is stored in variable text # Any string processing may be applied on text # Here, basic formatting has been done: # In many PDFs, at line ending, if a word can't # be written fully, a 'hyphen' is added. # The rest of the word is written in the next line # Eg: This is a sample text this word here GeeksF- # orGeeks is half on .
IČ projektu: 29343282

O projektu

33 nabídky
Vzdálený projekt
Aktivní před 3 roky

Chcete si vydělat nějaké peníze?

Výhody podávání nabídek na Freelancer

Stanovte si rozpočet a časový rámec
Získejte za svou práci zaplaceno
Načrtněte svůj návrh
Registrace a podávání nabídek je zdarma
33 freelanceři nabízejí v průměru ₹19 330 INR za tuto práci
Avatar uživatele
PDF--WORD EXPERT -------I AM AVAILABLE RIGHT NOW-----100% ACCURACY I can do this work checked your whole description and attachments . Please knock me then I can do this Thanks
₹29 000 INR v 20 dnech
5,0 (192 recenze)
6,0
6,0
Avatar uživatele
Good Day! I am Ilxam and I have read your requirements and already ready to start working. Just contact me and I will finish your project in a short time for cheap price
₹12 500 INR v 1 dni
5,0 (1 recenze)
0,0
0,0
Avatar uživatele
Dear sir , I've just gone through your project, so I have expertise in such assignment as having a great experience for working on well reputed organization on corporate level. Highly goal given professional with spark to keep committed. My first and final bid to make my commitment on time and to provide the Assigned work well before the time given. You will find me as a well organized and punctual employee in term of completion of task. so, let's have a discussion about the project.
₹22 222 INR v 4 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
You will get the results as you want in just a couple of days!!! I'm a student of graduation so I have enough knowledge that I can do this work efficiently... this is just what I want to say
₹12 778 INR v 2 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
I am the best candidate. If you want I will complete before the deadline. Suggest me if you want any changes.
₹25 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
I have very good experience in windows office, all kind of file conversion (Pdf to MS word, excel, PowerPoint, images etc.),cad and 3D modeling and i am having more than 18 experiences in this field, and i have all licensed copy's software's which are required for this kind of job. End to end safe data transfer is guaranteed. Time required to complete the project is solely depends upon the quantity of job. Best Regards Deepakkumar Jatti
₹25 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
Hello Dear I have read your project and I have understood your project. I have the experience to do this type of project earlier. I have worked many platforms before. So, I have the confidence to do this. If you want we can work together. I have the experience to work in Data Entry Excel. I did this kind of work before. So please let me know if you want to work with me.
₹25 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
Hi, I will summarize all the pdfs and provide you with clear, concise notes. I have worked on similar projects compiling presentation slides and lecture notes into a single set of notes, and can deliver quality notes to tight deadlines. I can generally compile up to 10 pages of pdfs per day. I'm interested to hear more about the project and about the subject matter of the lectures. If you award me the project I'd be very happy to discuss this further and get started for you as soon as possible. Thanks! Preetha
₹13 000 INR v 3 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
I convert pdf to word in a professional way I am very advanced in this work, so please hire me for this
₹12 778 INR v 1 dni
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
delivery date might be sooner..loyal hardworking and a man looking to sustain himself i can do the job perfectly
₹12 500 INR v 3 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
I Subash, In freelancer. I have 5 years experience in this field. Services include PDF Conversion, Typesetting and Cover design.
₹16 667 INR v 2 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
I am certified Data Entry Worker. I have published two books. I can't make any mistakes. I have done many jobs... I have two year experience in it.
₹15 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
Hi, I'm interested in you job ad and would like to work in gathering data and data entry. Short introduction about myself: my name is Celso Renz Dema-ala. a 26 year old Architect from the Philippines. I had around 1 year of experience in data entry and transcription with a typing speed of 65wpm. I also know my way around the internet so I think it helps with researching. Hoping for you to consider me. Thank you very much, Celso Renz Dema-ala
₹25 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
Hi, I have read your requirements and I am very interested in this project. I have required skills and experience to do this for you at reasonable price. Please feel free to contact me. Regards, Sanket Dahane
₹25 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
hello, my name is Awais khan I am a dedicated and hard working person who believes in honesty and good working relation. though i am new it this sector of job but i have certain qualities which makes me good at this.
₹12 500 INR v 2 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
As this pandemic brings us to home so I have learnt to manage all things online which also include PDF and word file conversions so I feel am good at this . I will do the quality job and will complete the task on time.
₹25 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
I have scanned many books and converted them to pdfs and separated them in chapters. If you want the perfect completion of the project. Contact me.
₹13 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
Will ensure good quality work and that too within the mentioned timeline. I am sure you will be happy to work with me.
₹25 000 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
I'm here to help you. I am well versed with the conversion content in pdf to text format in word. doing this conversion for past 2 to 3 years. I will provide you the desired results. awaiting for your positive response.
₹13 889 INR v 4 dnech
0,0 (0 recenze)
0,0
0,0
Avatar uživatele
Hey, I am Student of Computer Engineering. I want this project cause I am good at typing and can complete the task within the given Deadline.
₹13 611 INR v 7 dnech
0,0 (0 recenze)
0,0
0,0

O klientovi

Pochází z INDIA
Navsari, India
0,0
0
Členem od úno 20, 2021

Ověření klienta

Díky! Poslali jsme vám e-mailem odkaz pro získání kreditu zdarma.
Při odesílání e-mailu se něco pokazilo. Zkuste to prosím znovu.
Registrovaných uživatelů Zveřejněných projektů
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Načítání náhledu
Bylo uděleno povolení ke geolokaci.
Vaše doba přihlášení vypršela a byli jste odhlášeni. Přihlaste se znovu.