| <<<Back 1 day (to 2017/12/27) | 20171228 |
Piedro | sebras: Would be nice. | 09:43.30 |
canopus | hi | 12:26.32 |
mubot | Welcome to #mupdf, the channel for MuPDF. If you have a question, please ask it, don't ask to ask it. Do be prepared to wait for a reply as devs will check the logs and reply when they come on line. | 12:26.32 |
canopus | i want to perform text selection functionlity by dragging finger in mupdf | 12:28.41 |
kens | canopus, go right ahead and code that. | 13:08.29 |
sanjay | hi | 13:48.30 |
mubot | Welcome to #mupdf, the channel for MuPDF. If you have a question, please ask it, don't ask to ask it. Do be prepared to wait for a reply as devs will check the logs and reply when they come on line. | 13:48.30 |
sanjay | how to get the title and text of any page inside mupdf library | 13:50.53 |
sebras | sanjay: to extract text you need to look into the structured text device. | 13:57.22 |
| sanjay: I assume you are working in C? or are you perhaps developing using Java? | 13:57.34 |
| you can easily see what kind of information you can get if you run: mutool draw -F stext doc.pdf 1 | 13:58.53 |
| to get the info for page 1 of doc.pdf | 13:59.00 |
| or mutool draw -F text doc.pdf 1 | 13:59.10 |
| if you just want to see the text. | 13:59.17 |
sanjay | i am working in java and my requirement is that, if we open any the then we get the title of that page. | 14:00.11 |
sebras | sanjay: ah, so you are using the java interface I see. | 14:00.38 |
sanjay | yes | 14:00.47 |
sebras | sanjay: I assume this is for some kind of book reader or something like that? | 14:00.59 |
sanjay | i want to perfom some operation in pdf | 14:01.45 |
sebras | sanjay: yes, I know. but since I'm trying to help you I'm just curious what I'm helping you with. will the app you develop go on the google play store? | 14:02.33 |
sanjay | like highlighting, share, copy in selected text of pdf file. | 14:02.53 |
sebras | sanjay: in PDF files each page has text, but there is no real title for each page. there is a title of the entire document. | 14:03.12 |
| sanjay: so you are doing develping this app for your employer? | 14:03.52 |
sanjay | yes | 14:04.15 |
sebras | sanjay: will the app be open source? | 14:04.35 |
| sanjay: similar to how mupdf is open sourced under the AGPLv3 license..? | 14:05.02 |
sanjay | but in pdf file some books are there and inside the text are there , if any text are open then my requirement to know the book name of that chapter. | 14:05.26 |
sebras | sanjay: mupdf can give you the characters on each page, but it is difficult to know what text on a page is a title or a chapter heading. | 14:07.44 |
| sanjay: is the app you are developing commercial..? | 14:07.55 |
| sanjay: will it be sold to customer? | 14:08.03 |
sanjay | no | 14:08.14 |
sebras | sanjay: ok, so the source code will be available on e.g. github? | 14:08.49 |
sanjay | yes | 14:09.27 |
| so, can you tell me is there any way to perform some customization of pdf file . | 14:10.50 |
sebras | sanjay: alright, since mupdf is AGPLv3 is it important that you license your app source code under a suitable license, also note that you may not be able to use some libraries like Google AdMob. you need to think about these things before you start coding. or you need to remind your manager that they need to think about this. | 14:11.32 |
| sanjay: we explain the details in chapter 1, 1.1 and 1.2 here: https://mupdf.com/docs/android-sdk.html | 14:12.13 |
sanjay | ok | 14:12.58 |
sebras | sanjay: I was just about to suggest to you that you might be able to extract chapter headings from the document outline, but then again extracting headings is a hard problem. | 14:15.38 |
| Forward 1 day (to 2017/12/29)>>> | |