| <<<Back 1 day (to 2021/04/07) | Fwd 1 day (to 2021/04/09)>>> | 20210408 |
artifexirc-bot | <Ihor> They use own technologies, and we don't have same possibility to handle links? | 07:34.17 |
ator | All I know is that the PDF file does not have any links on that page. There is some text that looks like a partial URL but isn't. What about page 1, are there supposed to be links there too? | 09:12.39 |
malc_ | ator: there are 2 links on page 1 | 10:05.17 |
| https://www.clinicalpocketreference.com and https://www.clinicalpocketreference.com | 10:06.24 |
| whoops sorry and: | 10:06.35 |
| https://www.drakemedox.co.uk/ | 10:06.44 |
| that's according 2 llpp's interrogation of mupdf | 10:07.12 |
ator | malc_: sorry, I meant the page labeled "1" (i.e. page 3) | 10:08.27 |
artifexirc-bot | <ator> the first page has those two links for me as well | 10:09.10 |
malc_ | ator: nothing on page 3 | 10:09.33 |
artifexirc-bot | <ator> but the third page has text which says "www.clinicalpocketreference.com" which is not a link annotation | 10:09.41 |
malc_ | ator: aye | 10:10.51 |
artifexirc-bot | <KenSharp> We've seen before (as you noted) that some PDF consumers, and Acrobat is one, will turn somethign which looks like a URL into a hyperlink, even though there is no hyperlink in the PDF file. As you suggested I would imagine that's the problem | 10:45.03 |
malc_ | KenSharp, ator: did you catch what viewer the reporter was using? | 10:59.25 |
artifexirc-bot | <KenSharp> I think they said 'Apple' so presumably the Quartz built-in viewer | 11:01.33 |
malc_ | Preview then... i can test that | 11:03.19 |
| KenSharp: yes, thehttp://www.clinicalpocketreference.com is clickable in Preview, (and that's not all that Preview does with this document) | 11:11.45 |
| ator: i think what preview does is something "Previewy" (not "PDFy" IOW) | 12:20.10 |
artifexirc-bot | <KenSharp> malc_ Sorry was chatting elsewhere. Yes we know Acrobat does this (we've had people raise bug reports). | 12:50.39 |
| <KenSharp> Up to ator wehter he thinks its worth adding, I'm inclined to feel not myself | 12:51.01 |
malc_ | Kensharp: aha, just want to reiterate - i wasn't using Acrobat | 12:53.02 |
ator | I'd prefer not to. | 12:53.03 |
| it's something a client can do without too much trouble using the text extraction if they feel a pressing need. | 12:53.38 |
artifexirc-bot | <KenSharp> That's hw I'd feel, yes | 13:08.08 |
velix | MuScripting is damn great! | 18:20.49 |
| I just wrote this: "usage: mutool run pdf-split.js a.pdf@2 b.pdf@8 input.pdf" | 18:21.14 |
| 2 pages to "a.pdf", then 8 pages to "b.pdf" then again 2 pages to "a" etc. | 18:21.39 |
| I don't know other tools, who are that flexible. | 18:21.50 |
| which* | 18:21.54 |
locustlord | hey how do I use mutool to extract everything from a pdf? | 18:49.21 |
| wait | 18:49.32 |
| nvm got it | 18:50.36 |
| ok sorry I did not | 18:50.52 |
| it says that they're not valid font files | 18:50.57 |
| when I try to install them | 18:51.03 |
| is there a way to turn them into valid font files? | 18:54.25 |
| I understand that they only contain a selection of the glyphs but I only need that selection of glyphs for my purposes anyway | 18:54.57 |
artifexirc-bot | <KenSharp> They may simply be in a format your OS does not support. Such as CFF or type 3 | 18:55.20 |
locustlord | nope these are ttf | 18:57.19 |
| I'm on windows 10 | 18:57.32 |
artifexirc-bot | <KenSharp> Without seeing the fonts I couldn't possibly comment 🙂 | 18:57.48 |
locustlord | alright give my a second | 18:57.59 |
| I'll upload them | 18:58.04 |
artifexirc-bot | <KenSharp> No, please don't, I'm off now its late for me. | 18:58.14 |
| <KenSharp> It may also be that the fonts are CIDFonts with TrueType outlines, your OS won't like those either | 18:58.30 |
locustlord | https://a.tmp.ninja/qTPhQUPw.zip | 18:59.15 |
artifexirc-bot | <KenSharp> If you open the file in Acrobat and look at the fotns tab it should say if they are CIDFonts | 18:59.21 |
locustlord | ah | 18:59.21 |
artifexirc-bot | <KenSharp> Well one of those has a cff extension | 18:59.41 |
locustlord | so that means they're CIDFonts? | 19:00.06 |
artifexirc-bot | <KenSharp> The .cff is, I would guess, a CFF format font | 19:00.18 |
| <KenSharp> (Compact Font Format) not a TTF | 19:00.26 |
| <KenSharp> THer other ones I don't know offhand | 19:00.34 |
| <KenSharp> Easiest way to check if you ahev Acrobat is to look at the fonts tab in the info dialog | 19:00.50 |
locustlord | can I do it on the free acrobat reader? | 19:01.26 |
artifexirc-bot | <KenSharp> Not sure, I'l look | 19:01.59 |
locustlord | maybe you'd know a better way of doing what I'm trying to do | 19:03.02 |
| I want to change 2 text fields in an existing pdf | 19:03.11 |
artifexirc-bot | <KenSharp> I really am off but someone else may be abl;;e to answer | 19:03.22 |
locustlord | alright have a good evening | 19:03.33 |
artifexirc-bot | <KenSharp> Oh well if they are using an embedded subset font, then that's simple, you can't | 19:03.37 |
locustlord | what if the glyphs I intent to use are used elsewhere in the same font in the document/ | 19:04.01 |
| ? | 19:04.01 |
| intend* | 19:04.05 |
artifexirc-bot | <KenSharp> Then you can, but..... | 19:04.10 |
| <KenSharp> You need to be sure they are in the same subset font. | 19:04.17 |
locustlord | they definitely are | 19:04.23 |
artifexirc-bot | <KenSharp> You then need to identify the character codes used for the glyphs you want | 19:04.28 |
| <KenSharp> Then replace the characetr codes you want replaced by the character code for the glyphs you want instead. | 19:04.46 |
locustlord | how do I identify those codes? | 19:04.57 |
artifexirc-bot | <KenSharp> For example, 'Hello World' in a subset fotn will consist of character codes 1, 2, 3, 3, 4, 5, 6, 4, 7, 3, 8 | 19:05.35 |
| <KenSharp> In general the first time a glyph is used it is assigned the character code of 1, the next glyph is given 2 and so on | 19:06.04 |
| <KenSharp> SO you need to decode teh PDF file, find the string which uses the text containing the glyphs you want | 19:06.25 |
locustlord | how do I decode it beyond the gibberish I get when I try to open it in notepad? | 19:06.41 |
artifexirc-bot | <KenSharp> Then find the text with the glyphs you want replaced. Then you simply change the character codes in the string. | 19:06.44 |
| <KenSharp> PDF files are usually compressed, so you need to decompress it mutool clean -d or something like that will decompress the file. | 19:07.09 |
| <KenSharp> I foget the exact incantation | 19:07.21 |
| <KenSharp> Note that if the new glyphs are not the same width as the old glyphs then they may collide if the characters are drawn at specific points on the page | 19:07.55 |
| <KenSharp> If you replaced a 'i' with a 'M' for example | 19:08.11 |
| <KenSharp> Start by decompressing the file and seeing if you can figure out what it does 🙂 | 19:08.42 |
locustlord | initially I just did it in gimp and copied the stuff I wanted over the stuff I didn't, and then exported the pages as a pdf | 19:09.16 |
| sorry for leaving my internet DC'd on me | 19:11.09 |
| I don't know if you're still here KenSharp but if you are thank you for your help | 19:11.29 |
| hopefully my internet doesn't crap out on me again | 19:13.20 |
| going to restart my PC brb | 19:15.58 |
| I'm back | 19:18.22 |
| holy crap | 19:20.24 |
| KenSharp u are a god | 19:20.30 |
| I can actually figure this out | 19:20.37 |
| AAAAAAHH | 19:27.23 |
| wth | 19:27.26 |
| I changed a 1 to a zero | 19:27.35 |
| saved | 19:27.38 |
| everything disappeared | 19:27.40 |
| changed the zero back to 1 | 19:27.45 |
| everything reappeared with the fonts screwed up | 19:27.55 |
| some serious rocket surgery here | 19:28.39 |
| this is super difficult to read | 19:31.07 |
patanga | why does one have to do press the right mouse button to mark text? what library is that? | 20:45.48 |
| <<<Back 1 day (to 2021/04/07) | Forward 1 day (to 2021/04/09)>>> | |