Log of #mupdf at irc.freenode.net.

 <<<Back 1 day (to 2021/04/07)Fwd 1 day (to 2021/04/09)>>>20210408 
artifexirc-bot <Ihor> They use own technologies, and we don't have same possibility to handle links?07:34.17 
ator All I know is that the PDF file does not have any links on that page. There is some text that looks like a partial URL but isn't. What about page 1, are there supposed to be links there too?09:12.39 
malc_ ator: there are 2 links on page 110:05.17 
  https://www.clinicalpocketreference.com and https://www.clinicalpocketreference.com10:06.24 
  whoops sorry and:10:06.35 
  that's according 2 llpp's interrogation of mupdf10:07.12 
ator malc_: sorry, I meant the page labeled "1" (i.e. page 3)10:08.27 
artifexirc-bot <ator> the first page has those two links for me as well10:09.10 
malc_ ator: nothing on page 310:09.33 
artifexirc-bot <ator> but the third page has text which says "www.clinicalpocketreference.com" which is not a link annotation10:09.41 
malc_ ator: aye10:10.51 
artifexirc-bot <KenSharp> We've seen before (as you noted) that some PDF consumers, and Acrobat is one, will turn somethign which looks like a URL into a hyperlink, even though there is no hyperlink in the PDF file. As you suggested I would imagine that's the problem10:45.03 
malc_ KenSharp, ator: did you catch what viewer the reporter was using?10:59.25 
artifexirc-bot <KenSharp> I think they said 'Apple' so presumably the Quartz built-in viewer11:01.33 
malc_ Preview then... i can test that11:03.19 
  KenSharp: yes, thehttp://www.clinicalpocketreference.com is clickable in Preview, (and that's not all that Preview does with this document)11:11.45 
  ator: i think what preview does is something "Previewy" (not "PDFy" IOW)12:20.10 
artifexirc-bot <KenSharp> malc_ Sorry was chatting elsewhere. Yes we know Acrobat does this (we've had people raise bug reports).12:50.39 
  <KenSharp> Up to ator wehter he thinks its worth adding, I'm inclined to feel not myself12:51.01 
malc_ Kensharp: aha, just want to reiterate - i wasn't using Acrobat12:53.02 
ator I'd prefer not to.12:53.03 
  it's something a client can do without too much trouble using the text extraction if they feel a pressing need.12:53.38 
artifexirc-bot <KenSharp> That's hw I'd feel, yes13:08.08 
velix MuScripting is damn great!18:20.49 
  I just wrote this: "usage: mutool run pdf-split.js a.pdf@2 b.pdf@8 input.pdf"18:21.14 
  2 pages to "a.pdf", then 8 pages to "b.pdf" then again 2 pages to "a" etc.18:21.39 
  I don't know other tools, who are that flexible.18:21.50 
locustlord hey how do I use mutool to extract everything from a pdf?18:49.21 
  nvm got it18:50.36 
  ok sorry I did not18:50.52 
  it says that they're not valid font files18:50.57 
  when I try to install them18:51.03 
  is there a way to turn them into valid font files?18:54.25 
  I understand that they only contain a selection of the glyphs but I only need that selection of glyphs for my purposes anyway18:54.57 
artifexirc-bot <KenSharp> They may simply be in a format your OS does not support. Such as CFF or type 318:55.20 
locustlord nope these are ttf18:57.19 
  I'm on windows 1018:57.32 
artifexirc-bot <KenSharp> Without seeing the fonts I couldn't possibly comment 🙂18:57.48 
locustlord alright give my a second18:57.59 
  I'll upload them18:58.04 
artifexirc-bot <KenSharp> No, please don't, I'm off now its late for me.18:58.14 
  <KenSharp> It may also be that the fonts are CIDFonts with TrueType outlines, your OS won't like those either18:58.30 
locustlord https://a.tmp.ninja/qTPhQUPw.zip18:59.15 
artifexirc-bot <KenSharp> If you open the file in Acrobat and look at the fotns tab it should say if they are CIDFonts18:59.21 
locustlord ah18:59.21 
artifexirc-bot <KenSharp> Well one of those has a cff extension18:59.41 
locustlord so that means they're CIDFonts?19:00.06 
artifexirc-bot <KenSharp> The .cff is, I would guess, a CFF format font19:00.18 
  <KenSharp> (Compact Font Format) not a TTF19:00.26 
  <KenSharp> THer other ones I don't know offhand19:00.34 
  <KenSharp> Easiest way to check if you ahev Acrobat is to look at the fonts tab in the info dialog19:00.50 
locustlord can I do it on the free acrobat reader?19:01.26 
artifexirc-bot <KenSharp> Not sure, I'l look19:01.59 
locustlord maybe you'd know a better way of doing what I'm trying to do19:03.02 
  I want to change 2 text fields in an existing pdf19:03.11 
artifexirc-bot <KenSharp> I really am off but someone else may be abl;;e to answer19:03.22 
locustlord alright have a good evening19:03.33 
artifexirc-bot <KenSharp> Oh well if they are using an embedded subset font, then that's simple, you can't19:03.37 
locustlord what if the glyphs I intent to use are used elsewhere in the same font in the document/19:04.01 
artifexirc-bot <KenSharp> Then you can, but.....19:04.10 
  <KenSharp> You need to be sure they are in the same subset font.19:04.17 
locustlord they definitely are19:04.23 
artifexirc-bot <KenSharp> You then need to identify the character codes used for the glyphs you want19:04.28 
  <KenSharp> Then replace the characetr codes you want replaced by the character code for the glyphs you want instead.19:04.46 
locustlord how do I identify those codes?19:04.57 
artifexirc-bot <KenSharp> For example, 'Hello World' in a subset fotn will consist of character codes 1, 2, 3, 3, 4, 5, 6, 4, 7, 3, 819:05.35 
  <KenSharp> In general the first time a glyph is used it is assigned the character code of 1, the next glyph is given 2 and so on19:06.04 
  <KenSharp> SO you need to decode teh PDF file, find the string which uses the text containing the glyphs you want19:06.25 
locustlord how do I decode it beyond the gibberish I get when I try to open it in notepad?19:06.41 
artifexirc-bot <KenSharp> Then find the text with the glyphs you want replaced. Then you simply change the character codes in the string.19:06.44 
  <KenSharp> PDF files are usually compressed, so you need to decompress it mutool clean -d or something like that will decompress the file.19:07.09 
  <KenSharp> I foget the exact incantation19:07.21 
  <KenSharp> Note that if the new glyphs are not the same width as the old glyphs then they may collide if the characters are drawn at specific points on the page19:07.55 
  <KenSharp> If you replaced a 'i' with a 'M' for example19:08.11 
  <KenSharp> Start by decompressing the file and seeing if you can figure out what it does 🙂19:08.42 
locustlord initially I just did it in gimp and copied the stuff I wanted over the stuff I didn't, and then exported the pages as a pdf19:09.16 
  sorry for leaving my internet DC'd on me19:11.09 
  I don't know if you're still here KenSharp but if you are thank you for your help19:11.29 
  hopefully my internet doesn't crap out on me again19:13.20 
  going to restart my PC brb19:15.58 
  I'm back19:18.22 
  holy crap19:20.24 
  KenSharp u are a god19:20.30 
  I can actually figure this out19:20.37 
  I changed a 1 to a zero19:27.35 
  everything disappeared19:27.40 
  changed the zero back to 119:27.45 
  everything reappeared with the fonts screwed up19:27.55 
  some serious rocket surgery here19:28.39 
  this is super difficult to read19:31.07 
patanga why does one have to do press the right mouse button to mark text? what library is that?20:45.48 
 <<<Back 1 day (to 2021/04/07)Forward 1 day (to 2021/04/09)>>> 
ghostscript.com #ghostscript