• JackbyDev@programming.dev
    13 upvotes · 10 hours ago

    I’m still mad there’s no straightforward way to convert a PDF into semantic HTML. There are plenty of tools that convert it into HTML that looks the same, with pages and such, but I just want the content.

    • AnimalsDream@slrpnk.net
      5 upvotes · 9 hours ago

      Would it work to convert it to a simpler intermediate format like RTF or TXT, and then convert that into HTML? Why HTML anyway? Isn’t EPUB more appropriate?
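
For what it's worth, the TXT-to-HTML half of that idea could be sketched roughly like this in Python. It assumes the text has already been pulled out of the PDF by some other tool and that paragraphs are separated by blank lines. `text_to_html` is a made-up name for illustration, and anything beyond paragraphs (headings, lists, tables) would be lost.

```python
import html


def text_to_html(text: str) -> str:
    """Wrap blank-line-separated blocks of plain text in <p> elements.

    Assumption: the input is already-extracted plain text, with paragraphs
    separated by blank lines. Only paragraphs are recovered.
    """
    paragraphs = []
    for block in text.split("\n\n"):
        # Collapse hard line wraps and runs of whitespace left by extraction.
        joined = " ".join(block.split())
        if joined:
            # Escape &, <, > and quotes so the text is safe inside HTML.
            paragraphs.append(f"<p>{html.escape(joined)}</p>")
    return "\n".join(paragraphs)


if __name__ == "__main__":
    print(text_to_html("First line\nwrapped here.\n\nSecond paragraph."))
```

The catch is that this only gets you paragraphs, so it may not count as "semantic" in the sense the parent comment wants.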

        • AnimalsDream@slrpnk.net
          2 upvotes · 4 hours ago

          Yeah, I get that. I’ve just gotten used to leaving PDFs the way they are and reading them on more appropriate devices, like laptops or tablets.