How to copy text from PDF and paste it to other application preserving text format?

The name of the pictureThe name of the pictureThe name of the pictureClash Royale CLAN TAG#URR8PPP








up vote
0
down vote

favorite












I have PDF file.

I need to copy some text from it preserving its formatting (so I need to save italic, bold, _underline_, font sizes and colors, links).



It seems that I need to copy text from PDF as HTML or RTF (or something similar).



I tried:



  • copy text from Atril, paste to LibreOffice Writer - got plain text even if I select paste as RTF;

  • copy text from Atril, paste to Abiword - got plain text;

  • copy text from Okular, paste to LibreOffice or Abiword - got plain text;

  • converting PDF to HTML with pdf2htmlex - got HTML, but copying from web-browser makes all text plain again;

  • importing PDF to LibreOffice Impress - formatting is preserved, but copying is difficult because of large amount of text blocks.

Which software from Ubuntu 16.04 LTS repository should I use to solve my problem?










share|improve this question























  • This is not a native solution, but opening the PDF in Google Doc may give you better results than LibreOffice.
    – user68186
    Apr 10 at 21:12














up vote
0
down vote

favorite












I have PDF file.

I need to copy some text from it preserving its formatting (so I need to save italic, bold, _underline_, font sizes and colors, links).



It seems that I need to copy text from PDF as HTML or RTF (or something similar).



I tried:



  • copy text from Atril, paste to LibreOffice Writer - got plain text even if I select paste as RTF;

  • copy text from Atril, paste to Abiword - got plain text;

  • copy text from Okular, paste to LibreOffice or Abiword - got plain text;

  • converting PDF to HTML with pdf2htmlex - got HTML, but copying from web-browser makes all text plain again;

  • importing PDF to LibreOffice Impress - formatting is preserved, but copying is difficult because of large amount of text blocks.

Which software from Ubuntu 16.04 LTS repository should I use to solve my problem?










share|improve this question























  • This is not a native solution, but opening the PDF in Google Doc may give you better results than LibreOffice.
    – user68186
    Apr 10 at 21:12












up vote
0
down vote

favorite









up vote
0
down vote

favorite











I have PDF file.

I need to copy some text from it preserving its formatting (so I need to save italic, bold, _underline_, font sizes and colors, links).



It seems that I need to copy text from PDF as HTML or RTF (or something similar).



I tried:



  • copy text from Atril, paste to LibreOffice Writer - got plain text even if I select paste as RTF;

  • copy text from Atril, paste to Abiword - got plain text;

  • copy text from Okular, paste to LibreOffice or Abiword - got plain text;

  • converting PDF to HTML with pdf2htmlex - got HTML, but copying from web-browser makes all text plain again;

  • importing PDF to LibreOffice Impress - formatting is preserved, but copying is difficult because of large amount of text blocks.

Which software from Ubuntu 16.04 LTS repository should I use to solve my problem?










share|improve this question















I have PDF file.

I need to copy some text from it preserving its formatting (so I need to save italic, bold, _underline_, font sizes and colors, links).



It seems that I need to copy text from PDF as HTML or RTF (or something similar).



I tried:



  • copy text from Atril, paste to LibreOffice Writer - got plain text even if I select paste as RTF;

  • copy text from Atril, paste to Abiword - got plain text;

  • copy text from Okular, paste to LibreOffice or Abiword - got plain text;

  • converting PDF to HTML with pdf2htmlex - got HTML, but copying from web-browser makes all text plain again;

  • importing PDF to LibreOffice Impress - formatting is preserved, but copying is difficult because of large amount of text blocks.

Which software from Ubuntu 16.04 LTS repository should I use to solve my problem?







libreoffice pdf format-conversion






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited Apr 10 at 21:11

























asked Apr 10 at 21:04









N0rbert

15.8k33274




15.8k33274











  • This is not a native solution, but opening the PDF in Google Doc may give you better results than LibreOffice.
    – user68186
    Apr 10 at 21:12
















  • This is not a native solution, but opening the PDF in Google Doc may give you better results than LibreOffice.
    – user68186
    Apr 10 at 21:12















This is not a native solution, but opening the PDF in Google Doc may give you better results than LibreOffice.
– user68186
Apr 10 at 21:12




This is not a native solution, but opening the PDF in Google Doc may give you better results than LibreOffice.
– user68186
Apr 10 at 21:12










1 Answer
1






active

oldest

votes

















up vote
1
down vote



accepted










That is not possible in general. Not even with Acrobat Reader Pro. Adobe recommended saving as Word doc or HTML. Evince or Okular will allow you to copy and paste using the from the viewer in utf8 but that discards formatting, mostly.



As you said, the best bet might be to use the LibreOffice import which tries to preserve many features of pdf format that map on to odf. Also, you can use the pdftk tool to select specific pages from a larger pdf file and then import them into LibreOffice.






share|improve this answer




















    Your Answer







    StackExchange.ready(function()
    var channelOptions =
    tags: "".split(" "),
    id: "89"
    ;
    initTagRenderer("".split(" "), "".split(" "), channelOptions);

    StackExchange.using("externalEditor", function()
    // Have to fire editor after snippets, if snippets enabled
    if (StackExchange.settings.snippets.snippetsEnabled)
    StackExchange.using("snippets", function()
    createEditor();
    );

    else
    createEditor();

    );

    function createEditor()
    StackExchange.prepareEditor(
    heartbeatType: 'answer',
    convertImagesToLinks: true,
    noModals: false,
    showLowRepImageUploadWarning: true,
    reputationToPostImages: 10,
    bindNavPrevention: true,
    postfix: "",
    onDemand: true,
    discardSelector: ".discard-answer"
    ,immediatelyShowMarkdownHelp:true
    );



    );













     

    draft saved


    draft discarded


















    StackExchange.ready(
    function ()
    StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2faskubuntu.com%2fquestions%2f1023816%2fhow-to-copy-text-from-pdf-and-paste-it-to-other-application-preserving-text-form%23new-answer', 'question_page');

    );

    Post as a guest






























    1 Answer
    1






    active

    oldest

    votes








    1 Answer
    1






    active

    oldest

    votes









    active

    oldest

    votes






    active

    oldest

    votes








    up vote
    1
    down vote



    accepted










    That is not possible in general. Not even with Acrobat Reader Pro. Adobe recommended saving as Word doc or HTML. Evince or Okular will allow you to copy and paste using the from the viewer in utf8 but that discards formatting, mostly.



    As you said, the best bet might be to use the LibreOffice import which tries to preserve many features of pdf format that map on to odf. Also, you can use the pdftk tool to select specific pages from a larger pdf file and then import them into LibreOffice.






    share|improve this answer
























      up vote
      1
      down vote



      accepted










      That is not possible in general. Not even with Acrobat Reader Pro. Adobe recommended saving as Word doc or HTML. Evince or Okular will allow you to copy and paste using the from the viewer in utf8 but that discards formatting, mostly.



      As you said, the best bet might be to use the LibreOffice import which tries to preserve many features of pdf format that map on to odf. Also, you can use the pdftk tool to select specific pages from a larger pdf file and then import them into LibreOffice.






      share|improve this answer






















        up vote
        1
        down vote



        accepted







        up vote
        1
        down vote



        accepted






        That is not possible in general. Not even with Acrobat Reader Pro. Adobe recommended saving as Word doc or HTML. Evince or Okular will allow you to copy and paste using the from the viewer in utf8 but that discards formatting, mostly.



        As you said, the best bet might be to use the LibreOffice import which tries to preserve many features of pdf format that map on to odf. Also, you can use the pdftk tool to select specific pages from a larger pdf file and then import them into LibreOffice.






        share|improve this answer












        That is not possible in general. Not even with Acrobat Reader Pro. Adobe recommended saving as Word doc or HTML. Evince or Okular will allow you to copy and paste using the from the viewer in utf8 but that discards formatting, mostly.



        As you said, the best bet might be to use the LibreOffice import which tries to preserve many features of pdf format that map on to odf. Also, you can use the pdftk tool to select specific pages from a larger pdf file and then import them into LibreOffice.







        share|improve this answer












        share|improve this answer



        share|improve this answer










        answered Apr 10 at 21:30









        Martin W

        74238




        74238



























             

            draft saved


            draft discarded















































             


            draft saved


            draft discarded














            StackExchange.ready(
            function ()
            StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2faskubuntu.com%2fquestions%2f1023816%2fhow-to-copy-text-from-pdf-and-paste-it-to-other-application-preserving-text-form%23new-answer', 'question_page');

            );

            Post as a guest













































































            Popular posts from this blog

            Trouble downloading packages list due to a “Hash sum mismatch” error

            How do so many people here on Academia.SE, and in general, afford lavish higher education programs?

            How do I move numbers in filenames, in a batch renaming operation?