
HTML web pages being converted to PDF

  • HTML web pages being converted to PDF

    Posted 16 years 1 day ago
    • Hi

      Not really a Rocket Theme issue as such, but I'm hoping someone might be able to help.

      A good number of my pages seem to have been converted to PDF files.

      For example, if you search Google for "Wheal Grey, Newton", the first result is my page. It is not the HTML file I created; somehow it has been converted (not very well) to a PDF document.

      How can this be? Has anyone else come across this before?

      Regards

      Paul
    • www.west5web.com
  • Re: HTML web pages being converted to PDF

    Posted 16 years 1 day ago
  • Re: HTML web pages being converted to PDF

    Posted 16 years 21 hours ago
    • Hi Nanci

      thanks for the reply.

      I am confused as to why and how these PDFs are being created in the first place. I have never had any sites where this has happened before.

      cheers

      Paul
    • www.west5web.com
  • Re: HTML web pages being converted to PDF

    Posted 15 years 11 months ago
  • Re: HTML web pages being converted to PDF

    Posted 15 years 10 months ago
    • Hi

      just an update on this issue:

      in an attempt to stop Google indexing the PDF versions of my web pages I added the following code to the robots.txt file:

      Disallow: /index.php?view=article*&format=pdf
      Disallow: /index.php?view=article*&print=1*
      Disallow: /index.php?option=com_mailto*
      Disallow: /component/mailto/*

      Does this look correct?

      It's just that I still have over 300 PDF versions of my web pages indexed with Google. I thought they should have been removed from the index by now.

      Cheers

      Paul
    • www.west5web.com
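As a rough way to sanity-check the four Disallow lines in the post above, the sketch below applies Google-style wildcard matching: `*` matches any run of characters, a trailing `$` anchors the end, and a rule must match from the start of the path and query string. The helper names `rule_matches` and `is_blocked` and the sample URLs are hypothetical, and the parameter order in the second URL is an assumption about how Joomla might build the link, not something taken from the site.

```python
import re

# The four rules quoted in the post above.
RULES = [
    "/index.php?view=article*&format=pdf",
    "/index.php?view=article*&print=1*",
    "/index.php?option=com_mailto*",
    "/component/mailto/*",
]


def rule_matches(rule_path: str, url_path: str) -> bool:
    """Return True if a Disallow path matches url_path.

    Assumes Google-style matching: '*' is a wildcard, a trailing '$'
    anchors the end, and matching starts at the beginning of the path.
    """
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape the literal pieces and join them with a regex wildcard.
    pattern = ".*".join(re.escape(part) for part in rule_path.split("*"))
    if anchored:
        pattern += "$"
    return re.match(pattern, url_path) is not None


def is_blocked(url_path: str) -> bool:
    """Return True if any rule in RULES matches url_path."""
    return any(rule_matches(rule, url_path) for rule in RULES)


# Hypothetical URLs, for illustration only.
SAMPLES = [
    "/index.php?view=article&id=5&format=pdf",
    "/index.php?option=com_content&view=article&id=5&format=pdf",
    "/component/mailto/?tmpl=component",
]

for sample in SAMPLES:
    print(sample, "blocked" if is_blocked(sample) else "not blocked")
```

Under these assumptions, a PDF link whose query string begins with `option=com_content` rather than `view=article` is not caught by the first rule, which may be one reason a pattern like this misses some URLs. Blocking crawling also does not by itself remove URLs that are already in the index.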
  • Re: HTML web pages being converted to PDF

    Posted 15 years 10 months ago
  • Re: HTML web pages being converted to PDF

    Posted 15 years 10 months ago
    • Thanks Nanci

      I've added the variables you suggested to the robots file. I'll let you know how I get on :cheesy:

      I thought I was using user friendly URLs ???

      cheers

      Paul
    • www.west5web.com
  • Re: HTML web pages being converted to PDF

    Posted 15 years 10 months ago
    • Good deal. Also check your components list over FTP to see what else may have been added. I can't do that from my end.

      What I meant about the URLs: above you list Disallow: /index.php?view=article*&format=pdf

      I don't think you have to list the full URL; I think you just list the component/ folder for each component, module, or plugin installed. So look at the folder structure of these items.

      For example, above I listed component/gallery, but now I'm thinking just component would take care of it. Here is how I have one of mine set up. By all means, check it against Google's reference.

      User-agent: *
      Disallow: /administrator/
      Disallow: /cache/
      Disallow: /components/
      Disallow: /images/
      Disallow: /includes/
      Disallow: /language/
      Disallow: /libraries/
      Disallow: /media/
      Disallow: /modules/
      Disallow: /templates/
      Disallow: /tmp/
      Disallow: /working_folder/
      Disallow: /xmlrpc/

      Luck to you!
  • Re: HTML web pages being converted to PDF

    Posted 15 years 10 months ago
    • Hi Nanci

      my full robots file looks like this:

      User-agent: *
      Disallow: /administrator/
      Disallow: /cache/
      Disallow: /components/
      Disallow: /images/
      Disallow: /includes/
      Disallow: /installation/
      Disallow: /language/
      Disallow: /libraries/
      Disallow: /media/
      Disallow: /modules/
      Disallow: /plugins/
      Disallow: /templates/
      Disallow: /tmp/
      Disallow: /xmlrpc/
      Disallow: /index.php?view=article*&format=pdf
      Disallow: /index.php?view=article*&print=1*
      Disallow: /index.php?option=com_mailto*
      Disallow: /component/mailto/*
      Disallow: /component/search
      Disallow: /component/contact
      Disallow: /component/gallery

      Guess I don't need the bottom 4 as I already have: Disallow: /components/

      cheers :D

      Paul
    • www.west5web.com
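Whether `Disallow: /components/` really covers the `/component/...` entries in the file above can be checked with Python's standard `urllib.robotparser`, which does plain prefix matching on the path (it does not interpret `*` wildcards, so only the folder rule is tested here). This is a minimal sketch: the helper name `allowed` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Only the folder rule under test; the rest of the file is omitted.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /components/",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(path: str) -> bool:
    """Return True if a generic crawler may fetch the given path."""
    return parser.can_fetch("*", path)


# Hypothetical paths: one inside the /components/ folder on disk,
# two using the singular /component/ URL prefix.
for path in (
    "/components/com_search/search.php",
    "/component/search",
    "/component/mailto/",
):
    print(path, "allowed" if allowed(path) else "disallowed")
```

Because matching is by prefix, `/components/` (plural) does not match paths starting with `/component/` (singular), so under this reading the separate `/component/...` lines are not redundant.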
  • Re: HTML web pages being converted to PDF

    Posted 15 years 10 months ago
