• expired

[FREE eBook] Python Web Scraping at Packtpub - Today Only

160

Gday Ozbargain

From today 18/9 to Wed 19/9 9:59am, the free ebook at Packtpub is
Python Web Scraping

For those interested in making a second Price Hipster for yourself, or just improving your code-Fu
Packt Publishing is a legit publisher. Easy to make a free account and the books are DRM free and they give you many options to choose: ePub, PDF, mobi etc
Maybe call it the Price Socialist so socialists are known for more than just whirling a storm in a teacup and wrecking stores during riots when they're not happy about some 'globalisation' problem :)

Every day at 10am they'd have a new book for free, so bookmark this to be opened on your coffee break!

Related Stores

Packt Publishing
Packt Publishing

closed Comments

  • +2

    just a note: Web Scraping is not allowed by all the websites. Web Scraping could lead to serious legal issues if done without permission of the website.

    • that's a valid point, however why would any websites would allow other people to scrap their data?

    • +2

      legal issues if done without permission of the website.

      Why would you need permission to view a site that has been published to the public and not secured?

      • Doesnt matter if the site is secured or not, you can still scrap the data.

      • IIRC PriceHipster cannot scrape all sites because some don't allow pricing info for thousands of their products to be grabbed daily, and available for use by the public and their competitors.

        But scrapers can take more than just prices (eg images) which can be a legal problem

        • +1

          But scrapers can take more than just prices (eg images) which can be a legal problem

          That has nothing to do with scraping, you can do that by just downloading the image manually too…

        • +2

          But scrapers can take more than just prices (eg images) which can be a legal problem

          See i don't see how even scraping an image can be a legal problem. If you used the image without permission, that's another thing (nothing to do with scraping). But downloading an image using a scraper is no different to visiting that webpage, everything you view in a browser is downloaded, which is no different.

        • @supabrudda:
          True, if you just scrape a site for personal use and not for business its most likely fine

    • +2

      What sort of serious legal issues ?

      I'd be interested in what law they're using. Under The Cybercrime Act in most states, its basically unauthorised access to a restricted system and also intent. So if you're scraping publically available data, then I doubt the public prosecutor would entertain that.

      Sure they could legally bully you (like in the US companies have done with copyright infringements), but anyone can do that to anyone and it doesn't mean it's illegal.

      • Someone can be legally held if they scrap the website and use that data for commercial purposes without the consent of the actual website owner.

        • +2

          Right, but that's totally different. Thats just basic copyright theft, nothing to do with scraping per se. It's not how you procured the data (in your example scraping), its the fact you've committed an offence with the data you've acquired.

        • @supabrudda: Web scraping can cause issues like slow the website by sending much more request than as anticipated per second. Also you have mentioned copyright theft, this is a bigger issue as well. It goes back to what is the intent of web scrapping. Even doing for fun can cause other website to slowdown this itself can be seen as part of Denial of Service attack.

        • @DesiBoy: oh you're just making it up now.

          But please do point me in the direction where someone got into legal hotwater from causing a website to slowdown by scraping?

          Dare I say it, anyone who has a website which can be bought down by a single person scraping, wouldn't be able to afford Saul Goodman to send you a cease & desist letter, let alone afford any proper legal representation. And then there's nothing illegal about it.

          If you were using it as part of a DDOS attack, then it's no the scraping that's illegal, it's your intention to do harm, financial or otherwise to the business/website owner which is illegal, and they'll probably pursue you for loss of income, damages, loss of income, etc. But they won't ping you for scraping.

        • @supabrudda: Yes, you are right it comes down to intention or purpose of scrapping but how to justify that the intent was only for entertainment.

        • @DesiBoy: no you're wrong, you are making it up.

          Your original comment "Web Scraping is not allowed by all the websites. Web Scraping could lead to serious legal issues" and follow up comments are just plain wrong. You've just made it up.

          The act of scraping will not get you into any legal trouble, let alone into lead to serious legal issues

          Stop scaremongering.

  • I remember 20+ years ago there were already Windows software than can scrape the entire website structure and files
    The robot can pretend to be a browser too
    They'd probably have a giant disclaimer every time its run now.

    • It wasn't Xenu's Link Sleuth by any chance?
      http://home.snafu.de/tilman/xenulink.html

      • The name escapes me but deff not that one

    • WebZip, offline explorer.. the days when web was simple

      • Found it, Teleport Pro it was

    • GetRight could do that iirc

      • haha Getright
        Brings back memories
        Remembered it, it was Teleport Pro

  • +1
    • Looks like the link expired

      • +2

        bugmenot has some logins with this book & many others (lots of python books) in the collection.

        • Thanks , worked, that was so neat.

  • Trust it to have an error on the title page.. Misspelt Selenium…

Login or Join to leave a comment