<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>WebD &#187; webscraping</title>
	<atom:link href="http://webdlabs.com/tag/webscraping/feed/" rel="self" type="application/rss+xml" />
	<link>http://webdlabs.com</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Wed, 09 May 2012 18:29:00 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Basic cURL wrapper function for PHP</title>
		<link>http://webdlabs.com/2011/03/basic-curl-wrapper-function-for-php/</link>
		<comments>http://webdlabs.com/2011/03/basic-curl-wrapper-function-for-php/#comments</comments>
		<pubDate>Sat, 26 Mar 2011 16:13:31 +0000</pubDate>
		<dc:creator>Akshay</dc:creator>
				<category><![CDATA[PHP]]></category>
		<category><![CDATA[code snippet]]></category>
		<category><![CDATA[curl]]></category>
		<category><![CDATA[webscraping]]></category>

		<guid isPermaLink="false">http://webdlabs.com/?p=81</guid>
		<description><![CDATA[I recently started using snipplr.com to share small bits of reusable code with the community. Here&#8217;s a quick and dirty basic cURL wrapper function for PHP. cURL wrappers are usually needed only when we are working on projects without a CMS or a framework, so I have tried to keep it very basic. Comments and feedback [...]]]></description>
			<content:encoded><![CDATA[<p>I recently started using <a href="http://snipplr.com/" target="_blank">snipplr.com</a> to share small bits of reusable code with the community. Here&#8217;s a <a href="http://snipplr.com/view/51161/basic-curl-wrapper-function-for-php/" target="_blank">quick and dirty basic cURL wrapper function for PHP</a>. cURL wrappers are usually needed only when we are working on projects without a CMS or a framework, so I have tried to keep it very basic. Comments and feedback are welcome.</p>
<div id="snipplr_embed_51161" class="snipplr_embed"><a href="http://snipplr.com/view/51161/basic-curl-wrapper-function-for-php/">Code snippet &#8211; Basic cURL wrapper function for PHP</a> on Snipplr</div>
<p><script type="text/javascript" src="http://snipplr.com/js/embed.js"></script><script type="text/javascript" src="http://snipplr.com/json/51161"></script></p>
]]></content:encoded>
			<wfw:commentRss>http://webdlabs.com/2011/03/basic-curl-wrapper-function-for-php/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>HTML Table Queries through WP Web Scraper</title>
		<link>http://webdlabs.com/2009/08/html-table-queries-through-wp-web-scraper/</link>
		<comments>http://webdlabs.com/2009/08/html-table-queries-through-wp-web-scraper/#comments</comments>
		<pubDate>Tue, 11 Aug 2009 09:37:43 +0000</pubDate>
		<dc:creator>Akshay</dc:creator>
				<category><![CDATA[Plugins]]></category>
		<category><![CDATA[WordPress]]></category>
		<category><![CDATA[HTML]]></category>
		<category><![CDATA[webscraping]]></category>

		<guid isPermaLink="false">http://webdlabs.com/?p=68</guid>
		<description><![CDATA[Scraping HTML tables is easy, but parsing them has always been tricky. That&#8217;s exactly what my next release of WP Web Scraper will let you do. This feature will add methods to query HTML tables within your scraped content. For instance, the scraper will let you filter by the value of a specific table column and also [...]]]></description>
			<content:encoded><![CDATA[<p>Scraping HTML tables is easy, but parsing them has always been tricky. That&#8217;s exactly what my next release of <a href="http://wordpress.org/extend/plugins/wp-web-scrapper/" target="_blank">WP Web Scraper</a> will let you do. This feature will add methods to query HTML tables within your scraped content. For instance, the scraper will let you filter by the value of a specific table column and also restrict the number of rows using a &#8216;from&#8217; and &#8216;to&#8217; index key.</p>
<p>Further, it will let you delete a specific column from the output and apply specific CSS classes to even and odd rows. This feature is designed specifically for users who want to scrape HTML tables and then filter or parse the extracted data. It will be implemented as a module within WP Web Scraper.</p>
]]></content:encoded>
			<wfw:commentRss>http://webdlabs.com/2009/08/html-table-queries-through-wp-web-scraper/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>WP Web Scraper &#8211; A WordPress Stock Market plugin</title>
		<link>http://webdlabs.com/2009/06/wp-web-scraper-a-wordpress-stock-market-plugin/</link>
		<comments>http://webdlabs.com/2009/06/wp-web-scraper-a-wordpress-stock-market-plugin/#comments</comments>
		<pubDate>Mon, 08 Jun 2009 12:35:14 +0000</pubDate>
		<dc:creator>Akshay</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Plugins]]></category>
		<category><![CDATA[WordPress]]></category>
		<category><![CDATA[NASDAQ]]></category>
		<category><![CDATA[NSE]]></category>
		<category><![CDATA[Stock Market]]></category>
		<category><![CDATA[webscraping]]></category>

		<guid isPermaLink="false">http://webdlabs.com/?p=50</guid>
		<description><![CDATA[This is probably a major milestone in the lifecycle of the WP Web Scraper WordPress plugin. Technically speaking, the plugin gets its own ‘module architecture’ to incorporate unlimited extensions without touching the core codebase. Speaking non-technically, this opens WP Web Scraper to non-technical WordPress users. To start off, this mod extends the plugin with a specific [...]]]></description>
			<content:encoded><![CDATA[<p>This is probably a major milestone in the lifecycle of the <a href="http://wordpress.org/extend/plugins/wp-web-scrapper" target="_blank">WP Web Scraper WordPress plugin</a>. Technically speaking, the plugin gets its own ‘module architecture’ to incorporate unlimited extensions without touching the core codebase. Speaking non-technically, this opens WP Web Scraper to non-technical WordPress users. To start off, this mod extends the plugin with a specific shortcode to get stock market data from NSE and NASDAQ (more exchanges are coming soon). The data is scraped with a cache interval of one minute (which can be increased to suit your requirements) and includes data types such as Open, High, Low, Last Price, Previous Close, Change, Change Percentage and Volume information for all active symbols on these exchanges.</p>
<p>The plugin API provides a simple shortcode. For example &#8211; [<code>wpws_market_data market="nse" symbol="acc" datatype="last"</code>] or [<code>wpws_market_data market="nasdaq" symbol="csco" datatype="open"</code>]. NSE data is currently scraped from nseindia.com and NASDAQ data from reuters.com. The immediate plan is to implement all major stock markets in this API. Later, I plan to extend this modular architecture to other categories of scraped data, such as weather and sports scores.</p>
]]></content:encoded>
			<wfw:commentRss>http://webdlabs.com/2009/06/wp-web-scraper-a-wordpress-stock-market-plugin/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Plan of action for WP Web Scraper</title>
		<link>http://webdlabs.com/2009/05/plan-of-action-for-wp-web-scraper/</link>
		<comments>http://webdlabs.com/2009/05/plan-of-action-for-wp-web-scraper/#comments</comments>
		<pubDate>Wed, 27 May 2009 06:50:24 +0000</pubDate>
		<dc:creator>Akshay</dc:creator>
				<category><![CDATA[Plugins]]></category>
		<category><![CDATA[WordPress]]></category>
		<category><![CDATA[BSE]]></category>
		<category><![CDATA[NASDAQ]]></category>
		<category><![CDATA[NSE]]></category>
		<category><![CDATA[Stock Market]]></category>
		<category><![CDATA[webscraping]]></category>

		<guid isPermaLink="false">http://webdlabs.com/?p=48</guid>
		<description><![CDATA[My latest WordPress plugin for web scraping, WP Web Scraper, had a grand launch. It recorded more than 200 downloads in the first two days alone! Thanks for all the appreciation and comments. This post mainly outlines my plan to extend WP Web Scraper into a standard scraping framework. Apart from [...]]]></description>
			<content:encoded><![CDATA[<p>My latest WordPress plugin for web scraping, <a href="http://webdlabs.com/projects/wp-web-scraper/" target="_self">WP Web Scraper</a>, had a grand launch. It recorded more than 200 downloads in the first two days alone! Thanks for all the appreciation and comments. This post mainly outlines my plan to extend WP Web Scraper into a standard scraping framework. Apart from being a flexible framework, I also plan to introduce some pre-built modules to make specific and highly desired scraping tasks easy. The first such module will be a stock market data grabber. This module will extend the plugin to get stock market data easily from the websites of various big exchanges (I am planning to support NSE, BSE and NASDAQ to start off with). The data will be almost real-time (with a delay of between 1 and 10 minutes) and will include Open, High, Low, Last Price, Previous Close, Change, Change Percentage and Volume information for all active symbols on these exchanges.</p>
<p>The plugin API will provide a shortcode along these lines: [<code>wpws mod="nse" symbol="acc" datatype="last"</code>], which should output the latest price for <a href="http://nseindia.com/marketinfo/equities/cmquote_tab.jsp?key=ACCEQN&amp;symbol=ACC&amp;flag=0" target="_blank">ACC listed at NSE</a>. The aim is to make it an extensible module framework, and hence I am taking time to code it well. Apart from these features, I am also planning to improve the core scraper with functionality such as a regex-powered cleanup function to remove all unwanted text strings from the scraped content, and a more flexible algorithm to query HTML tables returned by the scraper.</p>
]]></content:encoded>
			<wfw:commentRss>http://webdlabs.com/2009/05/plan-of-action-for-wp-web-scraper/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Bringing Web Scraping to WordPress!</title>
		<link>http://webdlabs.com/2009/05/bringing-web-scraping-to-wordpress/</link>
		<comments>http://webdlabs.com/2009/05/bringing-web-scraping-to-wordpress/#comments</comments>
		<pubDate>Sun, 24 May 2009 06:24:53 +0000</pubDate>
		<dc:creator>Akshay</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Plugins]]></category>
		<category><![CDATA[WordPress]]></category>
		<category><![CDATA[curl]]></category>
		<category><![CDATA[phpquery]]></category>
		<category><![CDATA[webscraping]]></category>

		<guid isPermaLink="false">http://webdlabs.com/?p=47</guid>
		<description><![CDATA[Web scraping (or Web harvesting, Web data extraction) is a computer software technique of extracting information from websites. Web scraping focuses more on the transformation of unstructured Web content, typically in HTML format, into structured data that can be formatted and displayed or stored and analyzed. Exemplary uses of Web scraping include online price comparison, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Web_scraping" target="_blank">Web scraping</a> (or Web harvesting, Web data extraction) is a computer software technique of extracting information from websites. Web scraping focuses more on the transformation of unstructured Web content, typically in HTML format, into structured data that can be formatted and displayed or stored and analyzed. Exemplary uses of Web scraping include online price comparison, weather data monitoring, market data tracking, Web content mashup and Web data integration.</p>
<p>Imagine what you can do with all this power in your WordPress blog! Pages and posts can display real-time content from other pages, letting you create a mashup of content. All of this is now possible using my <a href="http://wordpress.org/extend/plugins/wp-web-scrapper" target="_blank">WP Web Scraper plugin</a>. It&#8217;s an easy-to-implement professional web scraper for WordPress. It can be used to display real-time data from any website directly in your posts, pages or sidebar. Use it to include real-time stock quotes, cricket or soccer scores or any other generic content. The scraper is built using two time-tested libraries: cURL for scraping and phpQuery for parsing HTML. Please post all your suggestions and thoughts about this on the <a href="http://webdlabs.com/projects/wp-web-scraper/" target="_self">WP Web Scraper project page</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://webdlabs.com/2009/05/bringing-web-scraping-to-wordpress/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

