The Web Scraper Plug-in for HomeSeer allows the user to create virtual devices containing information that is scraped from a web site.
This plugin is written in C# .Net 4.0 and requires Homeseer Version 3 or later.
Features:
Download from the following location:
http://www.rhusoft.com/downloads/hs3...er_1-0-1-1.zip
Extract the contents to your HS3 root folder, except for the install.txt file.
After starting the plugin it will create a HSPI_Web Scraper.ini file in the HS3 config directory along with 3 new devices.
The file can be updated in real-time and if the debug flag is set it will create 2 files in the log directory showing the results of the web site scrapping with a filename prefix of the ini file section heading.
I have requested for it to be placed in the downloader so hopefully it will show up soon.
Use this thread for now for comments or feature requests. I plan on adding many new features.
Example regular expressions below:
Jason
This plugin is written in C# .Net 4.0 and requires Homeseer Version 3 or later.
Features:
- User can use regular expressions to scrap content from any web site
- Content can be refreshed using a user specified update frequency
- Configuration is done through a configuration file(HSPIWeb Scraper.ini) in the HS3 config directory
- Device value is incremented everytime the device is updated. This makes it easy to fire events when the status text change.
Download from the following location:
http://www.rhusoft.com/downloads/hs3...er_1-0-1-1.zip
Extract the contents to your HS3 root folder, except for the install.txt file.
After starting the plugin it will create a HSPI_Web Scraper.ini file in the HS3 config directory along with 3 new devices.
The file can be updated in real-time and if the debug flag is set it will create 2 files in the log directory showing the results of the web site scrapping with a filename prefix of the ini file section heading.
I have requested for it to be placed in the downloader so hopefully it will show up soon.
Use this thread for now for comments or feature requests. I plan on adding many new features.
Example regular expressions below:
Code:
'Homeseer announcements [homeseer] URL=https://twitter.com/HomeSeer RegExSearch=(?s)js-tweet-text.*?dir="ltr">(.*?)-* *<a.*?</p> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=true 'ESPN feed - should work for any twitter feed though - just replace the URL [espn] URL=https://twitter.com/espn RegExSearch=(?s)js-tweet-text.*?dir="ltr">(.*?)<a.*?</p> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=true 'CNN Twitter feed [cnntwitterfeed] URL=https://twitter.com/cnnbrk RegExSearch=(?s)js-tweet-text.*?dir="ltr">(.*?)<a.*?</p> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=true [cnnbreakingnews] URL=http://www.cnn.com/.element/ssi/www/breaking_news/3.0/banner.html RegExSearch="content":\s*"(?<cnnbreakingnews>.*)", RegExReplace=cnnbreakingnews RefreshInterval=60 Enabled=true Debug=false [forecast] URL=http://forecast.weather.gov/MapClick.php?lat=32.9657005&lon=-117.1147095&site=all&smap=1&searchresult=San%20Diego%2C%20CA%2092129%2C%20USA RegExSearch=(?s)<li class="row-odd"><span class="label">.*?</span>(.*?)</li> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=false 'All earthquakes in the world [earthquake-world] URL=http://earthquake.usgs.gov/earthquakes/feed/v0.1/summary/all_hour.atom RegExSearch=<title>M (.*?)</title> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=true 'Magnitude 5 or higher earthquakes in the world [earthquake-world-M5+] URL=http://earthquake.usgs.gov/earthquakes/feed/v0.1/summary/all_hour.atom RegExSearch=<title>M ([5-9].*?)</title> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=true 'All earthquakes in California [earthquake-ca] URL=http://earthquake.usgs.gov/earthquakes/feed/v0.1/summary/all_hour.atom RegExSearch=<title>M (.*?), California</title> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=true 'Magnitude 2 or higher earthquakes in California [earthquake-ca-M2+] URL=http://earthquake.usgs.gov/earthquakes/feed/v0.1/summary/all_hour.atom RegExSearch=<title>M ([2-9].*?), California</title> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=true [School Status] URL=http://www.calvertnet.k12.md.us/info/status/schoolstatus.asp RegExSearch=(?s)<div style='padding-left:5px'>(.*?)</div> RegExReplace=1 RefreshInterval=60 Enabled=true Debug=true
Comment