c# - How Can I fetch/scrape HTML text and images to Windows phone? -

February 15, 2010

hello, want know that, how can scrape html site's text in list (ul, li) in windows phone. want make rss feed reader. please in details, new in htmlagilitypack. thanks.

this not simple think. have use htmlagility pack parse , normalize html content. need go through each node assess if it's content node or not, i.e. want ignore divs, embeds etc.

i'll try started.

read document

uri url = new uri(<your url>); htmlagilitypack.htmlweb web = new htmlagilitypack.htmlweb(); htmlagilitypack.htmldocument document = web.load(url.absoluteuri);

here how can extract image , text tags

var docnode = documentnode; // if want text withing document life simpler. string htmltext = docnode.innertext;  // images ienumerable<htmlnode> imagenodes = docnode.descendants("img"); // iterate through images , like...

if want implement readability/instapaper cleanup download nreadability https://github.com/marek-stoj/nreadability

Search This Blog

Detect

c# - How Can I fetch/scrape HTML text and images to Windows phone? -

Comments

Post a Comment

Popular posts from this blog

javascript - addthis share facebook and google+ url -

ios - Show keyboard with UITextField in the input accessory view -

c++ - importing crypto++ in QT application and occurring linker errors? -