Parsing HTML with c#.net [duplicate]
I'm trying to parse the following HTML file, I'd like the get the value of key. This is being done on Silverlight for Windows phone.
<HTML>
<link ref="shortcut icon" href="favicon.ico">
<BODY>
<script Language="JavaScript">
location.href="login.html?key=UEFu1EIsgGTgAV7guTRhsgrTQU28TImSZkYhPMLj7BChpBkvlCO11aJU2Alj4jc5"
</script>
<CENTER><a href="login.html?key=UEFu1EIsgGTgAV7guTRhsgrTQU28TImSZkYhPMLj7BChpBkvlCO11aJU2Alj4jc5">Welcome</a></CENTER></BODY></HTML>
any idea's on where to go from here?
thanks
Solution 1:
Give the HTMLAgilityPack a look into. Its a pretty decent HTML parser
http://html-agility-pack.net/?z=codeplex
Here's some code to get you started (requires error checking)
HtmlDocument document = new HtmlDocument();
string htmlString = "<html>blabla</html>";
document.LoadHtml(htmlString);
HtmlNodeCollection collection = document.DocumentNode.SelectNodes("//a");
foreach (HtmlNode link in collection)
{
string target = link.Attributes["href"].Value;
}