• Want to try out RSS reading; Where to start?
    2 replies, posted
So let's say I wanted to take an rss like [url]http://world.kbs.co.kr/rss/rss_news.htm?lang=e[/url] and get that raw text and have a C program be able to parse just the headlines. I can understand the parsing, but where would I start in trying to download the html file and convert it into a using text format for C?
That feed is intentionally shortened so people will have to visit the website, and the website doesn't follow any standards. Just parse it with ah HTML(!) parser and figure out which page elements you need from the source. The reason you need an HTML parser is that websites are often malformed and can't be parsed by a normal strict parser for that reason. [editline]29th September 2014[/editline] This particular site has a <p id="content"></p> that you can target directly for the article text. That's not a give though.
Do you know of any good libraries for parsing with?
Sorry, you need to Log In to post a reply to this thread.