Strip out HTML Tags

Question

I am reading an XML file to populate a springboard, but the descriptions in the XML have HTML tags embeded in them.  How can I quickly strip these out?Thanks,Matt

lewi-p · Answer

For anyone that's interested I used this...

function StringRemoveHTMLTags(baseStr as String) as String
    r = createObject("roRegex", "<[^<]+?>", "i")
    return r.replaceAll(baseStr, "")
end function

neoRiley · Answer

"lewi-p" wrote:For anyone that's interested I used this...function StringRemoveHTMLTags(baseStr as String) as String &nbsp; &nbsp;r = createObject("roRegex", "&lt;[^&lt;]+?&gt;", "i") &nbsp; &nbsp;return r.replaceAll(baseStr, "")end functionThank you 🙂 &nbsp;Does exactly what it says. &nbsp;I figured I'd say thanks 5yrs later since nobody else did

kbenson · Answer

"cpradio" wrote:I am reading an XML file to populate a springboard, but the descriptions in the XML have HTML tags embeded in them.  How can I quickly strip these out?Do a google search for a regular expression to strip HTML (there's tons online), and use the regular expression component in brightscript to remove them.

cpradio · Answer

Ah, missed the regular expression component.  Thanks, that should do the trick.  Was kinda hoping it would support the HTML tags, but I understand why it doesn't.

jbrave · Answer

I've had way more success using string functions like instr, mid left and right than regex. I posted a regex I found for parsing HTML a while ago, but have never actually gotten it to work beyond giving me the string "&lt;HTML&gt;" so if anyone has some code that works to get a specific tag or an associativearray of tags it would be awesome.

Forum Discussion

Strip out HTML Tags

6 Replies

Recent Discussions

SceneGraph Sample Feed Doesn't Validate

Horizontal ButtonGroup

OS 15.1.4 query/media-player Expired

Free Tools for Roku Content Preparation - Built by Fellow Indie Publisher

Developer Rev Share column in Sales Activity Report