How to parse HTML tags by using Brightscript?

Question

We get some text with HTML tags from an API; it is privacy policy content. Is it possible to show the privacy policy content without the HTML tags, but with the right styling, such as font size and weight? Is it possible to convert the HTML tags to an equivalent format for this? Please suggest the best way. For example:

<ol>
	<li>We use Personal Data to allow you to participate in the features on the Site, to process your registration, and to provide you with other requested content related to our content and other offerings. Click here to learn more&nbsp;</li></ol>

venkatareddy · Answer

Hi, I am also looking for a solution to the same issue. If you found one, please post an update. Thanks in advance.

speechles · Answer

Brightscript Debugger> html = "<tag>hi there<another tag/><tag2> <TAG3>MORE</tag3>"
Brightscript Debugger> ? html
<tag>hi there<another tag/><tag2> <TAG3>MORE</tag3>
Brightscript Debugger> r = CreateObject("roRegex", "<.*?>", "") : ? r.ReplaceAll(html, "")
hi there MORE

Brightscript Debugger> html = "\r\n\tHELLO HOW ARE YOU?"
Brightscript Debugger> ? html
\r\n\tHELLO HOW ARE YOU?
Brightscript Debugger> r = CreateObject("roRegex", "(\r|\t|\v|\n)", "") : ? r.ReplaceAll(html, "")
HELLO HOW ARE YOU?

Brightscript Debugger> html = "<ol>
	<li>We use Personal Data to allow you to participate in the features on the Site, to process your registration, and to provide you with other requested content related to our content and other offerings. Click here to learn more&nbsp;</li></ol>"
' strip html tags
Brightscript Debugger> r = CreateObject("roRegex", "<.*?>", "") : html = r.ReplaceAll(html, "")
' strip carriage return, tab, vertical tab, newline
Brightscript Debugger> r = CreateObject("roRegex", "(\r|\t|\v|\n)", "") : html = r.ReplaceAll(html, "")
Brightscript Debugger> ? html
We use Personal Data to allow you to participate in the features on the Site, to process your registration, and to provide you with other requested content related to our content and other offerings. Click here to learn more&nbsp;
' strip non breaking space entity
Brightscript Debugger> r = CreateObject("roRegex", "&nbsp;", "") : ? r.ReplaceAll(html, "")
We use Personal Data to allow you to participate in the features on the Site, to process your registration, and to provide you with other requested content related to our content and other offerings. Click here to learn more
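
The three debugger steps above could be folded into one function. This is only a minimal sketch: the name StripHtml is hypothetical, and it assumes tags never span more than one line, since "." does not match a newline without the "s" flag.

```brightscript
' Hypothetical helper (name is an assumption, not part of any Roku API):
' removes tags, control whitespace and the &nbsp; entity from an HTML fragment.
function StripHtml(html as String) as String
    ' Remove anything that looks like a tag (non-greedy match between < and >)
    tagRegex = CreateObject("roRegex", "<.*?>", "")
    text = tagRegex.ReplaceAll(html, "")

    ' Remove carriage return, tab, vertical tab and newline characters
    wsRegex = CreateObject("roRegex", "(\r|\t|\v|\n)", "")
    text = wsRegex.ReplaceAll(text, "")

    ' A fixed string needs no regex, so a plain Replace is enough here
    return text.Replace("&nbsp;", "")
end function
```

It would be called as StripHtml(html) on the string returned by the API. Note that this throws away all structure, so it does not help with keeping font size or weight.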

NB_ · Answer

Don't use roRegex when a simple .Replace() would do; the latter is faster. roXMLElement may be of help if the HTML in question is well-formed from the point of view of XML.
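
As a rough illustration of the roXMLElement suggestion, the sketch below pulls the text of each list item out of a fragment like the one in the question. The function name ListItemsFromHtml is hypothetical. It assumes the fragment has a single root element with direct li children, and that &nbsp; is the only HTML-specific entity present (it is swapped for a space first, because it is not a predefined XML entity and would probably make the parse fail).

```brightscript
' Hypothetical helper (name is an assumption): returns an array with the
' text of every <li> directly under the root element of the fragment.
function ListItemsFromHtml(html as String) as Object
    items = []

    ' Plain string replace, no regex needed for a fixed string
    cleaned = html.Replace("&nbsp;", " ")

    xml = CreateObject("roXMLElement")
    ' Parse() returns false when the input is not well-formed XML
    if not xml.Parse(cleaned) then return items

    ' The parsed object is the root element (<ol> in the question's example)
    for each li in xml.GetNamedElements("li")
        items.Push(li.GetText())
    end for
    return items
end function
```

Because the element names survive parsing, each item could then be shown in its own label with whatever font size and weight is wanted, which plain tag stripping cannot do.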

speechles · Answer

Replace doesn't do globbing or grouping, does it? So you would still need a regex to strip the HTML tags and possibly the grouped \r \n \t \v. You are right though, the last part where it strips off the &nbsp; could've been a replace.

NB_ · Answer

I doubt the actual string would have backslash literals; that's neither here (HTML) nor there (C source) encoding. In Roku-speak, \r\n\t would have been chr(13)+chr(10)+chr(9).
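
For the control characters themselves, a sketch without any regex might look like the following. The function name StripControlChars is hypothetical; the codes are carriage return (13), line feed (10), tab (9) and vertical tab (11).

```brightscript
' Hypothetical helper (name is an assumption): removes real control
' characters using Chr() and plain string replacement.
function StripControlChars(text as String) as String
    for each code in [13, 10, 9, 11]
        text = text.Replace(Chr(code), "")
    end for
    return text
end function
```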
