javascript - Remove HTML tags in script - Stack Overflow

IT技术

更新时间：2025-03-1810

admin管理员组
文章数量:1332395

I've found this piece of code on the internet. It takes a sentence and makes every single word into link with this word. But it has weak side: if a sentence has HTML in it, this script doesn't remove it.

For example: it replaces 'asserted' with '/asserted'

Could you please tell me what to change in this code for it to change 'asserted' to ''.

var content = document.getElementById("sentence").innerHTML;

var punctuationless = content.replace(/[.,\/#!$%\؟^?&\*;:{}=\-_`~()”“"]/g, "");
var mixedCase = punctuationless.replace(/\s{2,}/g);
var finalString = mixedCase.toLowerCase();

var words = (finalString).split(" ");

var punctuatedWords = (content).split(" ");

var processed = "";
for (i = 0; i < words.length; i++) {
    processed += "<a href = \"/" + words[i] + "\">";
    processed += punctuatedWords[i];
    processed += "</a> ";
}

document.getElementById("sentence").innerHTML = processed;

I've found this piece of code on the internet. It takes a sentence and makes every single word into link with this word. But it has weak side: if a sentence has HTML in it, this script doesn't remove it.

For example: it replaces 'asserted' with 'http://www.merriam-webster./dictionary/asserted'

Could you please tell me what to change in this code for it to change 'asserted' to 'http://www.merriam-webster./dictionary/asserted'.

var content = document.getElementById("sentence").innerHTML;

var punctuationless = content.replace(/[.,\/#!$%\؟^?&\*;:{}=\-_`~()”“"]/g, "");
var mixedCase = punctuationless.replace(/\s{2,}/g);
var finalString = mixedCase.toLowerCase();

var words = (finalString).split(" ");

var punctuatedWords = (content).split(" ");

var processed = "";
for (i = 0; i < words.length; i++) {
    processed += "<a href = \"http://www.merriam-webster./dictionary/" + words[i] + "\">";
    processed += punctuatedWords[i];
    processed += "</a> ";
}

document.getElementById("sentence").innerHTML = processed;

Share edited Oct 18, 2016 at 9:56 Al.G. 4,4266 gold badges34 silver badges62 bronze badges asked Oct 18, 2016 at 9:33 Максим Ціпан 231 silver badge3 bronze badges

1 Possible duplicate of JavaScript: How to strip HTML tags from string? – evolutionxbox Commented Oct 18, 2016 at 9:34
You might want to escape the . in your regexp like \., like you do with \- for instance. – Azamantes Commented Oct 18, 2016 at 9:37
2 Possible duplicate of Strip HTML from Text JavaScript – Andreas Commented Oct 18, 2016 at 9:41

Add a ment |

3 Answers 3

Sorted by: Reset to default 5

This regex /<{1}[^<>]{1,}>{1}/g should replace any text in a string that is between two of these <> and the brackets themselves with a white space. This

  var str = "<hi>How are you<hi><table><tr>I<tr><table>love cake<g>"
  str = str.replace(/<{1}[^<>]{1,}>{1}/g," ")
  document.writeln(str);

will give back " How are you I love cake".

If you paste this

var stripHTML = str.mixedCase(/<{1}[^<>]{1,}>{1}/g,"")

just below this

var mixedCase = punctuationless.replace(/\s{2,}/g);

and replace mixedCase with stripHTML in the line after, it will probably work

function stripAllHtml(str) {
  if (!str || !str.length) return ''

  str = str.replace(/<script.*?>.*?<\/script>/igm, '')

  let tmp = document.createElement("DIV");
  tmp.innerHTML = str;

  return tmp.textContent || tmp.innerText || "";
}

stripAllHtml('<a>test</a>')

This function will strip all the HTML and return only text.

Hopefully, this will work for you

if you need to remove HTML tags And HTML Entities You can use

const text = '<p>test content </p><p><strong>test bold</strong>&nbsp;</p>'
text.replace(/<[^>]*(>|$)|&nbsp;|&zwnj;|&raquo;|&laquo;|&gt;/g, '');

the result will be "test content test bold"

本文标签： javascriptRemove HTML tags in scriptStack Overflow

版权声明：本文标题：javascript - Remove HTML tags in script - Stack Overflow 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1742265175a2443252.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

编程频道|软件玩家 - 软件改变生活！

javascript - Remove HTML tags in script - Stack Overflow

3 Answers 3

更多相关文章