原文 C#用正则表达式去掉Html中的script脚本和html标签

/// <summary>

        /// 用正则表达式去掉Html中的script脚本和html标签
        /// </summary>
        /// <param name="Htmlstring"></param>
        /// <returns></returns>
        public static string NoHTML(string Htmlstring)
        {
            //删除脚本  
            Htmlstring = Regex.Replace(Htmlstring, @"<script[^>]*?>.*?</script>", "", RegexOptions.IgnoreCase);
            //删除HTML  
            Htmlstring = Regex.Replace(Htmlstring, @"<(.[^>]*)>", "", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"([\r\n])[\s]+", "", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"-->", "", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"<!--.*", "", RegexOptions.IgnoreCase);
 
            Htmlstring = Regex.Replace(Htmlstring, @"&(quot|#34);", "\"", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&(amp|#38);", "&", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&(lt|#60);", "<", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&(gt|#62);", ">", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&(nbsp|#160);", "   ", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&(iexcl|#161);", "\xa1", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&(cent|#162);", "\xa2", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&(pound|#163);", "\xa3", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&(copy|#169);", "\xa9", RegexOptions.IgnoreCase);
            Htmlstring = Regex.Replace(Htmlstring, @"&#(\d+);", "", RegexOptions.IgnoreCase);
 
            Htmlstring.Replace("<", "");
            Htmlstring.Replace(">", "");
            Htmlstring.Replace("\r\n", "");
            Htmlstring = HttpUtility.HtmlDecode(Htmlstring).Replace("<br/>", "").Replace("<br>", "").Trim();
 
            return Htmlstring;
        }

最新文章

  1. JS,html压缩及混淆工具
  2. 采用p6spy完整显示hibernate的SQL语句
  3. CSS skills: 4) goto page head script
  4. Hierarchy--分层。单词意思即为分层视图。
  5. Spring MVC 3.0.5+Spring 3.0.5+MyBatis3.0.4全注解实例详解(一)
  6. WTL的消息机制
  7. linux修改时区,时间格式
  8. NOIP201504推销员
  9. Omi命令行界面omi-cli发布
  10. Swift基础语法
  11. Java8-初识Lambda
  12. react-router详细解释
  13. kettle 连接 SQL Server 异常
  14. [python,2018-03-06] python中的继承顺序
  15. October 14th 2017 Week 41st Saturday
  16. 《图说VR入门》——360全景视频
  17. JAVA实现概率计算(数字不同范围按照不同几率产生随机数)
  18. [Windows Azure] Learn SQL Reporting on Windows Azure (9-Step Tutorial)
  19. ubuntu编译centos7部署大象医生 dr-elephant
  20. 【bzoj5210】最大连通子块和 动态dp

热门文章

  1. 微信红包API接口(PHP)
  2. HDOJ1232 并查集
  3. (Problem 10)Summation of primes
  4. zzuli Camellia的难题(暴力)
  5. opencv鼠标绘制直线 C++版
  6. Sqrt(x) 牛顿迭代法
  7. Qt 内存管理机制
  8. 华为OJ:计算两个自然时间相加
  9. UF访问,一些对用友最新的旗舰级产品U9一些引进(图像)
  10. 集合简单总结 ArrayList、List、Hashtable、Dictionary