你的位置:首页 > 软件开发 > ASP.net > C#实现网页爬虫

C#实现网页爬虫

发布时间:2016-01-30 11:00:11
HTTP请求工具类(功能:1、获取网页html;2、下载网络图片;):using System;using System.Collections.Generic;using System.IO;using System.Linq;using System.Net;using Sy ...

HTTP请求工具类(功能:1、获取网页html;2、下载网络图片;):

C#实现网页爬虫C#实现网页爬虫
using System;using System.Collections.Generic;using System.IO;using System.Linq;using System.Net;using System.Text;using System.Threading.Tasks;using System.Windows.Forms;namespace Utils{  /// <summary>  /// HTTP请求工具类  /// </summary>  public class HttpRequestUtil  {    /// <summary>    /// 获取页面html    /// </summary>    public static get='_blank'>string GetPageHtml(string url)    {      // 设置参数      HttpWebRequest request = WebRequest.Create(url) as HttpWebRequest;      request.UserAgent = "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.0; Trident/4.0)";      //发送请求并获取相应回应数据      HttpWebResponse response = request.GetResponse() as HttpWebResponse;      //直到request.GetResponse()程序才开始向目标网页发送Post请求      Stream responseStream = response.GetResponseStream();      StreamReader sr = new StreamReader(responseStream, Encoding.UTF8);      //返回结果网页(html)代码      string content = sr.ReadToEnd();      return content;    }    /// <summary>    /// Http下载文件    /// </summary>    public static void HttpDownloadFile(string url)    {      int pos = url.LastIndexOf("/") + 1;      string fileName = url.Substring(pos);      string path = Application.StartupPath + "\\download";      if (!Directory.Exists(path))      {        Directory.CreateDirectory(path);      }      string filePathName = path + "\\" + fileName;      if (File.Exists(filePathName)) return;      // 设置参数      HttpWebRequest request = WebRequest.Create(url) as HttpWebRequest;      request.UserAgent = "Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.0; Trident/4.0)";      //发送请求并获取相应回应数据      HttpWebResponse response = request.GetResponse() as HttpWebResponse;      //直到request.GetResponse()程序才开始向目标网页发送Post请求      Stream responseStream = response.GetResponseStream();      //创建本地文件写入流      Stream stream = new FileStream(filePathName, FileMode.Create);      byte[] bArr = new byte[1024];      int size = responseStream.Read(bArr, 0, (int)bArr.Length);      while (size > 0)      {        stream.Write(bArr, 0, size);        size = responseStream.Read(bArr, 0, (int)bArr.Length);      }      stream.Close();      responseStream.Close();    }  }}

原标题:C#实现网页爬虫

关键词:C#

C#
*特别声明:以上内容来自于网络收集,著作权属原作者所有,如有侵权,请联系我们: admin#shaoqun.com (#换成@)。

可能感兴趣文章

我的浏览记录