因此,我将这样的代码从站点中某个站点的标签中获取值:
try {
URL url = new URL("google.com");
BufferedReader in = new BufferedReader(new InputStreamReader(url.openStream()));
String inputLine;
while (in.readLine() != null) {
inputLine = in.readLine();
}
in.close();
} catch (IOException e) {
e.printStackTrace();
}
所以说我需要它来找到“ Pizza”,但是只有一些代码会弹出,所以我无法访问该部分,所以有一种方法可以打印出整个HTML(使用BufferReader,并且没有像Jsoup这样的多余导入),然后检查它?
URL url = new URL("http://www.google.com");
URLConnection uc = url.openConnection();
InputStreamReader input = new InputStreamReader(uc.getInputStream());
BufferedReader in = new BufferedReader(input);
String inputLine;
FileWriter outFile = new FileWriter("orhancan");
PrintWriter out = new PrintWriter(outFile);
while ((inputLine = in.readLine()) != null) {
out.println(inputLine);
}
in.close();
out.close();
File fXmlFile = new File("orhancan");
DocumentBuilderFactory dbFactory = DocumentBuilderFactory.newInstance();
DocumentBuilder dBuilder = dbFactory.newDocumentBuilder();
Document doc = dBuilder.parse(fXmlFile);
NodeList prelist = doc.getElementsByTagName("body");
System.out.println(prelist.getLength());
有一种更简单的方法可以做到这一点。我建议使用JSoup。使用JSoup,您可以做类似的事情。json文件doc = Jsoup.connect(“ http://en.wikipedia.org/ ”).get(); 元素newsHeadlines = doc.select(“#mp-itn b a”); 或者,如果您想要身体:
Elements body = doc.select("body");
或者,如果您需要所有链接:
Elements links = doc.select("body a");
本文收集自互联网,转载请注明来源。
如有侵权,请联系[email protected] 删除。
我来说两句