Unable to find absolute URL

Posted by Calvin in Dev

user2950150

I'm writing some code to find the absolute URLs on a single webpage:

http://explore.bfi.org.uk/4ce2b69ea7ef3

So far I get all the links on that page and print their absolute URLs.

Here is part of the code:

    Elements hyperLinks = htmlDoc.select("a[href]");

    for (Element link : hyperLinks)
    {
        System.out.println(link.attr("abs:href"));
    }

This prints out a lot of URLs just like the one above. However, it also seems to skip a few URLs, and the ones it skips are the ones I actually need.

This is one of the a[href] elements that it is not turning into an absolute URL:

<div class="title"><a href="/4ce2b69ea7ef3">Royal Review</a><br /></div>

It prints this line if I just print "link", but when I use "abs:href" it prints a blank line.

I am new to Java and appreciate any feedback!
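
A note on the blank output: jsoup's "abs:" prefix resolves the href against the document's base URI and yields an empty string when no absolute URL can be built. The sketch below imitates that rule using only java.net.URI. It is not jsoup's implementation, and the names AbsUrlSketch and toAbsolute are made up for illustration.

```java
import java.net.URI;

public class AbsUrlSketch {

    // Hypothetical helper: resolve href against baseUri, or return ""
    // when the result is not an absolute URL (for example, an empty base).
    static String toAbsolute(String baseUri, String href) {
        try {
            URI resolved = URI.create(baseUri).resolve(href);
            return resolved.isAbsolute() ? resolved.toString() : "";
        } catch (IllegalArgumentException e) {
            return ""; // malformed base or href
        }
    }

    public static void main(String[] args) {
        String href = "/4ce2b69ea7ef3"; // the relative href from the question

        // With a base URI, the relative href becomes an absolute URL.
        System.out.println(toAbsolute("http://explore.bfi.org.uk/4ce2b69ea7ef3", href));

        // With an empty base there is nothing to resolve against.
        System.out.println("[" + toAbsolute("", href) + "]");
    }
}
```

Links that are already absolute resolve to themselves regardless of the base, which would match the symptom described here: absolute links print, relative ones come back blank.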

alex

Don't select "a[href]"; select "a" instead, following this example:

    Document doc = Jsoup.connect("http://jsoup.org").get();

    Element link = doc.select("a").first();
    String relHref = link.attr("href");     // == "/"
    String absHref = link.attr("abs:href"); // == "http://jsoup.org/"

So in your case:

    Elements hyperLinks = htmlDoc.select("a");

    for (Element link : hyperLinks)
    {
        System.out.println(link.attr("abs:href"));
    }
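
If relative links still come back blank with either selector, it may be worth checking whether htmlDoc has a base URI. Jsoup.connect(url).get() takes it from the fetched URL, but Jsoup.parse(html) on a plain string leaves it empty, so "abs:href" has nothing to resolve against. The question does not show how htmlDoc was created, so the following is only a sketch under that assumption; the class name BaseUriSketch and the helper firstAbsHref are hypothetical.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;

public class BaseUriSketch {

    // The element from the question that printed a blank line.
    static final String HTML =
        "<div class=\"title\"><a href=\"/4ce2b69ea7ef3\">Royal Review</a><br /></div>";

    // Parse HTML with the given base URI and return abs:href of the first link.
    static String firstAbsHref(String baseUri) {
        Document doc = Jsoup.parse(HTML, baseUri);
        Element link = doc.select("a[href]").first();
        return link.attr("abs:href");
    }

    public static void main(String[] args) {
        // No base URI: the relative href cannot be made absolute.
        System.out.println("[" + firstAbsHref("") + "]");

        // Base URI supplied: the same href resolves to a full URL.
        System.out.println(firstAbsHref("http://explore.bfi.org.uk/"));
    }
}
```

Passing the page's URL as the second argument to Jsoup.parse, or fetching the page with Jsoup.connect, should let the existing loop print the relative links as absolute URLs.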

Edited on 2021-02-04
