Linkify text file in Linux

SwagFapper Published at Java

SwagFapper

I have parsed all rows containing urls from a text file and appended line breaks, and I want to make the links clickable in a new file.

How do I append <a href> -tags around only the urls, using standard linux tools, preferably awk? It needs to be automatable in cron.

For example,

source file chaturls.txt:

    12:30 <user> check this: https://link.to/stuff.jpg</br>
    13:47 <user4> https://another.link.lol eyyyy</br>

desired output in new file, chatlinkified.html:

12:30 <user> check this: <a href='https://link.to/stuff.jpg'>https://link.to/stuff.jpg</a></br>
13:47 <user4> <a href='https://another.link.lol'>https://another.link.lol</a> eyyyy</br>

I tried awk '{printf "<a href=\"%s\">%s</a><br>", $0,$0}' chaturls.txt > chatlinkified.html, but this makes the whole line an (invalid) clickable link.

Hielke Walinga

sed -E 's@(https?://[^[:space:]/$.?#].[^[:space:]<]*)@<a href="\1">\1</a>@g' chaturls.txt > chatlinkified.html

You can use sed and refer back to the matched group with \1. NB. here I separate using the @ instead of / (as in s/../../g), you are free the use any character and this saves some escapes.

The regex for finding the URL does some validation checks for the first character after the https?:// and then proceeds the match until a space or the starting bracket of another tag.

You can if you want to use a more simpler regex for the url like, given in one of the comments https?://[^ ]*) which doesn't include this small validation.

You can find more extensive validated url regex here: https://mathiasbynens.be/demo/url-regex (But you have to convert from PHP regex to sed extended regex)

Collected from the Internet

Please contact [email protected] to delete if infringement.

edited at2020-10-3

Comments

0 comments

From Dev

Related Related

Article

Linkify text file in Linux

Linkify text file in Linux

Linkify any text Android

linux compare text file

PHP: How do I linkify all links inside a given text?

Android, EditText - How to "delinkify" already linkified (using Linkify class) text

text colors for one TextView using two Linkify's and weblinks clickable

linux replacing line in text file

Linkify Method in html.erb file showing up with HTML in view

How to create a text file with specific text in it in Linux?

How to create a text file with specific text in it in Linux?

How to Randomize Columns of a Text File in Linux

Where is the character encoding of a text file stored in Linux?

Print Text (Without File) On Linux With LPR

Linux PdfToText function return blank text file

How get value from text file in linux

Merge two columns of a text file in Linux

partial merging 2 text file in linux

Sort lines in text file with specific separator in Linux

Sort lines in text file with specific separator in Linux

Linux : Data extraction from text file

Linux: Locating text file from a string input

Linux PdfToText function return blank text file

find and replace text in file recursively in linux

Linux script to extract usernames from text file

Linux Extract Matched Text Field from File

Embedded linux to read text and append it to another file

Partially open and modify a text file in Linux

How to search each occurrence in a text file Linux?

AWK / Linux script calculations from a text file

Removing block of text from csv file in Linux