How can I change ^L code in many files in Ubuntu?

Control-L (represented as ^L) is the "form feed" character. In ASCII, it has decimal value 12 (L is the 12th letter of the alphabet) or hex value 0c:

$ printf 'foo\x0cbar\n' | cat -et
foo^Lbar$

$ printf 'foo\x0cbar\n'
foo
   bar

You can replace it using tools like sed by specifying the hexadecimal escape code:

$ printf 'foo\x0cbar\n' | sed 's/\x0c//'
foobar

Alternatively, compose ^L directly using the keyboard sequence CTRL+V CTRL+L

sed 's/CTRL+VCTRL+L//'

For your specific replacement, given

$ printf '<\x0cilename\n'
<
 ilename

then

$ printf '<\x0cilename\n' | sed 's/<\x0c/<\/f/g'
</filename

(the g modifier is added in case there is more than one instance per line).


As Hans-Martin Mosner points out in the comments, it seems that someone used backslashes instead of forward slashes when generating the XML (or possibly ran the whole <filename> section through a Unix-to-Windows converter which was overzealous about slashes). \f is a rarely-used escape sequence for a form-feed character, aka U+0C or ^L. So some later step of the pipeline then replaced the \f with literal U+0C characters.

Fortunately, U+0C is an extremely rare character that's unlikely to be found intentionally in any sort of XML. And since only \f would produce this, as opposed to (say) \g or \k, a universal find-and-replace should fix not only </filename> but also </folder>, </file>, or anything else that got mangled.

That's what steeldriver's sed-script does; I'd just make it very slightly more general:

sed 's|\x0c|/f|g'

This means "(s)wap all instances of \x0c (that is, U+0C) to /f, (g)lobally".


\f is the form feed character in Perl. It looks as though these malformed files were created by someone new to both Perl and XML.

Here's a much Perlier fix -- which also meets the OP's goals of automating update of all of the files, unlike the accepted answer with sed, which will only work on one file at a time as it isn't paired with find.

\f can simply be employed itself instead of the hexadecimal code x0c.

find . -type f -exec perl -pi.bkp -e 's [ \f ilename ][ /f ilename ]gx' {} \;

Here I've added -type f to tel find to only return plain files - otherwise find will return . in the list, and trigger a warning when you try to edit it, though everything else will still work.

I've also made the regex easier to see by using the x flag which ignores real whitespace, allowing you to space out the elements of your regex. If you don't like this, here it is without:

find . -type f -exec perl -pi.bkp -e 's[\filename][/filename]g' {} \;

And in the likely case that all the form feed characters are spurious and all should be replaced by /f, then you can slim the one-liner down even further:

find . -type f -exec perl -pi.bkp -e 's[\f][/f]g' {} \;

You don't need to use forward slashes to surround your regex substitution command's elements (s///) in Perl. You can use any symbol. If you choose to use any kind of paired bracket-like symbol, however, you have to use both of them: s[old][new] for instance.

Since I'm not using slashes, I don't have to escape any slashes.

As for -i.bkp: perl -pi -e lets you edit in-place -- but if you want extra insurance in case you got your find-and-replace Perl program wrong, you can put in a file extension so that it will make a copy of the original files for you. Here, I've used .bkp.

In the most recent versions of Perl, in-place editing has been updated to be more resilient in case your system suffers a serious problem like power loss or running out of disk space, too. Here's Perl author brian d foy on improved in-place editing in recent Perls.

You should consider using Perl for these kinds of tasks, because it is an extremely powerful yet under-rated general-purpose programming language, one of whose original design goals was to replace sed and awk with something much better.

Perl 5's regex matching capabilities and improved regex syntax far exceed those of sed, awk, and indeed every other programming language apart from Perl 6, making Perl the most sensible choice for both simple and advanced regex manipulations.

To clarify: sed will work OK with find too and you can also use sed -i.bkp to make a backup of each file edited, but as far as I know it doesn't feature the extra resilience in Perl 5.28 and above. It also uses the clunkier and far less powerful traditional UNIX ® regex syntax.

Tags:

Bash

Perl