How to count the characters of jar files by wc

Under the folder /usr/hdp/2.6.4.0-91/tez_hive2/lib/
We have .jar files as the following

$ ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar"

async-http-client-1.8.16.jar

azure-data-lake-store-sdk-2.1.4.jar

commons-cli-1.2.jar

commons-codec-1.4.jar

commons-collections-3.2.2.jar

commons-collections4-4.1.jar

commons-io-2.4.jar

commons-lang-2.6.jar

commons-math3-3.1.1.jar

guava-11.0.2.jar

hadoop-aws-2.7.3.2.6.4.0-91.jar

hadoop-azure-2.7.3.2.6.4.0-91.jar

hadoop-azure-datalake-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-common-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-core-2.7.3.2.6.4.0-91.jar

hadoop-yarn-server-timeline-pluginstorage-2.7.3.2.6.4.0-91.jar

jersey-client-1.9.jar

jersey-json-1.9.jar

jettison-1.3.4.jar

jetty-6.1.26.hwx.jar

jetty-util-6.1.26.hwx.jar

jsr305-3.0.0.jar

metrics-core-3.1.0.jar

protobuf-java-2.5.0.jar

RoaringBitmap-0.4.9.jar

servlet-api-2.5.jar

slf4j-api-1.7.10.jar

I want to count all characters from the .jar files by wc , in order to understand if .jar files renamed

So I do the following command in order to count all characters from all .jar files

ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

758

So in this case we get 758 characters from all .jars

But the command isn’t elegant

How we can improve the command to be better?

edited 10 hours ago

jimmij

32k874108

asked 11 hours ago

yael

2,66422571

1

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
11 hours ago

what I want it to count all characters from the output by wc or any other suggestion

– yael
11 hours ago

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
11 hours ago

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
11 hours ago

1

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
11 hours ago

|
show 5 more comments

Under the folder /usr/hdp/2.6.4.0-91/tez_hive2/lib/
We have .jar files as the following

$ ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar"

async-http-client-1.8.16.jar

azure-data-lake-store-sdk-2.1.4.jar

commons-cli-1.2.jar

commons-codec-1.4.jar

commons-collections-3.2.2.jar

commons-collections4-4.1.jar

commons-io-2.4.jar

commons-lang-2.6.jar

commons-math3-3.1.1.jar

guava-11.0.2.jar

hadoop-aws-2.7.3.2.6.4.0-91.jar

hadoop-azure-2.7.3.2.6.4.0-91.jar

hadoop-azure-datalake-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-common-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-core-2.7.3.2.6.4.0-91.jar

hadoop-yarn-server-timeline-pluginstorage-2.7.3.2.6.4.0-91.jar

jersey-client-1.9.jar

jersey-json-1.9.jar

jettison-1.3.4.jar

jetty-6.1.26.hwx.jar

jetty-util-6.1.26.hwx.jar

jsr305-3.0.0.jar

metrics-core-3.1.0.jar

protobuf-java-2.5.0.jar

RoaringBitmap-0.4.9.jar

servlet-api-2.5.jar

slf4j-api-1.7.10.jar

I want to count all characters from the .jar files by wc , in order to understand if .jar files renamed

So I do the following command in order to count all characters from all .jar files

ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

758

So in this case we get 758 characters from all .jars

But the command isn’t elegant

How we can improve the command to be better?

edited 10 hours ago

jimmij

32k874108

asked 11 hours ago

yael

2,66422571

1

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
11 hours ago

what I want it to count all characters from the output by wc or any other suggestion

– yael
11 hours ago

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
11 hours ago

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
11 hours ago

1

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
11 hours ago

|
show 5 more comments

Under the folder /usr/hdp/2.6.4.0-91/tez_hive2/lib/
We have .jar files as the following

$ ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar"

async-http-client-1.8.16.jar

azure-data-lake-store-sdk-2.1.4.jar

commons-cli-1.2.jar

commons-codec-1.4.jar

commons-collections-3.2.2.jar

commons-collections4-4.1.jar

commons-io-2.4.jar

commons-lang-2.6.jar

commons-math3-3.1.1.jar

guava-11.0.2.jar

hadoop-aws-2.7.3.2.6.4.0-91.jar

hadoop-azure-2.7.3.2.6.4.0-91.jar

hadoop-azure-datalake-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-common-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-core-2.7.3.2.6.4.0-91.jar

hadoop-yarn-server-timeline-pluginstorage-2.7.3.2.6.4.0-91.jar

jersey-client-1.9.jar

jersey-json-1.9.jar

jettison-1.3.4.jar

jetty-6.1.26.hwx.jar

jetty-util-6.1.26.hwx.jar

jsr305-3.0.0.jar

metrics-core-3.1.0.jar

protobuf-java-2.5.0.jar

RoaringBitmap-0.4.9.jar

servlet-api-2.5.jar

slf4j-api-1.7.10.jar

I want to count all characters from the .jar files by wc , in order to understand if .jar files renamed

So I do the following command in order to count all characters from all .jar files

ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

758

So in this case we get 758 characters from all .jars

But the command isn’t elegant

How we can improve the command to be better?

edited 10 hours ago

jimmij

32k874108

asked 11 hours ago

yael

2,66422571

Under the folder /usr/hdp/2.6.4.0-91/tez_hive2/lib/
We have .jar files as the following

$ ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar"

async-http-client-1.8.16.jar

azure-data-lake-store-sdk-2.1.4.jar

commons-cli-1.2.jar

commons-codec-1.4.jar

commons-collections-3.2.2.jar

commons-collections4-4.1.jar

commons-io-2.4.jar

commons-lang-2.6.jar

commons-math3-3.1.1.jar

guava-11.0.2.jar

hadoop-aws-2.7.3.2.6.4.0-91.jar

hadoop-azure-2.7.3.2.6.4.0-91.jar

hadoop-azure-datalake-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-common-2.7.3.2.6.4.0-91.jar

hadoop-mapreduce-client-core-2.7.3.2.6.4.0-91.jar

hadoop-yarn-server-timeline-pluginstorage-2.7.3.2.6.4.0-91.jar

jersey-client-1.9.jar

jersey-json-1.9.jar

jettison-1.3.4.jar

jetty-6.1.26.hwx.jar

jetty-util-6.1.26.hwx.jar

jsr305-3.0.0.jar

metrics-core-3.1.0.jar

protobuf-java-2.5.0.jar

RoaringBitmap-0.4.9.jar

servlet-api-2.5.jar

slf4j-api-1.7.10.jar

I want to count all characters from the .jar files by wc , in order to understand if .jar files renamed

So I do the following command in order to count all characters from all .jar files

ls  /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

758

So in this case we get 758 characters from all .jars

But the command isn’t elegant

How we can improve the command to be better?

bash shell-script awk sed wc

edited 10 hours ago

jimmij

32k874108

asked 11 hours ago

yael

2,66422571

edited 10 hours ago

jimmij

32k874108

asked 11 hours ago

yael

2,66422571

edited 10 hours ago

jimmij

32k874108

edited 10 hours ago

jimmij

32k874108

edited 10 hours ago

jimmij

32k874108

asked 11 hours ago

yael

2,66422571

asked 11 hours ago

yael

2,66422571

asked 11 hours ago

yael

2,66422571

1

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
11 hours ago

what I want it to count all characters from the output by wc or any other suggestion

– yael
11 hours ago

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
11 hours ago

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
11 hours ago

1

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
11 hours ago

|
show 5 more comments

1

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
11 hours ago

what I want it to count all characters from the output by wc or any other suggestion

– yael
11 hours ago

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
11 hours ago

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
11 hours ago

1

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
11 hours ago

You want to get total string length of all the filename? That sounds like a strange request. Smells like an XY problem.

– glenn jackman
11 hours ago

what I want it to count all characters from the output by wc or any other suggestion

– yael
11 hours ago

1) Wouldn't it make sense to get the length of "each" filename? 2) That wouldn't really tell you a whole lot as it can be renamed to something with the same amount of letters. 3) There really isn't a way to tell if any of the files have been renamed unless a script runs something like ls -l | awk '{print $NF} > jarlog.txt so that you can compare the filenames.

– Nasir Riley
11 hours ago

look only I want is to improve the cli - ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/ | grep ".jar" | sed s'/// /g' | awk '{print $NF}' | wc | awk '{print $NF}'

– yael
11 hours ago

printf '%sn' *.jar | md5sum wouldn't be blind to renaming to something with the same amount of letters.

– Kamil Maciorowski
11 hours ago

|
show 5 more comments

4 Answers
4

active

oldest

votes

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

The path with wildcard list all jar files, the basename command strips directories (-a is needed to accept many arguments), and wc -c just counts bytes (if some filenames consist of 2 byte characters then perhaps wc -m (characters count) would be a better choice).

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited 10 hours ago

answered 11 hours ago

jimmij

32k874108

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
11 hours ago

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
10 hours ago

1

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
10 hours ago

add a comment |

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

Change echo to ls -l if you want to also compare timestamps etc. Change it to stat if you want to compare even more meta data (this would generate diff output when even the last-access timestamp on a file changed). Install wdiff and change diff to wdiff to get a word-based diff rather than a line-based one.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited 10 hours ago

answered 10 hours ago

Kusalananda

133k17253416

add a comment |

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited 11 hours ago

Stéphane Chazelas

308k57581939

answered 11 hours ago

glenn jackman

52k572112

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
10 hours ago

add a comment |

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

Aside from that, it seems that you want to count the total number of characters in all the file names of the .jar files. IF so, try this:
ls /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | sed s'/// /g' | wc -c

Two other possibilities for your sed command:
sed "s|/| |g" -- or -- tr '/' ' '

Since your ls command won't show directory names, I am not sure you need it.

This will also count the LF at the end of each file name. Is that OK?

answered 11 hours ago

Scottie H

366

add a comment |

Your Answer

StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "106"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});

function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});

}
});

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2funix.stackexchange.com%2fquestions%2f503621%2fhow-to-count-the-characters-of-jar-files-by-wc%23new-answer', 'question_page');
}
);

Post as a guest

Name

Required, but never shown

4 Answers
4

active

oldest

votes

4 Answers
4

active

oldest

votes

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited 10 hours ago

answered 11 hours ago

jimmij

32k874108

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
11 hours ago

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
10 hours ago

1

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
10 hours ago

add a comment |

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited 10 hours ago

answered 11 hours ago

jimmij

32k874108

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
11 hours ago

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
10 hours ago

1

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
10 hours ago

add a comment |

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited 10 hours ago

answered 11 hours ago

jimmij

32k874108

Most probably you are looking for

basename -a /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar | wc -c

However, to if the goal is to check if files have been modified then perhaps stat (for modification time) or md5sum/shasum for checksum would be a better tools.

edited 10 hours ago

answered 11 hours ago

jimmij

32k874108

edited 10 hours ago

answered 11 hours ago

jimmij

32k874108

answered 11 hours ago

jimmij

32k874108

answered 11 hours ago

jimmij

32k874108

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
11 hours ago

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
10 hours ago

1

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
10 hours ago

add a comment |

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
11 hours ago

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
10 hours ago

1

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
10 hours ago

wc -c counts the number of bytes, not characters. wc -m counts the number of characters. Also note that you're also counting one extra newline character per file.

– Stéphane Chazelas
11 hours ago

what is the diff between wc -c to wc -m ? ( because on both I get the same results )

– yael
10 hours ago

@yael, wc -m counts the number of characters, wc -c the number of bytes. That makes a difference in the case of characters made of more than one byte (in UTF-8, that's all the non-ASCII ones (over a million of them)). Compare printf € | wc -c with printf € | wc -m.

– Stéphane Chazelas
10 hours ago

add a comment |

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited 10 hours ago

answered 10 hours ago

Kusalananda

133k17253416

add a comment |

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited 10 hours ago

answered 10 hours ago

Kusalananda

133k17253416

add a comment |

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited 10 hours ago

answered 10 hours ago

Kusalananda

133k17253416

Counting the number of letters in the filenames would not be a safe way of detecting a renamed filename.

Instead, create a simple file listing of the names, and compare it to an existing list. By using diff, you would be shown exactly which line(s) in the list had changed.

#!/bin/sh



LC_ALL=C



newlist=$HOME/filelist.new

oldlist=$HOME/filelist.old



echo /usr/hdp/2.6.4.0-91/tez_hive2/lib/*.jar >"$newlist"



if [ -f "$oldlist" ]; then

    diff -u "$oldlist" "$newlist"

fi



mv "$newlist" "$oldlist"

Obviously, the first time you do this, filelist.old would not exist, so the diff would not run.

Note that I save the full path to each file in the output file. This does not matter since the directory path is static.

The LC_ALL=C is to guarantee a consistent sorting of the expansion of the shell glob.

edited 10 hours ago

answered 10 hours ago

Kusalananda

133k17253416

edited 10 hours ago

answered 10 hours ago

Kusalananda

133k17253416

answered 10 hours ago

Kusalananda

133k17253416

answered 10 hours ago

Kusalananda

133k17253416

add a comment |

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited 11 hours ago

Stéphane Chazelas

308k57581939

answered 11 hours ago

glenn jackman

52k572112

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
10 hours ago

add a comment |

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited 11 hours ago

Stéphane Chazelas

308k57581939

answered 11 hours ago

glenn jackman

52k572112

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
10 hours ago

add a comment |

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited 11 hours ago

Stéphane Chazelas

308k57581939

answered 11 hours ago

glenn jackman

52k572112

To just get the number of characters in the (non-hidden) jar filenames, I would do

cd /usr/hdp/2.6.4.0-91/tez_hive2/lib/ && printf %s *.jar | wc -m

This will not count any newlines, just the filename characters (replace wc -m with wc -c for the number of bytes instead of characters).

Purposefully, I'm not parsing ls output.

edited 11 hours ago

Stéphane Chazelas

308k57581939

answered 11 hours ago

glenn jackman

52k572112

edited 11 hours ago

Stéphane Chazelas

308k57581939

edited 11 hours ago

Stéphane Chazelas

308k57581939

edited 11 hours ago

Stéphane Chazelas

308k57581939

answered 11 hours ago

glenn jackman

52k572112

answered 11 hours ago

glenn jackman

52k572112

answered 11 hours ago

glenn jackman

52k572112

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
10 hours ago

add a comment |

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
10 hours ago

Note that in shells like bash, if there's no .jar file in the current directory, that will output 5 (the number of characters in *.jar). In bash, you can do shopt -s nullglob to get 0 in that case.

– Stéphane Chazelas
10 hours ago

add a comment |

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

answered 11 hours ago

Scottie H

366

add a comment |

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

answered 11 hours ago

Scottie H

366

add a comment |

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

answered 11 hours ago

Scottie H

366

I don't understand the point of the sed command, you are replacing a slash with a space? Why?

answered 11 hours ago

Scottie H

366

answered 11 hours ago

Scottie H

366

answered 11 hours ago

Scottie H

366

answered 11 hours ago

Scottie H

366

add a comment |

draft saved

draft discarded

Thanks for contributing an answer to Unix & Linux Stack Exchange!

Please be sure to answer the question. Provide details and share your research!

But avoid …

Asking for help, clarification, or responding to other answers.

Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.

draft saved

draft discarded

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Sign up or log in

StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});

Post as a guest

Name

Required, but never shown

Name

Required, but never shown

Name

Required, but never shown

This page is only for reference, If you need detailed information, please check here

搜尋此網誌

Argthtjtr