Let's go through your options.
Wait for all background jobs, unconditionally
for i in 1 2 3 4 5; do
    cmd &
done
wait
This has the benefit of being simple, but it can't keep your machine busy: there's no way to start new jobs as old ones complete. Your machine gets less and less utilized until all the background jobs finish, at which point you can start a new batch.
Related is the ability to wait for a subset of jobs by passing multiple arguments to wait:
unrelated_job &
pids=()
for i in 1 2 3 4 5; do
    cmd & pids+=($!)
done
wait "${pids[@]}"  # Does not wait for unrelated_job, though
Wait for individual jobs in arbitrary order
pids=()
for i in 1 2 3 4 5; do
    cmd & pids+=($!)
done
for pid in "${pids[@]}"; do
    wait "$pid"
    # do something when a job completes
done
This has the benefit of letting you do work after a job completes, but it still has the problem that jobs other than $pid might complete first, leaving your machine underutilized until $pid actually completes. You do, however, still get the exit status of each individual job, even if it completes before you actually wait for it.
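As a sketch of collecting those statuses (using (exit "$i") as a stand-in for a real command that exits with status $i):

```shell
pids=() statuses=()
for i in 1 2 3; do
    (exit "$i") &    # stand-in for a command exiting with status $i
    pids+=($!)
done
for pid in "${pids[@]}"; do
    wait "$pid"      # returns $pid's status even if it exited long ago
    statuses+=($?)
done
echo "${statuses[@]}"    # → 1 2 3
```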
Wait for the next job to complete (bash 4.3 or later)
pids=()
for i in 1 2 3 4 5; do
    cmd & pids+=($!)
done
for _ in "${pids[@]}"; do    # one iteration per job
    wait -n                  # reaps whichever job finishes next
    # do something when a job completes
done
Here, wait returns as soon as any job completes, which means you can keep your machine as busy as possible. The only problem is that you don't necessarily know which job completed, without using jobs to get the list of active processes and comparing it to pids. (bash 5.1 adds wait -n -p varname, which stores the PID of the reaped job in varname.)
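One common pattern built on wait -n is to keep a fixed number of jobs running, starting a new one whenever a slot frees up. A minimal sketch (sleep stands in for a real command; the cap of 2 is arbitrary):

```shell
max_jobs=2
for i in 1 2 3 4 5; do
    # If we're at the cap, block until any one job finishes (bash 4.3+).
    while (( $(jobs -rp | wc -l) >= max_jobs )); do
        wait -n
    done
    sleep 0.2 &    # stand-in for a real command
done
wait    # drain the remaining jobs
echo "all jobs done"
```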
Other options?
The shell by itself is not an ideal platform for doing job distribution, which is why there are a multitude of programs designed for managing batch jobs: xargs, parallel, slurm, qsub, etc.
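For instance, xargs can do the throttling for you. A minimal sketch (the -P parallelism flag is a common GNU/BSD extension, not POSIX; echo stands in for a real command):

```shell
# Run at most 4 jobs at a time; xargs starts a new job as each one finishes.
printf '%s\n' 1 2 3 4 5 |
    xargs -P4 -I{} sh -c 'echo "job $1"' _ {}
# (output order depends on scheduling)
```

Passing the argument via "$1" rather than substituting {} directly into the sh -c string avoids shell-injection surprises with untrusted input.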