Debian Bug report logs - #1078514
autopkgtest: tmpfails if removal of tmp dir fails

Package: autopkgtest; Maintainer for autopkgtest is Debian CI team <[email protected]>; Source for autopkgtest is src:autopkgtest (PTS, buildd, popcon).

Reported by: Paul Gevers <[email protected]>

Date: Thu, 8 Aug 2024 10:42:02 UTC

Severity: normal

Found in version autopkgtest/5.38


Report forwarded to [email protected], Debian CI team <[email protected]>:
Bug#1078214; Package autopkgtest. (Thu, 08 Aug 2024 10:42:03 GMT) (full text, mbox, link).


Acknowledgement sent to Paul Gevers <[email protected]>:
New Bug report received and forwarded. Copy sent to Debian CI team <[email protected]>. (Thu, 08 Aug 2024 10:42:03 GMT) (full text, mbox, link).


Message #5 received at [email protected] (full text, mbox, reply):

From: Paul Gevers <[email protected]>
To: Debian Bug Tracking System <[email protected]>
Subject: autopkgtest: several packages triggers tmpfail instead of fail
Date: Thu, 8 Aug 2024 12:40:15 +0200
Package: autopkgtest
Version: 5.38
Severity: normal

Hi,

This bug is mostly to document and track an issue that has been annoying 
me for a long time already. On ci.d.n we have reject_listed [0] several 
packages because they tmpfail instead of failing. This points at 
autopkgtest not being able to handle the situation as I think it should.

Examples where I just verified this is the case:
* llvm-toolchain-11 on armhf in bullseye [1] as reported in bug #980957
* llvm-toolchain-13 on armhf in bullseye [2] as reported in bug #980957

Examples where the reject_list mentions tmpfails:
* libreoffice on armel
* linux (qemu) on stable and oldstable
* openjdk-8
* openjdk-11 (I verified it no longer tmpfails on unstable, maybe
              oldstable is OK too, but I didn't check)
* openjdk-21 on armel and riscv64
* openjdk-22 on riscv64
* openjdk-23 on armel, loong64 and riscv64

I recall that most of those openjdk-* failures actually happen during
cleanup, where `rm` complains that the /tmp/$some-autopkgtest-dir
directory isn't empty. That appears to be a race condition to me:
cleanup starts before the tests have really stopped writing files
after being killed. I wonder whether retrying the removal at least once
would go a long way toward solving that particular issue.
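
To illustrate the retry idea, here is a minimal Python sketch. It is not autopkgtest's actual cleanup code: the function name `remove_tree_with_retry` and the `attempts` and `delay` parameters are hypothetical, and it assumes the removal fails only because a dying process recreates files while the tree is being deleted.

```python
import shutil
import time
from pathlib import Path


def remove_tree_with_retry(path, attempts=3, delay=0.5):
    """Remove a directory tree, retrying if the removal fails.

    Hypothetical helper for illustration only.
    Returns True once the path is gone, False if every attempt failed.
    """
    target = Path(path)
    for attempt in range(1, attempts + 1):
        try:
            shutil.rmtree(target)
        except FileNotFoundError:
            # Already gone (or never existed): nothing left to clean up.
            return True
        except OSError:
            # For example "directory not empty", because a process that
            # was just killed wrote a file after rmtree listed the dir.
            if attempt < attempts:
                time.sleep(delay)
            continue
        return True
    # All attempts raised; report whether the tree is still there.
    return not target.exists()
```

A caller could treat a False return as a cleanup problem to log, separately from the test result itself; whether that is the right policy for autopkgtest is a design question this sketch does not answer.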

Paul

[0] https://ci.debian.net/status/reject_list/
[1] https://ci.debian.net/packages/l/llvm-toolchain-11/oldstable/armhf/50087518/
156s clang-$VERSION -flto foo.c -o foo
156s clang: error: unable to execute command: Segmentation fault
156s clang: error: linker command failed due to signal (use -v to see invocation)
161s autopkgtest [08:22:06]: ERROR: testbed failure: testbed auxverb failed with exit code 254

[2] https://ci.debian.net/packages/l/llvm-toolchain-13/oldstable/armhf/50087616/
151s clang-$VERSION -flto foo.c -o foo
151s clang: error: unable to execute command: Segmentation fault
151s clang: error: linker command failed due to signal (use -v to see invocation)
156s autopkgtest [09:03:07]: ERROR: testbed failure: testbed auxverb failed with exit code 254

Information forwarded to [email protected], Debian CI team <[email protected]>:
Bug#1078214; Package autopkgtest. (Sun, 11 Aug 2024 18:06:05 GMT) (full text, mbox, link).


Acknowledgement sent to Paul Gevers <[email protected]>:
Extra info received and forwarded to list. Copy sent to Debian CI team <[email protected]>. (Sun, 11 Aug 2024 18:06:05 GMT) (full text, mbox, link).


Message #10 received at [email protected] (full text, mbox, reply):

From: Paul Gevers <[email protected]>
To: [email protected]
Subject: Re: autopkgtest: several packages triggers tmpfail instead of fail
Date: Sun, 11 Aug 2024 20:03:12 +0200
Control: clone -1 -2
Control: retitle -1 autopkgtest: tmpfails if test exits with 254 or 255
Control: tags -1 pending
Control: retitle -2 autopkgtest: tmpfails if removal of tmp dir fails

On Thu, 8 Aug 2024 12:40:15 +0200 Paul Gevers <[email protected]> wrote:
> I recall that most of those openjdk-* failures actually happen during
> cleanup, where `rm` complains that the /tmp/$some-autopkgtest-dir
> directory isn't empty. That appears to be a race condition to me:
> cleanup starts before the tests have really stopped writing files
> after being killed.
Let's clone this one, because I have a fix worked out for the exit 254 case.

Paul

Bug 1078214 cloned as bug 1078514 Request was from Paul Gevers <[email protected]> to [email protected]. (Sun, 11 Aug 2024 18:06:06 GMT) (full text, mbox, link).


Changed Bug title to 'autopkgtest: tmpfails if removal of tmp dir fails' from 'autopkgtest: several packages triggers tmpfail instead of fail'. Request was from Paul Gevers <[email protected]> to [email protected]. (Sun, 11 Aug 2024 18:06:07 GMT) (full text, mbox, link).


Information forwarded to [email protected], Debian CI team <[email protected]>:
Bug#1078514; Package autopkgtest. (Thu, 16 Jan 2025 18:06:02 GMT) (full text, mbox, link).


Acknowledgement sent to Paul Gevers <[email protected]>:
Extra info received and forwarded to list. Copy sent to Debian CI team <[email protected]>. (Thu, 16 Jan 2025 18:06:02 GMT) (full text, mbox, link).


Message #19 received at [email protected] (full text, mbox, reply):

From: Paul Gevers <[email protected]>
To: [email protected]
Subject: Re: [1078514] autopkgtest: tmpfails if removal of tmp dir fails
Date: Thu, 16 Jan 2025 19:02:12 +0100
Hi,

On Sun, 11 Aug 2024 20:03:12 +0200 Paul Gevers <[email protected]> wrote:
> On Thu, 8 Aug 2024 12:40:15 +0200 Paul Gevers <[email protected]> wrote:
> > I recall that most of those openjdk-* failures actually happen during
> > cleanup, where `rm` complains that the /tmp/$some-autopkgtest-dir
> > directory isn't empty. That appears to be a race condition to me:
> > cleanup starts before the tests have really stopped writing files
> > after being killed.
> Let's clone this one, because I have a fix worked out for the exit 254 case.
Current example (with a slightly different failure message): cmake on
riscv64 [1]:

20988s The following tests FAILED:
20988s 	622 - Qt5Autogen.RerunMocPlugin (Failed)                Qt5
20988s Errors while running CTest
21033s autopkgtest [05:39:11]: ERROR: "rm -rf /tmp/autopkgtest-lxc.8323tz_i/downtmp/testsuite-artifacts /tmp/autopkgtest-lxc.8323tz_i/downtmp/autopkgtest_tmp" failed with status 137
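
For reference, a status of 137 matches the common shell convention of reporting 128 plus the signal number, which would make it signal 9 (SIGKILL). That the status in this log follows that convention is an assumption, not something the log states. A small Python sketch of the decoding, with the hypothetical helper name `describe_exit_status`:

```python
import signal


def describe_exit_status(status):
    """Describe a shell-style exit status, decoding 128+N as signal N.

    Hypothetical helper for illustration; it assumes the shell convention
    that a process killed by signal N is reported as status 128 + N.
    """
    if status > 128:
        try:
            return f"killed by {signal.Signals(status - 128).name}"
        except ValueError:
            # 128 + N does not map to a known signal on this platform,
            # so treat the value as an ordinary exit code.
            pass
    return f"exited with code {status}"


print(describe_exit_status(137))
```

Under that convention, the `rm -rf` in the log above would have been killed rather than having failed on its own, which may be why the message differs from the "directory isn't empty" case.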

Paul

[1] https://ci.debian.net/packages/c/cmake/unstable/riscv64/54908817/




Debian bug tracking system administrator <[email protected]>. Last modified: Tue May 13 15:40:54 2025; Machine Name: bembo
