public inbox for isar-users@googlegroups.com
 help / color / mirror / Atom feed
From: <Venkata.Pyla@toshiba-tsip.com>
To: <henning.schild@siemens.com>
Cc: <isar-users@googlegroups.com>, <jan.kiszka@siemens.com>,
	<kazuhiro3.hayashi@toshiba.co.jp>,
	<dinesh.kumar@toshiba-tsip.com>
Subject: RE: [PATCH] image.bbclass: set file timestamps inside the rootfs and initramfs image
Date: Mon, 21 Nov 2022 05:16:37 +0000	[thread overview]
Message-ID: <OSYPR01MB55425EC15AEC59046AC06F4CA40A9@OSYPR01MB5542.jpnprd01.prod.outlook.com> (raw)
In-Reply-To: <20221110091226.4a3695f2@md1za8fc.ad001.siemens.net>



>-----Original Message-----
>From: isar-users@googlegroups.com <isar-users@googlegroups.com> On Behalf
>Of Henning Schild
>Sent: 10 November 2022 12:42
>To: pyla venkata(TSIP TMIEC ODG Porting) <Venkata.Pyla@toshiba-
>tsip.com>
>Cc: isar-users@googlegroups.com; jan.kiszka@siemens.com; hayashi kazuhiro(林
>和宏 □SWC◯ACT) <kazuhiro3.hayashi@toshiba.co.jp>; dinesh kumar(TS
>IP TMIEC ODG Porting) <dinesh.kumar@toshiba-tsip.com>
>Subject: Re: [PATCH] image.bbclass: set file timestamps inside the rootfs and
>initramfs image
>
>Am Wed, 9 Nov 2022 09:27:20 +0000
>schrieb <Venkata.Pyla@toshiba-tsip.com>:
>
>> Hi Henning Schild,
>>
>> Thanks for your review, please find my comments below.
>>
>>
>> Thanks,
>> Venkata.
>>
>> >-----Original Message-----
>> >From: Henning Schild <henning.schild@siemens.com>
>> >Sent: 07 November 2022 14:24
>> >To: pyla venkata(TSIP TMIEC ODG Porting) <Venkata.Pyla@toshiba-
>> >tsip.com>
>> >Cc: isar-users@googlegroups.com; jan.kiszka@siemens.com; hayashi
>> >kazuhiro(林和宏 □SWC◯ACT) <kazuhiro3.hayashi@toshiba.co.jp>;
>> >dinesh kumar(TSIP TMIEC ODG Porting)
>> ><dinesh.kumar@toshiba-tsip.com> Subject: Re: [PATCH] image.bbclass:
>> >set file timestamps inside the rootfs and initramfs image
>> >
>> >Am Mon,  7 Nov 2022 13:55:03 +0530
>> >schrieb venkata.pyla@toshiba-tsip.com:
>> >
>> >> From: venkata pyla <venkata.pyla@toshiba-tsip.com>
>> >>
>> >> As part of reproducible-build work, one of the problem chosen to
>> >> solve is the file time stamps inside rootfs and initramfs are not
>> >> identical between two builds.
>> >>
>> >> With the help of reproducible-builds.org and their suggestions, the
>> >> above problem can be fixed using 'SOURCE_DATE_EPOCH' variable [2].
>> >>
>> >> In case of rootfs file time-stamps, set all the files and folders
>> >> that are newer than 'SOURCE_DATE_EPOCH' and set it to same.
>> >> In case of initramfs, regenerate the initramfs image with
>> >> 'SOURCE_DATE_EPOCH' variable set as the mkinitramfs script is
>> >> already taken care of creating reproducible initramfs image when
>> >> the variable is set in the environment[3].
>> >>
>> >> The SOURCE_DATE_EPOCH variable should be set to the last
>> >> modification of the git repository as explained in the
>> >> documentation[2].
>> >>
>> >> e.g:
>> >> SOURCE_DATE_EPOCH=$(git log -1 --pretty=%ct)
>> >>
>> >> To know more about the reproducible builds and its goals please
>> >> refer [1].
>> >>
>> >> [1] https://reproducible-builds.org/ [2]
>> >> https://reproducible-builds.org/docs/source-date-epoch/
>> >> [3]
>> >> https://manpages.debian.org/bullseye/initramfs-tools-core/mkinitramfs.
>> >> 8.en.html#ENVIRONMENT
>> >>
>> >> Signed-off-by: venkata pyla <venkata.pyla@toshiba-tsip.com>
>> >> ---
>> >>  meta/classes/image.bbclass | 15 +++++++++++++++
>> >>  1 file changed, 15 insertions(+)
>> >>
>> >> diff --git a/meta/classes/image.bbclass
>> >> b/meta/classes/image.bbclass index ccff810..c1bb4fd 100644
>> >> --- a/meta/classes/image.bbclass
>> >> +++ b/meta/classes/image.bbclass
>> >> @@ -431,6 +431,21 @@ do_rootfs_finalize() {
>> >>              "${ROOTFSDIR}/etc/apt/sources.list.d/bootstrap.list"
>> >>
>> >>          rm -f "${ROOTFSDIR}/etc/apt/sources-list"
>> >> +
>> >> +        # Recreate initramfs inorder to set timestamps to
>> >> SOURCE_DATE_EPOCH
>> >> +        # inorder to make reproducible initramfs
>> >> +        test ! -z "${SOURCE_DATE_EPOCH}" && \
>> >> +           SOURCE_DATE_EPOCH=${SOURCE_DATE_EPOCH} chroot
>> >> "${ROOTFSDIR}" \
>> >> +                  update-initramfs -u -v
>> >
>> >This should be done where that initial update-initramfs can be found.
>> >And not fix things up later. Not every image will have an initrd so
>> >that is wrong in multiple ways.
>>
>> Thanks for correcting me, I understood the initrd update should not be
>> in the goal of image class, as some of the images may not be required
>> initrd as you mentioned. I will find the better place to do this and
>> send the another patch.
>>
>> >
>> >> +
>> >> +	# Set timestamp to files inside the rootfs image inorder
>> >> to make
>> >> +	# reproducible rootfs
>> >> +	test ! -z "${SOURCE_DATE_EPOCH}" && \
>> >> +           find ${ROOTFSDIR} -newermt \
>> >> +               "$(date -d@${SOURCE_DATE_EPOCH} '+%Y-%m-%d
>> >> %H:%M:%S')" \
>> >> +               -printf "%y %p\n" \
>> >> +               -exec touch '{}' -h -d@${SOURCE_DATE_EPOCH} ';'
>> >> +
>> >
>> >No! Which files do we care about? My guess would content of packages
>> >built with isar. We should export the variable in the dpkg class then
>> >the packages will be correct and we do not have to mess around with
>> >that rootfs and fix problems that should have never been there.
>> >
>>
>> This will only change the files are created during build, and not the
>> files come with package.
>
>All files, or very close to all, come with packages. Because isar builds packages
>for stuff it does. The wanted timestamp has to be provided at build time of
>those packages so that later on one does not need to run such a find.
>
>You basically first want to make sure the packages (those from
>isar-apt) are reproducible and only later look at the whole rootfs. That whole
>rootfs will have remaining differences, just from what maintainer hooks and
>stuff do.
>
>That find violates the rule that everything should come from a package.
>There are some exceptions to that rule in isar, but those are likely the ones
>causing repro issues. And anyhow not everyone delivers only the images, some
>people also deliver the packages. So those need to be reproducible as well ... or
>even first.

I am trying to understand Isar and how it installs the debian packages, I think Isar rebuilds some of the packages from the sources using sbuild,
If that is so, then sbuild taken care of creating the package reproducibly, if it is not creating then we should definitely check why it is no generating reproducible packages.

The find command here mostly fixes the timestamps of files or folders that are created or modified during 'postinst' scripts (or after package installation).

>
>> The idea is to set time stamps to the files as same that are modified
>> or added during build time (e.g: /etc/*) and they are newer than
>> SOURCE_DATE_EPOCH date.
>>
>>
>> >
>> >I would like to ask for test cases. Ideally first a breaking test and
>> >later a commit fixing the issue.
>>
>> I executed this in one of the child project (isar-cip-core) and
>> reported the issue here [1], are you expecting to write test cases in
>> isar?
>>
>> [1] https://gitlab.com/cip-project/cip-core/isar-cip-core/-/issues/31
>
>The motivation always has to come with the patch. So that manual diffoscope
>workflow should be explained in the commit message or documentation. Ideally
>you would add a test that actually runs that and asserts that there are no
>problems left. Or ignoring the left ones for starters.
>
>That layer and all other layers will later benefit. But whatever you try to fix has
>to be explained and consistent inside isar only. So issues from some layer
>clearly do not count.
>
>Say we get the debian package feed to become reproducible, i bet we could just
>compare the repo metadata and the package checksums. Not even dive into the
>packages and check what might be different.
>
>A good way of explaining a change is also to first write the test and later the
>code to fix the problem a test shows. Everybody would understand easily and if
>any future change would break it again we would see before a merge.

I agree with you and I have also sent patch[1] for this that have test script for verifying the reproducibility in Isar images.

[1] https://groups.google.com/g/isar-users/c/4ZIuKCOQzVc/m/sPUafWDgAgAJ


>
>regards,
>Henning
>
>> >
>> >Note that deriving the time from git means one needs git. And when
>> >using layers just that one Isar git is clearly not good enough. And
>> >we will have to check how that works with sstate. I assume any new
>> >commit would rebuild all custom packages. Here i see a conflict
>> >between regular dev work and repro pedantics. We might need a way to
>> >turn that stuff off.
>> >
>> >regards,
>> >Henning
>> >
>> >>  EOSUDO
>> >>  }
>> >>  addtask rootfs_finalize before do_rootfs after
>> >> do_rootfs_postprocess
>>
>
>--
>You received this message because you are subscribed to the Google Groups
>"isar-users" group.
>To unsubscribe from this group and stop receiving emails from it, send an email
>to isar-users+unsubscribe@googlegroups.com.
>To view this discussion on the web visit
>https://groups.google.com/d/msgid/isar-
>users/20221110091226.4a3695f2%40md1za8fc.ad001.siemens.net.

  parent reply	other threads:[~2022-11-21  5:16 UTC|newest]

Thread overview: 25+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-11-07  8:25 venkata.pyla
2022-11-07  8:50 ` Moessbauer, Felix
2022-11-09  6:56   ` Venkata.Pyla
2022-11-07  8:53 ` Henning Schild
2022-11-09  9:27   ` Venkata.Pyla
2022-11-10  7:12     ` Henning Schild
2022-11-20  7:53       ` [PATCH] scripts/repro-test.sh: script to test reproducibility of Isar image venkata.pyla
2022-11-22  7:28         ` Anton Mikanovich
2022-11-22  8:49           ` Venkata.Pyla
2022-12-19 14:41             ` [PATCH 0/2] Test for verifiying reproducible images venkata.pyla
2022-12-28  8:40               ` Anton Mikanovich
2023-01-02  6:24                 ` [PATCH] repro-build-test.py: Fix date_epoch time contains byte character b'' venkata.pyla
2023-01-11  5:42                   ` Uladzimir Bely
2022-12-19 14:41             ` [PATCH 1/2] cibuilder.py: Add source_date_epoch to build configuration venkata.pyla
2022-12-19 14:41             ` [PATCH 2/2] repro-build-test.py: Test to check images are reproducible venkata.pyla
2023-01-07  0:01               ` Henning Schild
2022-11-21  5:16       ` Venkata.Pyla [this message]
2022-11-10  9:52 ` [PATCH] image.bbclass: set file timestamps inside the rootfs and initramfs image Balasubramanian Sundaram
2022-11-10 10:03   ` Anton Mikanovich
2022-11-10 10:17     ` Balasubramanian Sundaram
2022-11-10 10:19       ` Balasubramanian Sundaram
2022-11-10 10:27       ` Anton Mikanovich
2022-11-10 10:51         ` Balasubramanian Sundaram
2022-11-10 16:30           ` Henning Schild
2022-11-10 17:05             ` Henning Schild

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=OSYPR01MB55425EC15AEC59046AC06F4CA40A9@OSYPR01MB5542.jpnprd01.prod.outlook.com \
    --to=venkata.pyla@toshiba-tsip.com \
    --cc=dinesh.kumar@toshiba-tsip.com \
    --cc=henning.schild@siemens.com \
    --cc=isar-users@googlegroups.com \
    --cc=jan.kiszka@siemens.com \
    --cc=kazuhiro3.hayashi@toshiba.co.jp \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox