From: Jan Kiszka <jan.kiszka@siemens.com>
To: Benedikt Niedermayr <benbrenson89@googlemail.com>,
isar-users <isar-users@googlegroups.com>
Subject: Re: [RFC v2][PATCH 2/3] build-rep: Add helper class
Date: Wed, 24 Jan 2018 19:48:23 +0100 [thread overview]
Message-ID: <f360c043-2db7-2e65-8ff0-796dbd5c9938@siemens.com> (raw)
In-Reply-To: <f652a961-b7ae-c416-acc3-d18e0a8c6cae@googlemail.com>
On 2018-01-19 22:23, 'Benedikt Niedermayr' via isar-users wrote:
> Am 14.01.2018 um 17:53 schrieb Jan Kiszka:
>> On 2018-01-12 17:25, [ext] Henning Schild wrote:
>>> Am Fri, 12 Jan 2018 16:29:20 +0300
>>> schrieb Alexander Smirnov <asmirnov@ilbers.de>:
>>>
>>>> On 01/12/2018 03:32 PM, Henning Schild wrote:
>>>>> Am Thu, 11 Jan 2018 14:19:38 +0300
>>>>> schrieb Alexander Smirnov <asmirnov@ilbers.de>:
>>>>>
>>>>>> Add class that helps to implement build reproducibility. It
>>>>>> implements anonymous function that will get all the Debian
>>>>>> dependencies that are needed for current Isar tree.
>>>>>>
>>>>>> Until build reproducibility will be fully implemented, it's
>>>>>> disabled by default. To enable it just set ISAR_BUILD_REP to "1"
>>>>>> in local.conf.
>>>>>>
>>>>>> Signed-off-by: Alexander Smirnov <asmirnov@ilbers.de>
>>>>>> ---
>>>>>> meta-isar/conf/local.conf.sample | 6 +++++
>>>>>> meta-isar/recipes-app/hello/hello.bb | 2 ++
>>>>>> meta/classes/build-rep.bbclass | 32
>>>>>> ++++++++++++++++++++++++
>>>>>> meta/classes/dpkg-base.bbclass | 2 ++
>>>>>> meta/classes/image.bbclass | 2 ++
>>>>>> meta/recipes-devtools/buildchroot/buildchroot.bb | 2 ++ 6 files
>>>>>> changed, 46 insertions(+) create mode 100644
>>>>>> meta/classes/build-rep.bbclass
>>>>>>
>>>>>> diff --git a/meta-isar/conf/local.conf.sample
>>>>>> b/meta-isar/conf/local.conf.sample index 660958f..45b8995 100644
>>>>>> --- a/meta-isar/conf/local.conf.sample
>>>>>> +++ b/meta-isar/conf/local.conf.sample
>>>>>> @@ -162,3 +162,9 @@ BB_NUMBER_THREADS = "4"
>>>>>> #
>>>>>> # Number of attempts to try to get reprepro lock for access to
>>>>>> apt cache REPREPRO_LOCK_ATTEMPTS = "16"
>>>>>> +
>>>>>> +# Isar build reproducibility feature creates local repository
>>>>>> which contains +# copies for all the upstream Debian packages that
>>>>>> could be used to build +# your Isar tree. So fetching them once
>>>>>> will guarantee that all the next Isar +# builds will be
>>>>>> identically. +ISAR_BUILD_REP ?= "0"
>>>>>> diff --git a/meta-isar/recipes-app/hello/hello.bb
>>>>>> b/meta-isar/recipes-app/hello/hello.bb index 44b8bc3..fafda2e
>>>>>> 100644 --- a/meta-isar/recipes-app/hello/hello.bb
>>>>>> +++ b/meta-isar/recipes-app/hello/hello.bb
>>>>>> @@ -16,3 +16,5 @@ SRCREV =
>>>>>> "ad7065ecc4840cc436bfcdac427386dbba4ea719" SRC_DIR = "git"
>>>>>> inherit dpkg
>>>>>> +
>>>>>> +DEBIAN_DEPENDS = "debhelper (>= 9), autotools-dev"
>>>>> If i understand it correctly, this approach is an absolute NoGo.
>>>>>
>>>>> DEBIAN_DEPENDS as it is used today are runtime deps of pre-built
>>>>> packages. Here you include build-deps of a package that Isar needs
>>>>> to build. And you do so by introducing a copy of a string that is
>>>>> already included in the sources /debian/-folder. In fact you would
>>>>> have to put build and runtime deps into that one variable. And what
>>>>> about packages where the content depends on DISTRO_ARCH and friends?
>>>>>
>>>>> I think we first need a working solution of two "inherit dpkg"
>>>>> packages actually depending on each-other. After that we will need
>>>>> a real build of the Image to fill the cache. That is the only
>>>>> reliable way to make sure that the repo will contain the runtime
>>>>> and the build deps of all packages. More "sed"-guesswork is not
>>>>> going to do that trick and is only getting us 95% of what we need.
>>>>> And maintaining copies of stuff that is in /debian/ should not be
>>>>> considered.
>>>> Let's look at this realistically:
>>>>
>>>> 1. The upstream Debian should be fetched via single operation. If you
>>>> want to populate the cache during image build - you can't guarantee
>>>> that all the packages were built in the same environment. As
>>>> Christoph wrote, at any time the upstream apt could be updated. So
>>>> the buildchroot and image filesystems should be generated using local
>>>> static apt, otherwise it makes no sense to claim reproducibility.
>>> If the versions change on the way we should end up with two versions of
>>> one package in the cache. When we build from that again we will not get
>>> what we got the first time because image and buildchroot will probably
>>> take the latest version, while in the first run they got different
>>> versions. I think that is fine and we could detect that and declare the
>>> first build as "failed because of upstream change".
>>> You will just have to do that over and over again until you find a
>>> 30min window where the mirror did not change, i suspect you will hardly
>>> ever hit the case where the mirror changed between the multistraps.
>>>
>>>> 2. Regarding 'debian/control' duplications. According to the idea,
>>>> that 'base-apt' should be generated first of all, in bitbake terms it
>>>> means that:
>>> INHO that idea is wrong, it should be generated as a side-product of an
>>> actual build.
>>> If you want to get this right you will end up reimplementing Debian
>>> internals and keep running after them. That does not sound like a
>>> promising way to go.
>>>
>>>> *.bb:do_build() should depends on base-apt:do_build().
>>>>
>>>> On the other hand, base-apt:do_build() should already know the full
>>>> list of all deps for *.bb. So for now I see two ways how to implement
>>>> this:
>>>>
>>>> - Duplicate 'debian/control' in recipe and share this variable with
>>>> base-apt (like it's done in this series).
>>>>
>>>> - Manually parse 'debian/control'. But in this case base-apt should
>>>> depend on *.bb:do_unpack. Due to bitbake is 'statically configured'
>>>> tool, base-apt should know the list of all the recipes, what is also
>>>> impossible. In previous series I've tried to implement this by
>>>> defining a list of images in some global variable.
>>> "Manuall parse" is a NoGo, just look at the multiple broken versions of
>>> perl code guessing the build-deps ... emulating something that debian
>>> itself just gets right.
>>>
>>>> 2. Question how to get the list of build and runtime dependencies is
>>>> still open, I've already asked about this in previous mail. So, the
>>>> goal is to have the full list of packages that should be fetched.
>>>> 'debian/control' contains very specific format, so I can't feed this
>>>> line to multistrap/apt-get as it is. I see the following solutions:
>>>> - Implement custom tool to do this (apt/dpkg uses standard perl
>>>> libs which contains all this stuff).
>>>> - Manually parse 'debian/control'
>>>> - Drop the requirement about single upstream fetch.
>>> And again the infamous "manual parse" and even a "custom tool" ... No.
>>>
>>> The repo should come out of what the multiple "multistraps" and "apt-get
>>> installs" actually fetch when they are allowed to fetch from the
>>> outside. If you refer to that with "Drop the requirement about ..." i
>>> fully agree.
>> I agree with Henning's view. We should exploit Debian for building the
>> repo and only use bitbake recipes to control and direct Debian tools,
>> definitely not reimplement them.
>>
>> Having one build run that both generates outputs and fills that repos
>> which would then be used on succeeding runs would be ok from user
>> perspective. We do not necessarily need that strict ordering of
>> fetch-before-use in the first build.
>>
>> Jan
>>
> I do also agree with Henning's view.
>
>
> It would be better not struggling with those complexity within Isar.
> Maybe I'm a bit late. But is reprepro really the best solution for
> implementing the cache?
> Let's assume we implement it like Henning mentioned already. We let Isar
> generate its image and add all required packages to the local repository.
>
>
> Then I see a big problem when adding new recipes to Isar, which will extend
> the local apt repository with maybe new versions of a dependent package.
> You never know that before.
> In this case the new dependencies can trigger a bunch replacement of the
> old packages, and the old stable state of local repo is gone.
> The solution may be to create snapshots of the current local repository,
> but reprepro is IMHO therefore not the best solution:
> https://serverfault.com/questions/489504/using-snapshots-with-reprepro-to-enable-rollbacks
>
> I think "aptly" would be much better here, since aptly gives you the
> possibility to create partial snapshots of new package versions.
> It also comes along with snapshot merging and lot of things Isar
> currently doesn't need, but maybe will sometime in the future.
>
>
> I imagine the apt-cache as a real cache. Means this cache should be able
> to be the interface between Isar and downloading the packages.
> So the cache should download the package (and dependencies) into the
> cache and Isar installs those packages from the cache instead from the
> remote repository.
> I have not tested that with aptly, but the link looks like it can
> achieve that:
> https://www.aptly.info/tutorial/pull/
> When aptly is capable of downloading single packages (and dependencies)
> from a remote repo, then i strongly recommend using this tool instead
> of reprepro.
>
I also started to look into aptly, which I had on my notes for a while.
One reason is that it could act as a manual workaround for the missing
version-freezing support in Isar (pull into aptly repo, take a snapshot,
and then build from it with Isar).
Another is that we also need some tooling to manage mirrors and
snapshots on the long run, and reprepro, TBHO, smells strange. On first
glance, it looks to me like the multistrap of the repo management tools:
once a cool thing, but now unmaintained.
Did you dig further into aptly? I'm currently wondering how it
internally manages snapshots, if it does proper deduplication and how
much of that can be preserved when publishing for consumption by, say,
Isar...
Jan
--
Siemens AG, Corporate Technology, CT RDA IOT SES-DE
Corporate Competence Center Embedded Linux
next prev parent reply other threads:[~2018-01-24 18:48 UTC|newest]
Thread overview: 18+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-01-11 11:19 [RFC v2][PATCH 0/3] Introduce base-apt Alexander Smirnov
2018-01-11 11:19 ` [RFC v2][PATCH 1/3] dpkg-base: Make DEBIAN_DEPENDS global Alexander Smirnov
2018-01-11 11:19 ` [RFC v2][PATCH 2/3] build-rep: Add helper class Alexander Smirnov
2018-01-11 15:47 ` Jan Kiszka
2018-01-12 12:32 ` Henning Schild
2018-01-12 13:29 ` Alexander Smirnov
2018-01-12 16:25 ` Henning Schild
2018-01-14 16:53 ` Jan Kiszka
2018-01-19 21:23 ` Benedikt Niedermayr
2018-01-24 18:48 ` Jan Kiszka [this message]
2018-01-24 20:53 ` Benedikt Niedermayr
2018-01-24 21:31 ` Jan Kiszka
2018-01-25 18:52 ` Benedikt Niedermayr
2018-01-23 11:50 ` Baurzhan Ismagulov
2018-01-23 13:02 ` Jan Kiszka
2018-01-24 13:44 ` Baurzhan Ismagulov
2018-01-23 16:34 ` Christian Storm
2018-01-11 11:19 ` [RFC v2][PATCH 3/3] base-apt: Introduce fetching upstream apt Alexander Smirnov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=f360c043-2db7-2e65-8ff0-796dbd5c9938@siemens.com \
--to=jan.kiszka@siemens.com \
--cc=benbrenson89@googlemail.com \
--cc=isar-users@googlegroups.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox