From: Jan Kiszka <jan.kiszka@siemens.com>
To: Henning Schild <henning.schild@siemens.com>,
vijai kumar <vijaikumar.kanagarajan@gmail.com>
Cc: isar-users <isar-users@googlegroups.com>,
Baurzhan Ismagulov <ibr@radix50.net>
Subject: Re: [Discussion]: Metadata to consolidate and rebuild base-apt from distributed CI builds
Date: Fri, 25 Feb 2022 18:27:38 +0100 [thread overview]
Message-ID: <45e6e9ba-8a05-e617-d5ae-949efb53c35b@siemens.com> (raw)
In-Reply-To: <20220224164244.6e4bb002@md1za8fc.ad001.siemens.net>
On 24.02.22 16:42, Henning Schild wrote:
> Am Thu, 24 Feb 2022 18:50:50 +0530
> schrieb vijai kumar <vijaikumar.kanagarajan@gmail.com>:
>
>> Hi Henning,
>>
>> On Tue, Feb 22, 2022 at 8:01 PM Henning Schild
>> <henning.schild@siemens.com> wrote:
>>>
>>> Hey Vijai,
>>>
>>> Am Tue, 22 Feb 2022 16:04:36 +0530
>>> schrieb vijai kumar <vijaikumar.kanagarajan@gmail.com>:
>>>
>>>> Problem:
>>>> --------
>>>> We could have several CI jobs that are running in parallel in
>>>> different nodes. One might want to consolidate and build a
>>>> base-apt from the debs/deb-srcs of all these builds.
>>>
>>> Can you go into more detail. I do not yet get the problem.
>>
>> runner 1(Germany) -> Building de0 nano
>> runner 2(India) -> Building qemuarm
>> runner 3(US) -> Building qemuamd64
>>
>>
>> All these builds are running in different servers.
>> If we wanted to create a single base-apt from all these servers, then
>> we need to copy over their deb/debsrcs/base-apt to a common server and
>> then
>> create a consolidated repo.
>
> But why would you want to do that? I mean i get why you would want to
> store all in the same location, but not why it should be one repo.
> Maybe to save some space on sources and arch all .. but hey there are
> ways of deduplcating on filesystem or block level.
> You are just risking a weird local "all" package not being so "all"
> after all ... false sharing.
We want to auto-build a single, "offline" capable repo from the BoM
accumulated from those builds of all possible targets. And that in a way
that does not require pushing large artifacts between the build stages,
ideally only those BoM lists.
>
>> This involves moving around this data.
>
> Yes, if it one central storage place. No matter if it is one "repo" or
> many "repos" in i.e. folders.
>
>> The problem can be avoided if we have a single metadata produced by
>> all these builds which would have details of all the packages the
>> build used.
>> Basically a manifest of the build. This manifest can be later used to
>> recreate the repo which can be hosted later on for these jobs.
>
> We have a manifest for "image content" which already is fed into
> clearing, it is a bill of materials an nothing else, it can not
> be used to rebuild.
> Even if you had all metadata you need to store sources and binaries
> somewhere reliable, whether that is central or distributed is another
> story.
> Pointers to anything on the internet (including all debian repos) will
> at some point stop working. So if "exact rebuilding" in a "far away
> future" is what you want, mirroring is what you will need.
Exactly, this mirror is supposed to be generated, and that shortly after
the individual builds succeeded (in a common pipeline stage). That can
fail as any build can fail if a referenced version picked up during
bootstrap got dropped while building an image.
> Partial mirroring based on base-apt even with sources will be shaky and
> you will find yourself digging in snapshots again. But it will work.
Yes, it works for us (you should know ;)).
Jan
--
Siemens AG, Technology
Competence Center Embedded Linux
next prev parent reply other threads:[~2022-02-25 17:27 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2022-02-22 10:34 vijai kumar
2022-02-22 14:31 ` Henning Schild
2022-02-24 13:20 ` vijai kumar
2022-02-24 15:42 ` Henning Schild
2022-02-25 17:27 ` Jan Kiszka [this message]
2022-03-03 13:45 ` vijai kumar
2022-03-04 10:03 ` Baurzhan Ismagulov
2022-03-07 7:23 ` vijai kumar
2022-03-15 11:45 ` Baurzhan Ismagulov
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=45e6e9ba-8a05-e617-d5ae-949efb53c35b@siemens.com \
--to=jan.kiszka@siemens.com \
--cc=henning.schild@siemens.com \
--cc=ibr@radix50.net \
--cc=isar-users@googlegroups.com \
--cc=vijaikumar.kanagarajan@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox