From mboxrd@z Thu Jan 1 00:00:00 1970 X-GM-THRID: 6509751388101148672 X-Received: by 10.25.76.6 with SMTP id z6mr1650410lfa.25.1516397002020; Fri, 19 Jan 2018 13:23:22 -0800 (PST) X-BeenThere: isar-users@googlegroups.com Received: by 10.25.233.148 with SMTP id j20ls239460lfk.12.gmail; Fri, 19 Jan 2018 13:23:21 -0800 (PST) X-Google-Smtp-Source: ACJfBot6XQEH3KfDHkrB4nZvYZ00sWDRM075QDK08/fTWX2coqtmgn40UrzQHKkn2sNaUhwdqK1z X-Received: by 10.46.49.18 with SMTP id x18mr1881486ljx.21.1516397001285; Fri, 19 Jan 2018 13:23:21 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1516397001; cv=none; d=google.com; s=arc-20160816; b=TeNDPhmrS8oXZzJQeFQucyeoi7H4fB9jJ1XV/gD/WnjIBoknxR05bGbAQRMoqnlFHt o58+5Y/STDkWEp6rYxAf93y6L+neZV0Mc8mpz+RmKUnn93Zi6pqlO6befKxPtqrGNvnV j3nYA+aTUPIsvWmgDZjAhTrPxdF+yetHCRkpME0nMvwQjD3BDePY2okrKMl3K0QVc2Cd 8YOuhpQLTg+vH02kCq9OzxEt5cvadQywPwZRTxZiG5zIPyAsPdeqGkByjGOFLLsxC6VC 8ObsA5BX/KozC1odY74ILfTsL7AzPQjaY2d/9UtnVIS7V3nsdUwt6SHuR39+oV8AmLH7 h90Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-language:content-transfer-encoding:in-reply-to:mime-version :user-agent:date:message-id:from:references:to:subject :dkim-signature:arc-authentication-results; bh=jQmcmVGGJMRPQA6B7+PSSNPqNhTLJ1odekHwYRcAsMA=; b=NikdkUBW9nn/zBpnvXzDDphxOxrKEDt7AJzoXwL7modBqXioGRO1rFmk5AGdZfwJTL XJVuxB70S8us4iyZNlRy/5UQKNATOaRCDUnqCN4E0qayPmR7hKV+d+azwfK+XCrth73X unCCFTbqJOJoeHLg4hTT/ONDvTUXjpOAXNA7IMK36L4vHd50u7SUep5j+qRlWm/C4SIs W6i2BAJECm4bUu4KyBR68FVizqMzgpLMKV5e36M2Rye4B4pKvmBVFD5G40X1rKT5OwWw e4xSynfA4ZbsdwhzQYmVIFhXyCdUT2G8jpARgZt45LbmEmVZ1AFyIoymht2J8x0uKk1s /uSA== ARC-Authentication-Results: i=1; gmr-mx.google.com; dkim=pass header.i=@googlemail.com header.s=20161025 header.b=cr2f09+r; spf=pass (google.com: domain of benbrenson89@googlemail.com designates 2a00:1450:400c:c09::231 as permitted sender) smtp.mailfrom=benbrenson89@googlemail.com; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=googlemail.com Return-Path: Received: from mail-wm0-x231.google.com (mail-wm0-x231.google.com. [2a00:1450:400c:c09::231]) by gmr-mx.google.com with ESMTPS id k24si1507274ljk.5.2018.01.19.13.23.21 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 19 Jan 2018 13:23:21 -0800 (PST) Received-SPF: pass (google.com: domain of benbrenson89@googlemail.com designates 2a00:1450:400c:c09::231 as permitted sender) client-ip=2a00:1450:400c:c09::231; Authentication-Results: gmr-mx.google.com; dkim=pass header.i=@googlemail.com header.s=20161025 header.b=cr2f09+r; spf=pass (google.com: domain of benbrenson89@googlemail.com designates 2a00:1450:400c:c09::231 as permitted sender) smtp.mailfrom=benbrenson89@googlemail.com; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=googlemail.com Received: by mail-wm0-x231.google.com with SMTP id t74so5914256wme.3 for ; Fri, 19 Jan 2018 13:23:21 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=20161025; h=subject:to:references:from:message-id:date:user-agent:mime-version :in-reply-to:content-transfer-encoding:content-language; bh=jQmcmVGGJMRPQA6B7+PSSNPqNhTLJ1odekHwYRcAsMA=; b=cr2f09+rSdCLu0IPIlbtB8g1F+KxDjVdmz1pM38lfOaDM9ZL7pb/knEijgCia1zDMd bUOLMd2Eq+v7DXEsZFtBocnXuttdmcqftNc6xzNA2+ZS4uE9sz2qDknvD0XL12FrJqin JxZkvw+ZdjbOPWBY1IYN3q679Y+Q1lWE2wNQj/WvXXvyrG24YQGiyX+Nx9XJw8r/Gnok tBr5vsTBFFoEVuE7DfHJjBnVqpLe3HbWvPFJrDYgO3zbBnjecFRdt7OEJYToD8JEu1pg Xl22ePVVwhqf9abJE6I432bJhHrnfhb3Zrhcd8Jbkx8CFfzLBNM7r6TNKpWWQ87iXvG+ cmYA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=jQmcmVGGJMRPQA6B7+PSSNPqNhTLJ1odekHwYRcAsMA=; b=bX/EvywJKsLuYKKw6VoPPuzqFdEln7u7oSrVZYTwPN5NT1y5ZLsmiPANYWm+j/Ug+c xUqGa7cmnKSVEYgsyg/TFkCbC8PjjPo4TGeSDXSEmBAvHmCV590QTd+8nOfTnAh7mV0y yp3EDthjFWOuqO8PiDJGZSxOmjz1px2mR9sAROfCrJYKs8uDiyqOyvJn7IJhjZD0pS5f YWod5Ipp3d65unZUMTS6LAanq53zggaVhrOq+fZJjvepa4k3QpQD1IPyImgCaQCtfQjl MkKGCiEfZCmRpkg/2zRzqDVhl8zqcLSvUTQY1pVShgWlk79xkCDwpWjYw0/JClNGUAWF yCeA== X-Gm-Message-State: AKwxytcjz/FmoYeOVhP+05ZOnGZb82V0viIZrOamRHYNU58jmRVVke+6 oJOCOQrZ2LtyOJfS3p5phm02j7n+ X-Received: by 10.80.142.15 with SMTP id 15mr334881edw.127.1516397000419; Fri, 19 Jan 2018 13:23:20 -0800 (PST) Return-Path: Received: from [192.168.0.11] (ipb21b4230.dynamic.kabel-deutschland.de. [178.27.66.48]) by smtp.gmail.com with ESMTPSA id o42sm7374346edc.33.2018.01.19.13.23.19 for (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 19 Jan 2018 13:23:19 -0800 (PST) Subject: Re: [RFC v2][PATCH 2/3] build-rep: Add helper class To: isar-users References: <20180111111939.25667-1-asmirnov@ilbers.de> <20180111111939.25667-3-asmirnov@ilbers.de> <20180112133205.2d2d6615@mmd1pvb1c.ad001.siemens.net> <20180112172526.19bd23a2@mmd1pvb1c.ad001.siemens.net> From: Benedikt Niedermayr Message-ID: Date: Fri, 19 Jan 2018 22:23:19 +0100 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:52.0) Gecko/20100101 Thunderbird/52.4.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US X-TUID: ojgYMBYf5WBh Am 14.01.2018 um 17:53 schrieb Jan Kiszka: > On 2018-01-12 17:25, [ext] Henning Schild wrote: >> Am Fri, 12 Jan 2018 16:29:20 +0300 >> schrieb Alexander Smirnov : >> >>> On 01/12/2018 03:32 PM, Henning Schild wrote: >>>> Am Thu, 11 Jan 2018 14:19:38 +0300 >>>> schrieb Alexander Smirnov : >>>> >>>>> Add class that helps to implement build reproducibility. It >>>>> implements anonymous function that will get all the Debian >>>>> dependencies that are needed for current Isar tree. >>>>> >>>>> Until build reproducibility will be fully implemented, it's >>>>> disabled by default. To enable it just set ISAR_BUILD_REP to "1" >>>>> in local.conf. >>>>> >>>>> Signed-off-by: Alexander Smirnov >>>>> --- >>>>> meta-isar/conf/local.conf.sample | 6 +++++ >>>>> meta-isar/recipes-app/hello/hello.bb | 2 ++ >>>>> meta/classes/build-rep.bbclass | 32 >>>>> ++++++++++++++++++++++++ >>>>> meta/classes/dpkg-base.bbclass | 2 ++ >>>>> meta/classes/image.bbclass | 2 ++ >>>>> meta/recipes-devtools/buildchroot/buildchroot.bb | 2 ++ 6 files >>>>> changed, 46 insertions(+) create mode 100644 >>>>> meta/classes/build-rep.bbclass >>>>> >>>>> diff --git a/meta-isar/conf/local.conf.sample >>>>> b/meta-isar/conf/local.conf.sample index 660958f..45b8995 100644 >>>>> --- a/meta-isar/conf/local.conf.sample >>>>> +++ b/meta-isar/conf/local.conf.sample >>>>> @@ -162,3 +162,9 @@ BB_NUMBER_THREADS = "4" >>>>> # >>>>> # Number of attempts to try to get reprepro lock for access to >>>>> apt cache REPREPRO_LOCK_ATTEMPTS = "16" >>>>> + >>>>> +# Isar build reproducibility feature creates local repository >>>>> which contains +# copies for all the upstream Debian packages that >>>>> could be used to build +# your Isar tree. So fetching them once >>>>> will guarantee that all the next Isar +# builds will be >>>>> identically. +ISAR_BUILD_REP ?= "0" >>>>> diff --git a/meta-isar/recipes-app/hello/hello.bb >>>>> b/meta-isar/recipes-app/hello/hello.bb index 44b8bc3..fafda2e >>>>> 100644 --- a/meta-isar/recipes-app/hello/hello.bb >>>>> +++ b/meta-isar/recipes-app/hello/hello.bb >>>>> @@ -16,3 +16,5 @@ SRCREV = >>>>> "ad7065ecc4840cc436bfcdac427386dbba4ea719" SRC_DIR = "git" >>>>> >>>>> inherit dpkg >>>>> + >>>>> +DEBIAN_DEPENDS = "debhelper (>= 9), autotools-dev" >>>> If i understand it correctly, this approach is an absolute NoGo. >>>> >>>> DEBIAN_DEPENDS as it is used today are runtime deps of pre-built >>>> packages. Here you include build-deps of a package that Isar needs >>>> to build. And you do so by introducing a copy of a string that is >>>> already included in the sources /debian/-folder. In fact you would >>>> have to put build and runtime deps into that one variable. And what >>>> about packages where the content depends on DISTRO_ARCH and friends? >>>> >>>> I think we first need a working solution of two "inherit dpkg" >>>> packages actually depending on each-other. After that we will need >>>> a real build of the Image to fill the cache. That is the only >>>> reliable way to make sure that the repo will contain the runtime >>>> and the build deps of all packages. More "sed"-guesswork is not >>>> going to do that trick and is only getting us 95% of what we need. >>>> And maintaining copies of stuff that is in /debian/ should not be >>>> considered. >>> Let's look at this realistically: >>> >>> 1. The upstream Debian should be fetched via single operation. If you >>> want to populate the cache during image build - you can't guarantee >>> that all the packages were built in the same environment. As >>> Christoph wrote, at any time the upstream apt could be updated. So >>> the buildchroot and image filesystems should be generated using local >>> static apt, otherwise it makes no sense to claim reproducibility. >> If the versions change on the way we should end up with two versions of >> one package in the cache. When we build from that again we will not get >> what we got the first time because image and buildchroot will probably >> take the latest version, while in the first run they got different >> versions. I think that is fine and we could detect that and declare the >> first build as "failed because of upstream change". >> You will just have to do that over and over again until you find a >> 30min window where the mirror did not change, i suspect you will hardly >> ever hit the case where the mirror changed between the multistraps. >> >>> 2. Regarding 'debian/control' duplications. According to the idea, >>> that 'base-apt' should be generated first of all, in bitbake terms it >>> means that: >> INHO that idea is wrong, it should be generated as a side-product of an >> actual build. >> If you want to get this right you will end up reimplementing Debian >> internals and keep running after them. That does not sound like a >> promising way to go. >> >>> *.bb:do_build() should depends on base-apt:do_build(). >>> >>> On the other hand, base-apt:do_build() should already know the full >>> list of all deps for *.bb. So for now I see two ways how to implement >>> this: >>> >>> - Duplicate 'debian/control' in recipe and share this variable with >>> base-apt (like it's done in this series). >>> >>> - Manually parse 'debian/control'. But in this case base-apt should >>> depend on *.bb:do_unpack. Due to bitbake is 'statically configured' >>> tool, base-apt should know the list of all the recipes, what is also >>> impossible. In previous series I've tried to implement this by >>> defining a list of images in some global variable. >> "Manuall parse" is a NoGo, just look at the multiple broken versions of >> perl code guessing the build-deps ... emulating something that debian >> itself just gets right. >> >>> 2. Question how to get the list of build and runtime dependencies is >>> still open, I've already asked about this in previous mail. So, the >>> goal is to have the full list of packages that should be fetched. >>> 'debian/control' contains very specific format, so I can't feed this >>> line to multistrap/apt-get as it is. I see the following solutions: >>> - Implement custom tool to do this (apt/dpkg uses standard perl >>> libs which contains all this stuff). >>> - Manually parse 'debian/control' >>> - Drop the requirement about single upstream fetch. >> And again the infamous "manual parse" and even a "custom tool" ... No. >> >> The repo should come out of what the multiple "multistraps" and "apt-get >> installs" actually fetch when they are allowed to fetch from the >> outside. If you refer to that with "Drop the requirement about ..." i >> fully agree. > I agree with Henning's view. We should exploit Debian for building the > repo and only use bitbake recipes to control and direct Debian tools, > definitely not reimplement them. > > Having one build run that both generates outputs and fills that repos > which would then be used on succeeding runs would be ok from user > perspective. We do not necessarily need that strict ordering of > fetch-before-use in the first build. > > Jan > I do also agree with Henning's view. It would be better not struggling with those complexity within Isar. Maybe I'm a bit late. But is reprepro really the best solution for implementing the cache? Let's assume we implement it like Henning mentioned already. We let Isar generate its image and add all required packages to the local repository. Then I see a big problem when adding new recipes to Isar, which will extend the local apt repository with maybe new versions of a dependent package. You never know that before. In this case the new dependencies can trigger a bunch replacement of the old packages, and the old stable state of local repo is gone. The solution may be to create snapshots of the current local repository, but reprepro is IMHO therefore not the best solution: https://serverfault.com/questions/489504/using-snapshots-with-reprepro-to-enable-rollbacks I think "aptly" would be much better here, since aptly gives you the possibility to create partial snapshots of new package versions. It also comes along with snapshot merging and lot of things Isar currently doesn't need, but maybe will sometime in the future. I imagine the apt-cache as a real cache. Means this cache should be able to be the interface between Isar and downloading the packages. So the cache should download the package (and dependencies) into the cache and Isar installs those packages from the cache instead from the remote repository. I have not tested that with aptly, but the link looks like it can achieve that: https://www.aptly.info/tutorial/pull/ When aptly is capable of downloading single packages (and dependencies) from a remote repo, then i strongly recommend using this tool instead of reprepro. Regards Benni