Acknowledgement sent
to Michele Alessandrini <michelealessandrini74@gmail.com>:
New Bug report received and forwarded. Copy sent to Reproducible builds folks <reproducible-builds@lists.alioth.debian.org>.
(Wed, 02 Sep 2015 11:06:06 GMT) (full text, mbox, link).
To: Debian Bug Tracking System <submit@bugs.debian.org>
Subject: diffoscope: tar.gz and tar.bz2 archives compared wrongly
Date: Wed, 02 Sep 2015 12:55:14 +0200
Package: diffoscope
Version: 31
Severity: important
When comparing tar.gz or tar.bz2 archives containing text files, some of which different, the output is a binary diff. Perhaps it only applies decompression and compares the two tar files, instead of their contents?
-- System Information:
Debian Release: stretch/sid
APT prefers testing
APT policy: (500, 'testing')
Architecture: amd64 (x86_64)
Foreign Architectures: i386
Kernel: Linux 4.0.0-2-amd64 (SMP w/4 CPU cores)
Locale: LANG=it_IT.utf8, LC_CTYPE=it_IT.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)
Versions of packages diffoscope depends on:
ii python 2.7.9-1
ii python-debian 0.1.27
ii python-libarchive-c 2.1-3
ii python-magic 1:5.22+15-2
ii python-rpm 4.12.0.1+dfsg1-3
ii python-tlsh 3.2.1+20150727-1
pn python:any <none>
Versions of packages diffoscope recommends:
ii acl 2.2.52-2
pn binutils-multiarch <none>
ii bzip2 1.0.6-8
ii cpio 2.11+dfsg-4.1
ii default-jdk [java-sdk] 2:1.7-52
pn fontforge-extras <none>
ii genisoimage 9:1.1.11-3
ii gettext 0.19.4-1
pn ghc <none>
ii gnupg 1.4.19-5
pn mono-utils <none>
ii openjdk-7-jdk [java-sdk] 7u75-2.5.4-2
ii pdftk 2.02-3
ii poppler-utils 0.26.5-3
ii rpm2cpio 4.12.0.1+dfsg1-3
pn sng <none>
ii sqlite3 3.8.11.1-1
pn squashfs-tools <none>
ii unzip 6.0-18
ii vim-common 2:7.4.826-1
ii xz-utils 5.1.1alpha+20120614-2.1
diffoscope suggests no packages.
-- no debconf information
Acknowledgement sent
to Jérémy Bobbio <lunar@debian.org>:
Extra info received and forwarded to list. Copy sent to Reproducible builds folks <reproducible-builds@lists.alioth.debian.org>.
(Wed, 02 Sep 2015 12:39:20 GMT) (full text, mbox, link).
Control: retitle -1 diffoscope: implement fuzzy-matching across containers
Control: severity -1 wishlist
Michele Alessandrini:
> When comparing tar.gz or tar.bz2 archives containing text files, some
> of which different, the output is a binary diff. Perhaps it only
> applies decompression and compares the two tar files, instead of their
> contents?
That can be implemented by adding support for fuzzy-matching across
containers. This would also detect and compare properly a file being
compressed from one version to the next.
It's doable. The trickiest part in my mind so far is where to attach
them in the tree of differences.
--
Lunar .''`.
lunar@debian.org : :Ⓐ : # apt-get install anarchism
`. `'`
`-
Changed Bug title to 'diffoscope: implement fuzzy-matching across containers' from 'diffoscope: tar.gz and tar.bz2 archives compared wrongly'
Request was from Jérémy Bobbio <lunar@debian.org>
to 797759-submit@bugs.debian.org.
(Wed, 02 Sep 2015 12:39:20 GMT) (full text, mbox, link).
Severity set to 'wishlist' from 'important'
Request was from Jérémy Bobbio <lunar@debian.org>
to 797759-submit@bugs.debian.org.
(Wed, 02 Sep 2015 12:39:21 GMT) (full text, mbox, link).
Acknowledgement sent
to Michele Alessandrini <michelealessandrini74@gmail.com>:
Extra info received and forwarded to list. Copy sent to Reproducible builds folks <reproducible-builds@lists.alioth.debian.org>.
(Wed, 02 Sep 2015 15:06:03 GMT) (full text, mbox, link).
Il 02/09/2015 14:38, Jérémy Bobbio ha scritto:
> Control: retitle -1 diffoscope: implement fuzzy-matching across containers
> Control: severity -1 wishlist
>
> Michele Alessandrini:
>> When comparing tar.gz or tar.bz2 archives containing text files, some
>> of which different, the output is a binary diff. Perhaps it only
>> applies decompression and compares the two tar files, instead of their
>> contents?
>
> That can be implemented by adding support for fuzzy-matching across
> containers. This would also detect and compare properly a file being
> compressed from one version to the next.
>
> It's doable. The trickiest part in my mind so far is where to attach
> them in the tree of differences.
>
But isn't reading files inside archives a main feature of diffoscope?
Acknowledgement sent
to Jérémy Bobbio <lunar@debian.org>:
Extra info received and forwarded to list. Copy sent to Reproducible builds folks <reproducible-builds@lists.alioth.debian.org>.
(Wed, 02 Sep 2015 15:39:07 GMT) (full text, mbox, link).
Michele Alessandrini:
> Il 02/09/2015 14:38, Jérémy Bobbio ha scritto:
> >Control: retitle -1 diffoscope: implement fuzzy-matching across containers
> >Control: severity -1 wishlist
> >
> >Michele Alessandrini:
> >>When comparing tar.gz or tar.bz2 archives containing text files, some
> >>of which different, the output is a binary diff. Perhaps it only
> >>applies decompression and compares the two tar files, instead of their
> >>contents?
> >
> >That can be implemented by adding support for fuzzy-matching across
> >containers. This would also detect and compare properly a file being
> >compressed from one version to the next.
> >
> >It's doable. The trickiest part in my mind so far is where to attach
> >them in the tree of differences.
> >
>
> But isn't reading files inside archives a main feature of diffoscope?
It is. But .gz and .bz2 are two different file formats and diffoscope
treat them as such. It's like switching from .tar to .zip. It's quite a
difference and diffoscope will not blindly compare their content
currently.
Trust me, fuzzy matching accross containers is doable and will solve
exactly this problem, but also others.
--
Lunar .''`.
lunar@debian.org : :Ⓐ : # apt-get install anarchism
`. `'`
`-
Reply sent
to Mattia Rizzolo <mattia@debian.org>:
You have taken responsibility.
(Sat, 04 Nov 2017 16:06:03 GMT) (full text, mbox, link).
Notification sent
to Michele Alessandrini <michelealessandrini74@gmail.com>:
Bug acknowledged by developer.
(Sat, 04 Nov 2017 16:06:03 GMT) (full text, mbox, link).
Source: diffoscope
Source-Version: 88
We believe that the bug you reported is fixed in the latest version of
diffoscope, which is due to be installed in the Debian FTP archive.
A summary of the changes between this version and the previous one is
attached.
Thank you for reporting the bug, which will now be closed. If you
have further comments please address them to 797759@bugs.debian.org,
and the maintainer will reopen the bug report if appropriate.
Debian distribution maintenance software
pp.
Mattia Rizzolo <mattia@debian.org> (supplier of updated diffoscope package)
(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing ftpmaster@ftp-master.debian.org)
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA512
Format: 1.8
Date: Sat, 04 Nov 2017 16:26:09 +0100
Source: diffoscope
Binary: diffoscope
Architecture: source
Version: 88
Distribution: unstable
Urgency: medium
Maintainer: Reproducible builds folks <reproducible-builds@lists.alioth.debian.org>
Changed-By: Mattia Rizzolo <mattia@debian.org>
Description:
diffoscope - in-depth comparison of files, archives, and directories
Closes: 797759877473877525880279
Changes:
diffoscope (88) unstable; urgency=medium
.
[ Holger Levsen ]
* Bump standards version to 4.1.1, no changes needed.
.
[ Mattia Rizzolo ]
* tests/comparators:
+ dtb: compatibility with version 1.4.5. Closes: #880279
.
[ Chris Lamb ]
* comparators:
+ binwalk: improve names in output of "internal" members. Closes: #877525
+ Omit misleading "any of" prefix when only complaining about one module
in ImportError messages.
* Don't crash on malformed md5sums files. Closes: #877473
* tests/comparators:
+ ps: ps2ascii > 9.21 now varies on timezone, so skip this test for now.
+ dtby: only parse the version number, not any "-dirty" suffix.
* debian/watch: use HTTPS URI.
.
[ Ximin Luo ]
* comparators:
+ utils/file: diff container metadata centrally. Closes: #797759
This fixes a last remaining bug in fuzzy-matching across containers.
+ Fix all the affected comparators after the above change.
Checksums-Sha1:
78952516d1b86557328590304545c331dd340722 3275 diffoscope_88.dsc
4c26cfb04daf1f36bab3646c04a536843af5fa53 680084 diffoscope_88.tar.xz
a6b171b4543840030b8e97ea66a9b09a38535239 19315 diffoscope_88_amd64.buildinfo
Checksums-Sha256:
53930bba7240d52dea766af962409dd1734fa59e0ab9946db3f2690ea1ab2744 3275 diffoscope_88.dsc
b2ecb1ae2c53a10d580732fd8eafec13154b1f862856e391d2acd45441b2c3b1 680084 diffoscope_88.tar.xz
e4daeaad80659bf56a5cc8c260ce32811216760fdd3b1028d454d10d7908b44b 19315 diffoscope_88_amd64.buildinfo
Files:
a07e8b42c5c667bebb6eb8041f247aff 3275 devel optional diffoscope_88.dsc
76ce6e09fc87300e6513e79caafe4877 680084 devel optional diffoscope_88.tar.xz
caa3f6298f8ec437b2c9c7661e2bd506 19315 devel optional diffoscope_88_amd64.buildinfo
-----BEGIN PGP SIGNATURE-----
iQIzBAEBCgAdFiEEi3hoeGwz5cZMTQpICBa54Yx2K60FAln939YACgkQCBa54Yx2
K61UBg//RqmXTOAh6I4JpC27Ke6IeMf4KFZRSC3JcyUjY8RxZ6wczdKw77PEftB7
gxSOAzbMgcmIRq5+MdYbZgTNbYAjprfGZ6h54zcMeYSMqzECDuVZFeR4Z/kJEPX4
C2Qg01fNMu6famXDp1PYoM96sGPHr8rZLj0ONYsCt3m+Es0yd1ausD7WA0idtSGT
c0IOB3SnFL22j1zxW+bt/Z+WEpdMJ3uc4kXYnMYAmUZrOV3QIsL1jfUANHLK8KNO
Ai+Q2W259INJKfQTBzc/KfISdX9YBytsX8L9N1Gnw0HzBaA0lqH3ML/9nX35X7mq
n90wuOdrH24I4IHpYdLb2jVH0EbS3omm5ydSzsxuWFckhB/lz0fHaWA0+BHjkrEZ
18OGF7NLUJE+p6ca6kPeX4WWWnTiruJ34iICC0xDOWvf7QEyr/19tuRyr+gUtQli
6qjTtKvoLrRY+ngMicRNSBAn+aYIyyVMTBgqn5VtHHQzCCIjv/lG296ZouRoo0AA
X9jLeKTQ/TngA+EhlDWI4Q89GWwpc1afRTD6zBNyJNvo40vB4mi+KXv/dQSr7BdI
1g56aZ1llLhGpOhRWyQHLhFrWItQVywSoFzt6foyXzFdgYCDQFs7XyLKWf0VILb9
2G8xNGgVjf0mjPqF19XM8AP8oCpFAgB79cfaMc7jpF0TIf+EPw4=
=xJR+
-----END PGP SIGNATURE-----
Bug archived.
Request was from Debbugs Internal Request <owner@bugs.debian.org>
to internal_control@bugs.debian.org.
(Fri, 08 Dec 2017 07:31:12 GMT) (full text, mbox, link).
Debbugs is free software and licensed under the terms of the GNU General
Public License version 2. The current version can be obtained
from https://bugs.debian.org/debbugs-source/.