Debian Bug report logs - #431231
tr: no UTF-8 support

version graph

Package: coreutils; Maintainer for coreutils is Michael Stone <[email protected]>; Source for coreutils is src:coreutils (PTS, buildd, popcon).

Reported by: Juhapekka Tolvanen <[email protected]>

Date: Sat, 30 Jun 2007 20:30:01 UTC

Severity: normal

Tags: confirmed, upstream

Merged with 139861, 388689, 613155, 649729, 721324

Found in versions coreutils/8.13-3, coreutils/5.97-5.3, coreutils/8.21-1, coreutils/5.97-5, coreutils/8.5-1, coreutils/5.96-3, coreutils/9.1-1, coreutils/6.10~20071127-1

Full log


🔗 View this message in rfc822 format

X-Loop: [email protected]
Subject: Bug#431231: tr fails with UTF-8
Reply-To: Juhapekka Tolvanen <[email protected]>, [email protected]
Resent-From: Juhapekka Tolvanen <[email protected]>
Resent-To: [email protected]
Resent-CC: Michael Stone <[email protected]>
Resent-Date: Tue, 25 Dec 2007 04:51:01 +0000
Resent-Message-ID: <[email protected]>
Resent-Sender: [email protected]
X-Debian-PR-Message: report 431231
X-Debian-PR-Package: coreutils
X-Debian-PR-Keywords: 
X-Debian-PR-Source: coreutils
Received: via spool by [email protected] id=B431231.119855814616194
          (code B ref 431231); Tue, 25 Dec 2007 04:51:01 +0000
Received: (at 431231) by bugs.debian.org; 25 Dec 2007 04:49:06 +0000
X-Spam-Checker-Version: SpamAssassin 3.1.4-bugs.debian.org_2005_01_02 
	(2006-07-26) on rietz.debian.org
X-Spam-Level: 
X-Spam-Status: No, score=-6.8 required=4.0 tests=BAYES_00,FORGED_RCVD_HELO,
	FOURLA,HAS_BUG_NUMBER autolearn=no 
	version=3.1.4-bugs.debian.org_2005_01_02
Received: from emh04.mail.saunalahti.fi ([62.142.5.110])
	by rietz.debian.org with esmtp (Exim 4.63)
	(envelope-from <[email protected]>)
	id 1J71iz-0004Bj-QB
	for [email protected]; Tue, 25 Dec 2007 04:49:06 +0000
Received: from saunalahti-vams (vs3-12.mail.saunalahti.fi [62.142.5.96])
	by emh04-2.mail.saunalahti.fi (Postfix) with SMTP id 493A713C3D0;
	Tue, 25 Dec 2007 06:49:04 +0200 (EET)
Received: from emh06.mail.saunalahti.fi ([62.142.5.116])
	by vs3-12.mail.saunalahti.fi ([62.142.5.96])
	with SMTP (gateway) id A057DC17DD7; Tue, 25 Dec 2007 06:49:04 +0200
Received: from juhtolv.dyndns.org (e82-103-193-107.elisa-laajakaista.fi [82.103.193.107])
	by emh06.mail.saunalahti.fi (Postfix) with ESMTP id 00CE0E51AC;
	Tue, 25 Dec 2007 06:48:59 +0200 (EET)
Received: by juhtolv.dyndns.org (Postfix, from userid 1000)
	id 91133332AE; Tue, 25 Dec 2007 06:48:58 +0200 (EET)
Date: Tue, 25 Dec 2007 06:48:59 +0200
From: Juhapekka Tolvanen <[email protected]>
To: Bob Proulx <[email protected]>, [email protected]
Cc: Hilko Bengen <[email protected]>, Colin Watson <[email protected]>
Message-ID: <[email protected]>
References: <[email protected]> <[email protected]>
MIME-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Disposition: inline
In-Reply-To: <[email protected]>
Organization: What?+ Me organized?+ Never!1
X-Mailer-URL: http://www.mutt.org/
X-Editor: Vim http://www.vim.org/
User-Agent: Mutt/1.5.17 (2007-11-01)
Content-Transfer-Encoding: quoted-printable
X-Antivirus: VAMS
On Sun, 01 Jul 2007, +01:24:14 EEST (UTC +0300),
Bob Proulx <[email protected]> pressed some keys:

> merge 431231 139861
> thanks
> 
> Juhapekka Tolvanen wrote:
> > ...report of a locale problem in tr deleted...
> 
> See also these related issues.
> 
>   http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=139861
>   http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=388689
> 
> It is a known deficiency in coreutils in general that the utilities
> are not multibyte aware.  The following can be found in the upstream
> source package TODO file.

Look at this:

% echo 'huuhaa öljy äiti über' | tr '[:lower:]' '[:upper:]'
HUUHAA öLJY äITI üBER
% echo 'huuhaa öljy äiti über' | /opt/heirloom/5bin/tr '[:lower:]' '[:upper:]'
HUUHAA ÖLJY ÄITI ÜBER

That Heirloom Toolchest is available here:

http://heirloom.sourceforge.net/

IMNSHO Debian project should package that toolchest, too. Maybe GNU
tools should be replaced with them, but that would be very radical.

Also other software from Heirloom -project should be Debian-packaged,
because they have superior UTF-8 -support when compared to
GNU-counterparts. For example their roff -implementation can handle
UTF-8 and OpenType-fonts.


-- 
Juhapekka "naula" Tolvanen * http colon slash slash iki dot fi slash juhtolv
"eiga wo miyou kimi no yakusoku dohri te wo tsunai de. yoru ni wa owakare
desu ringo to ichigo ga kusaru mae ni. yume wa hirogaru kimi no yakusoku
dohri kisu wo shi nagara."                                       Dir en grey




Send a report that this bug log contains spam.


Debian bug tracking system administrator <[email protected]>. Last modified: Tue May 13 11:22:43 2025; Machine Name: bembo

Debian Bug tracking system

Debbugs is free software and licensed under the terms of the GNU General Public License version 2. The current version can be obtained from https://bugs.debian.org/debbugs-source/.

Copyright © 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson, 2005-2017 Don Armstrong, and many other contributors.