|
From: Ma p. <the...@gm...> - 2016-04-06 13:29:24
|
Hello,

I am trying to convert an ISO-8859 text file to UTF-8 Unicode text using DL. My problem is that the ISO-8859 encoding is not listed when I run iconv -l, and the conversion fails.

Do you have any idea how to add an encoding to this tool?

Kind regards
|
From: Philippe M. <ph...@oz...> - 2016-04-07 04:48:57
|
On Wed, Apr 06, 2016 at 03:29:16PM +0200, Ma poubelle wrote:
> Hello,
>
> I try to convert a ISO-8859 text file to UTF-8 Unicode text using DL. My
> problem is the ISO-8859 encoding is not present when I run iconv -l and the
> conversion fail.
>
> Do you have an idea how to add encoding for this tool ?

There is no plain iso-8859 encoding; there is iso-8859-x, where x is a number:
1 for Latin characters, 5 for Cyrillic, 6 for Arabic, 15 for Latin with the
euro sign, etc.

Can you share the exact command you are using?

Philippe
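
To illustrate why the part number matters, here is a minimal sketch that decodes the same byte as ISO-8859-1 and as ISO-8859-15. It assumes an iconv build that knows both encodings; the variable names are made up for the example.

```shell
#!/bin/sh
# Byte 0xA4 (octal 244) is the currency sign in ISO-8859-1
# but the euro sign in ISO-8859-15.
sample_dir=$(mktemp -d)
printf '\244\n' > "$sample_dir/sample.txt"

# Decode the same byte with each part and dump the UTF-8 result as hex.
latin1_hex=$(iconv -f ISO-8859-1 -t UTF-8 "$sample_dir/sample.txt" | od -An -tx1 | tr -d ' \n')
latin9_hex=$(iconv -f ISO-8859-15 -t UTF-8 "$sample_dir/sample.txt" | od -An -tx1 | tr -d ' \n')

echo "as ISO-8859-1:  $latin1_hex"   # c2a40a   (U+00A4, then newline)
echo "as ISO-8859-15: $latin9_hex"   # e282ac0a (U+20AC, then newline)
```

Picking the wrong part does not make iconv fail, since every byte is valid in both; it silently produces different characters, so the source encoding has to be known rather than guessed.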
|
From: Ma p. <the...@gm...> - 2016-04-07 12:27:54
|
Hi Philippe,

I know that iso-8859 is not a valid encoding name, but that is what the file command reports.

This is the output I get when I try to convert my file:

devil:~/tmp# iconv -l
The following list contain all the coded character sets known. This does
not necessarily mean that all combinations of these names can be used for
the FROM and TO command line parameters. One coded character set can be
listed with several different names (aliases).

10646-1:1993, 10646-1:1993/UCS4, ANSI_X3.4-1968, ANSI_X3.4-1986, ANSI_X3.4,
ASCII, CP367, CSASCII, CSUCS4, IBM367, ISO-10646, ISO-10646/UCS2,
ISO-10646/UCS4, ISO-10646/UTF-8, ISO-10646/UTF8, ISO-IR-6, ISO-IR-193,
ISO646-US, ISO_646.IRV:1991, OSF00010020, OSF00010100, OSF00010101,
OSF00010102, OSF00010104, OSF00010105, OSF00010106, OSF05010001, UCS-2,
UCS-2BE, UCS-2LE, UCS-4, UCS-4BE, UCS-4LE, UCS2, UCS4, UNICODEBIG,
UNICODELITTLE, US-ASCII, US, UTF-8, UTF8, WCHAR_T
devil:~/tmp# file source.CSV
source.CSV: ISO-8859 text, with CRLF line terminators
devil:~/tmp# iconv -f iso-8859-1 -t utf-8 source.CSV > dest.CSV
iconv: conversion from `iso-8859-1' is not supported
Try `iconv --help' or `iconv --usage' for more information.
devil:~/tmp# iconv -t utf-8 source.CSV > dest.CSV
iconv: illegal input sequence at position 185
devil:~/tmp# more /etc/Devil-release
Devil-Linux 1.4-i686

I see that I was trying this on an old release of DL. I will try a more recent release and report back.

Regards

Christian

------------------------------------------------------------------------------
_______________________________________________
Devil-linux-discuss mailing list
Dev...@li...
https://lists.sourceforge.net/lists/listinfo/devil-linux-discuss
|
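The failure above can be detected before touching the real file. The following is a minimal sketch, not something posted in the thread: `supports_encoding` and `to_utf8` are hypothetical helper names, and the demo input is generated on the spot instead of using the poster's source.CSV. It probes the encoding with a one-byte conversion rather than parsing `iconv -l`, whose list format varies between builds.

```shell
#!/bin/sh
# supports_encoding NAME: succeed if this iconv can convert from NAME to UTF-8.
supports_encoding() {
    printf 'x' | iconv -f "$1" -t UTF-8 >/dev/null 2>&1
}

# to_utf8 ENCODING INPUT OUTPUT: convert only if the encoding is available.
to_utf8() {
    if supports_encoding "$1"; then
        iconv -f "$1" -t UTF-8 "$2" > "$3"
    else
        echo "this iconv build does not know '$1'; check 'iconv -l'" >&2
        return 1
    fi
}

# Demo input: "café" in ISO-8859-1 with a CRLF line terminator.
demo_dir=$(mktemp -d)
printf 'caf\351\r\n' > "$demo_dir/source.CSV"
to_utf8 ISO-8859-1 "$demo_dir/source.CSV" "$demo_dir/dest.CSV" &&
    od -An -tx1 "$demo_dir/dest.CSV"
```

Note that iconv leaves the CRLF line terminators untouched; it only changes the character encoding. As for the second failing command in the message, when `-f` is omitted iconv assumes the current locale's encoding, which is probably why it stopped at the first byte that was not valid in that encoding.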
From: Philippe M. <ph...@oz...> - 2016-04-07 20:45:32
|
I'm running 1.6.8 and I have a much longer list of character sets available. It may have been a bug in your particular version.

Philippe
|
From: Ma p. <the...@gm...> - 2016-04-08 12:12:03
|
Hi,

It works fine using 1.6.8.

Thanks