
Commit a8bd7e1

> Tatsuo Ishii wrote:
> > > > It was made to cope with encoding such as an Asian bloc in 7.2Beta2.
> > > >
> > > > Added ServerEncoding
> > > > Korean (JOHAB), Thai (WIN874),
> > > > Vietnamese (TCVN), Arabic (WIN1256)
> > > >
> > > > Added ClientEncoding
> > > > Simplified Chinese (GBK), Korean (UHC)
> > > >
> > > >
> > > > > http://www.sankyo-unyu.co.jp/Pool/postgresql-7.2b2.newencoding.diff.tar.gz
> > > > (608K)
> > >
> > > Looks good. I need some people to review this for me.
> >
> > For me they look good too. The only missing part is a
> > documentation. I will ask him to write it up. If he couldn't, I will
> > do it for him.
> > > The diff is 3mb
> > > but appears to address only additions to multibyte. I have attached a
> > > list of files it modifies. Also, look at the sizes of the mb/
> > > directory. It is getting large:
> > >
> > > 4 ./CVS
> > > 6 ./Unicode/CVS
> > > 3433 ./Unicode
> > > 6197 .
> >
> > Yes. We definitely need the on-the-fly encoding addition capability:
> > i.e. CREATE CHARACTER SET in the future...
> > --
> > Tatsuo Ishii
> >
> >

Address change.
http://www.sankyo-unyu.co.jp/Pool/postgresql-7.2.newencoding.diff.gz

Add PsqlODBC and document ...etc patch.

Eiji Tokuya
1 parent 0319443 commit a8bd7e1

File tree

17 files changed

+5959
-4911
lines changed

configure

Lines changed: 3 additions & 2 deletions
@@ -895,14 +895,15 @@ if test x"${enable_multibyte+set}" = xset; then
 enable_multibyte=yes
 
 case $enableval in
-SQL_ASCII|EUC_JP|EUC_CN|EUC_KR|EUC_TW|UNICODE|MULE_INTERNAL|LATIN1|LATIN2|LATIN3|LATIN4|LATIN5|LATIN6|LATIN7|LATIN8|LATIN9|LATIN10|KOI8|WIN|ALT|ISO_8859_5|ISO_8859_6|ISO_8859_7|ISO_8859_8)
+SQL_ASCII|EUC_JP|EUC_CN|EUC_KR|EUC_TW|JOHAB|UNICODE|MULE_INTERNAL|LATIN1|LATIN2|LATIN3|LATIN4|LATIN5|LATIN6|LATIN7|LATIN8|LATIN9|LATIN10|WIN1256|TCVN|WIN874|KOI8|WIN|ALT|ISO_8859_5|ISO_8859_6|ISO_8859_7|ISO_8859_8)
 MULTIBYTE=$enableval;;
 *)
 { echo "configure: error: argument to --enable-multibyte must be one of:
-SQL_ASCII, EUC_JP, EUC_CN, EUC_KR, EUC_TW,
+SQL_ASCII, EUC_JP, EUC_CN, EUC_KR, EUC_TW, JOHAB,
 UNICODE, MULE_INTERNAL,
 LATIN1, LATIN2, LATIN3, LATIN4, LATIN5,
 LATIN6, LATIN7, LATIN8, LATIN9, LATIN10,
+WIN1256, TCVN, WIN874,
 KOI8, WIN, ALT,
 ISO_8859_5, ISO_8859_6, ISO_8859_7, ISO_8859_8
 Or do not specify an argument to the option to use the default." 1>&2; exit 1; };;
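
The effect of the widened `case` pattern can be checked in isolation. The following is a minimal sketch, assuming a POSIX shell; `validate_multibyte` is a hypothetical helper name that configure does not define, and the error text is simplified. The pattern itself is copied from the added line in the hunk above.

```shell
#!/bin/sh
# validate_multibyte is an illustrative (hypothetical) wrapper around the
# same case pattern that configure uses for --enable-multibyte.
validate_multibyte() {
    case $1 in
        SQL_ASCII|EUC_JP|EUC_CN|EUC_KR|EUC_TW|JOHAB|UNICODE|MULE_INTERNAL|LATIN1|LATIN2|LATIN3|LATIN4|LATIN5|LATIN6|LATIN7|LATIN8|LATIN9|LATIN10|WIN1256|TCVN|WIN874|KOI8|WIN|ALT|ISO_8859_5|ISO_8859_6|ISO_8859_7|ISO_8859_8)
            # Accepted: configure would set MULTIBYTE to this value.
            echo "MULTIBYTE=$1"
            return 0;;
        *)
            # Rejected: configure would print its longer message and exit 1.
            echo "error: unsupported --enable-multibyte argument: $1" 1>&2
            return 1;;
    esac
}

# JOHAB is one of the server encodings added by this commit.
validate_multibyte JOHAB

# GBK is listed in the commit message as a client encoding only,
# so the server-side pattern does not match it.
validate_multibyte GBK || echo "GBK is not accepted as a server encoding"
```

In a real build the equivalent check is triggered by `./configure --enable-multibyte=JOHAB`.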

configure.in

Lines changed: 3 additions & 3 deletions
@@ -179,16 +179,16 @@ PGAC_ARG_OPTARG(enable, multibyte, [ --enable-multibyte enable multibyte c
 [MULTIBYTE=SQL_ASCII],
 [
 case $enableval in
-SQL_ASCII|EUC_JP|EUC_CN|EUC_KR|EUC_TW|UNICODE|MULE_INTERNAL|LATIN1|LATIN2|LATIN3|LATIN4|LATIN5|LATIN6|LATIN7|LATIN8|LATIN9|LATIN10|KOI8|WIN|ALT|ISO_8859_5|ISO_8859_6|ISO_8859_7|ISO_8859_8)
+SQL_ASCII|EUC_JP|EUC_CN|EUC_KR|EUC_TW|JOHAB|UNICODE|MULE_INTERNAL|LATIN1|LATIN2|LATIN3|LATIN4|LATIN5|LATIN6|LATIN7|LATIN8|LATIN9|LATIN10|WIN1256|TCVN|WIN874|KOI8|WIN|ALT|ISO_8859_5|ISO_8859_6|ISO_8859_7|ISO_8859_8)
 MULTIBYTE=$enableval;;
 *)
 AC_MSG_ERROR(
 [argument to --enable-multibyte must be one of:
-SQL_ASCII, EUC_JP, EUC_CN, EUC_KR, EUC_TW,
+SQL_ASCII, EUC_JP, EUC_CN, EUC_KR, EUC_TW, JOHAB,
 UNICODE, MULE_INTERNAL,
 LATIN1, LATIN2, LATIN3, LATIN4, LATIN5,
 LATIN6, LATIN7, LATIN8, LATIN9, LATIN10,
-KOI8, WIN, ALT,
+WIN1256, TCVN, WIN874, KOI8, WIN, ALT,
 ISO_8859_5, ISO_8859_6, ISO_8859_7, ISO_8859_8
 Or do not specify an argument to the option to use the default.]);;
 esac

doc/README.mb.jp

Lines changed: 47 additions & 9 deletions
@@ -45,6 +45,7 @@ PostgreSQL 7.2 multi-byte (MB) support README 2001/9/18 $B:n@.(B
 EUC_CN GB $B$r%Y!<%9$K$7$?CfJ8(BEUC$B!%(Bcode set 2 $B$O(B
 SS2+2$B%P%$%H%3!<%I(B = 3$B%P%$%HI=8=$G$9!%(B
 EUC_KR $B4Z9q8l(B EUC$B!%(B
+JOHAB $B%O%s%0%k%Y!<%9$N4Z9q8l(BEUC.
 EUC_TW $BBfOQ$N(B EUC$B!%(Bcode set 2 $B$O(B
 SS2+$BLLHV9f(B+2$B%P%$%H%3!<%I(B = 4$B%P%$%HI=8=$G$9!%(B
 UNICODE UTF-8$B!%$?$@$7%5%]!<%H$9$k$N$O(B UCS-2 $B$NHO0O!$(B
@@ -56,6 +57,9 @@ PostgreSQL 7.2 multi-byte (MB) support README 2001/9/18 $B:n@.(B
 $B%-%j%kJ8;z(B KOI8(KOI8-R), WIN(CP1251), ALT(CP866)$B$r%5%]!<%H(B
 $B$7$F$$$^$9!%$b$A$m$s(B ISO 8859-5 $B$b;HMQ2DG=$G$9!%(B
 $B$3$N>l9g!$(B"LATIN5" $B$H$7$F;XDj$7$F2<$5$$!%(B
+WIN1256 $B%"%i%V=t9q8l(BWindows$BMQ%(%s%3!<%G%#%s%0(B.
+TCVN $B%Y%H%J%`8l(B."ABC"$B$d(B"VSCII"$B$b;HMQ2DG=(B.
+WIN874 $B%?%$8l(B.
 
 $BA*Br$NL\0B$H$7$F$O!$1Q8l$HF|K\8l$7$+;H$o$J$$>l9g$O(B EUC_JP($BF1MM$K!$Cf(B
 $B9q8l$7$+;H$o$J$$>l9g$O(B EUC_CN... $B$J$I$H$J$j$^$9(B)$B!$$=$NB>$N8@8l$b;H$$$?(B
@@ -165,22 +169,40 @@ $ psql -l
 $B%(%s%3!<%G%#%s%0(B
 ----------------------------------------------------------------
 EUC_JP EUC_JP, SJIS, UNICODE
-
+
 EUC_TW EUC_TW, BIG5, UNICODE
-
+
+EUC_CN EUC_CN, UNICODE
+
+EUC_KR EUC_KR, UNICODE
+
+JOHAB JOHAB, UNICODE
+
 LATIN1,3,4 LATIN1,3,4, UNICODE
 
 LATIN2 LATIN2, WIN1250, UNICODE
-
+
 LATIN5 LATIN5, WIN, ALT, UNICODE
-
+
+LATIN6,7,8,9,10 LATIN6,7,8,9,10, UNICODE
+
+ISO_8859_5,6,7,8 ISO_8859_5,6,7,8, UNICODE
+
+WIN1256 WIN1256, UNICODE
+
+TCVN TCVN, UNICODE
+
+WIN874 WIN874, UNICODE
+
 MULE_INTERNAL EUC_JP, SJIS, EUC_KR, EUC_CN,
 EUC_TW, BIG5, LATIN1$B$+$i(B5,
 WIN, ALT, WIN1250
 
-UNICODE EUC_JP, SJIS, EUC_KR, EUC_CN,
-EUC_TW, BIG5, LATIN1$B$+$i(B5,
-WIN, ALT, WIN1250
+UNICODE EUC_JP, SJIS, EUC_KR, UHC,
+EUC_CN, GBK, EUC_TW, BIG5,
+LATIN1$B$+$i(B10, ISO_8859_5$B$+$i(B8,
+WIN, ALT, WIN1250, WIN1256,
+TCVN, WIN874, JOHAB
 ----------------------------------------------------------------
 
 $B%P%C%/%(%s%I$H%U%m%s%H%(%s%I$N%(%s%3!<%G%#%s%0$,0[$J$k>l9g!$$=$N$3$H(B
@@ -390,12 +412,28 @@ o set client_encoding $B%3%^%s%I$r;H$&J}K!(B
 ISO 8859-3 8859-3.TXT
 ISO 8859-4 8859-4.TXT
 ISO 8859-5 8859-5.TXT
-EUC_JP JIS0201.TXT, JIS0208.TXT, JIS0212.TXT
+ISO 8859-6 8859-6.TXT
+ISO 8859-7 8859-7.TXT
+ISO 8859-8 8859-8.TXT
+ISO 8859-9 8859-9.TXT
+ISO 8859-10 8859-10.TXT
+ISO 8859-13 8859-13.TXT
+ISO 8859-14 8859-14.TXT
+ISO 8859-15 8859-15.TXT
+ISO 8859-16 8859-16.TXT
+EUC_JP JIS0201.TXT, JIS0208.TXT, JIS0212.TXT,
+CP932.TXT, sjis.map
 SJIS CP932.TXT
 EUC_CN GB2312.TXT
-EUC_KR OLD5601.TXT
+GBK CP936.TXT
+EUC_KR KSX1001.TXT
+UHC CP949.TXT
+JOHAB JOHAB.TXT
 EUC_TW CNS11643.TXT
 Big5 BIG5.TXT
+WIN1256 CP1256.TXT
+TCVN CP1258.TXT
+WIN874 CP874.TXT
 ============================================================
 
 $B<U<-!'(B

doc/src/sgml/charset.sgml

Lines changed: 57 additions & 4 deletions
@@ -1,4 +1,4 @@
-<!-- $Header: /cvsroot/pgsql/doc/src/sgml/charset.sgml,v 2.21 2002/01/20 22:19:55 petere Exp $ -->
+<!-- $Header: /cvsroot/pgsql/doc/src/sgml/charset.sgml,v 2.22 2002/03/05 05:52:42 momjian Exp $ -->
 
 <chapter id="charset">
 <title>Localization</>
@@ -370,6 +370,10 @@ perl: warning: Falling back to the standard locale ("C").
 <entry><literal>EUC_KR</literal></entry>
 <entry>Korean <acronym>EUC</></entry>
 </row>
+<row>
+<entry><literal>JOHAB</literal></entry>
+<entry>Korean <acronym>EUC</> (Hangle base)</entry>
+</row>
 <row>
 <entry><literal>EUC_TW</literal></entry>
 <entry>Taiwan <acronym>EUC</acronym></entry>
@@ -450,6 +454,18 @@ perl: warning: Falling back to the standard locale ("C").
 <entry><literal>ALT</literal></entry>
 <entry>Windows CP866</entry>
 </row>
+<row>
+<entry><literal>WIN1256</literal></entry>
+<entry>Arabic Windows CP1256</entry>
+</row>
+<row>
+<entry><literal>TCVN</literal></entry>
+<entry>Vietnamese TCVN-5712(Windows CP1258)</entry>
+</row>
+<row>
+<entry><literal>WIN874</literal></entry>
+<entry>Thai Windows CP874</entry>
+</row>
 </tbody>
 </tgroup>
 </table>
@@ -580,6 +596,21 @@ $ <userinput>psql -l</userinput>
 <literal>UNICODE</literal>, <literal>MULE_INTERNAL</literal>
 </entry>
 </row>
+<row>
+<entry><literal>EUC_CN</literal></entry>
+<entry><literal>EUC_CN</literal>, <literal>UNICODE</literal>, <literal>MULE_INTERNAL</literal>
+</entry>
+</row>
+<row>
+<entry><literal>EUC_KR</literal></entry>
+<entry><literal>EUC_KR</literal>, <literal>UNICODE</literal>, <literal>MULE_INTERNAL</literal>
+</entry>
+</row>
+<row>
+<entry><literal>JOHAB</literal></entry>
+<entry><literal>JOHAB</literal>, <literal>UNICODE</literal>
+</entry>
+</row>
 <row>
 <entry><literal>EUC_TW</literal></entry>
 <entry><literal>EUC_TW</literal>, <literal>BIG5</literal>,
@@ -682,15 +713,19 @@ $ <userinput>psql -l</userinput>
 <entry><literal>UNICODE</literal></entry>
 <entry>
 <literal>EUC_JP</literal>, <literal>SJIS</literal>,
-<literal>EUC_KR</literal>, <literal>EUC_CN</literal>,
+<literal>EUC_KR</literal>, <literal>UHC</literal>, <literal>JOHAB</literal>,
+<literal>EUC_CN</literal>, <literal>GBK</literal>,
 <literal>EUC_TW</literal>, <literal>BIG5</literal>,
 <literal>LATIN1</literal> to <literal>LATIN10</literal>,
 <literal>ISO_8859_5</literal>,
 <literal>ISO_8859_6</literal>,
 <literal>ISO_8859_7</literal>,
 <literal>ISO_8859_8</literal>,
-<literal>WIN</literal>, <literal>ALT</literal>,
-<literal>KOI8</literal>
+<literal>WIN</literal>, <literal>ALT</literal>,
+<literal>KOI8</literal>,
+<literal>WIN1256</literal>,
+<literal>TCVN</literal>,
+<literal>WIN874</literal>,
 </entry>
 </row>
 <row>
@@ -720,6 +755,24 @@ $ <userinput>psql -l</userinput>
 <literal>UNICODE</literal>, <literal>MULE_INTERNAL</literal>
 </entry>
 </row>
+<row>
+<entry><literal>WIN1256</literal></entry>
+<entry><literal>WIN1256</literal>,
+<literal>UNICODE</literal>
+</entry>
+</row>
+<row>
+<entry><literal>TCVN</literal></entry>
+<entry><literal>TCVN</literal>,
+<literal>UNICODE</literal>
+</entry>
+</row>
+<row>
+<entry><literal>WIN874</literal></entry>
+<entry><literal>WIN874</literal>,
+<literal>UNICODE</literal>
+</entry>
+</row>
 </tbody>
 </tgroup>
 </table>

src/backend/utils/mb/Unicode/UCS_to_EUC_KR.pl

Lines changed: 2 additions & 2 deletions
@@ -2,7 +2,7 @@
 #
 # Copyright 2001 by PostgreSQL Global Development Group
 #
-# $Id: UCS_to_EUC_KR.pl,v 1.2 2001/04/16 06:10:18 ishii Exp $
+# $Id: UCS_to_EUC_KR.pl,v 1.3 2002/03/05 05:52:45 momjian Exp $
 #
 # Generate UTF-8 <--> EUC_KR code conversion tables from
 # map files provided by Unicode organization.
@@ -20,7 +20,7 @@
 
 # first generate UTF-8 --> EUC_KR table
 
-$in_file = "OLD5601.TXT";
+$in_file = "KSX1001.TXT";
 
 open( FILE, $in_file ) || die( "cannot open $in_file" );
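
The script reads a map file such as `KSX1001.TXT` to build its conversion tables. The sketch below shows only the reading step, and it is not the Perl script's actual logic. It assumes that each data line has the form `0xLEGACY 0xUCS # name` and that lines starting with `#` are comments. `read_map` is a hypothetical helper name, and the sample file holds a single illustrative line in that assumed format rather than the real map contents.

```shell
#!/bin/sh
# read_map (hypothetical): print "UCS LEGACY" pairs from a map file whose
# data lines are assumed to look like "0xLEGACY 0xUCS # name".
read_map() {
    while read -r legacy ucs _rest; do
        case $legacy in
            0x*) ;;             # data line, keep going
            *) continue;;       # comment or blank line, skip it
        esac
        case $ucs in
            0x*) printf '%s %s\n' "$ucs" "$legacy";;
        esac
    done < "$1"
}

# Illustrative input only; a real run would point at KSX1001.TXT.
sample=$(mktemp)
cat > "$sample" <<'EOF'
# sample line in the assumed format
0x2121 0x3000 # IDEOGRAPHIC SPACE
EOF

read_map "$sample"
rm -f "$sample"
```

Printing the Unicode value first mirrors the direction named in the script's comment (UTF-8 to EUC_KR). Converting the code points into UTF-8 and EUC_KR byte sequences is left out of this sketch.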
