Re: [PHP-I18N] Problem with specific kanji
From: Moriyoshi Koizumi (moriyoshi <email protected>)
Date: 05/13/03

Hi,

Your problem is caused by an incompatibility between the EUC-JP and
SJIS-WIN character sets, which is a common source of confusion: the
SJIS-WIN (aka CP932) character set has more characters than EUC-JP, and
the character mapped to "\xfa\xb1" doesn't belong to a commonly available
character set.

In mbstring, SJIS is defined to consist of two character sets,
[JISX0201-1976] and [JISX0208-1983], whereas SJIS-WIN consists of
[JISX0201-1976], [JISX0208-1983], [IBM extensions], [NEC extensions] and
[Microsoft extensions]; "\xfa\xb1" belongs to [IBM extensions].

The actual counterpart of Shift_JIS is EUC-JP, and that of SJIS-WIN is
"EUCJP-WIN", so converting between SJIS-WIN and "EUCJP-WIN" instead
should keep the character intact.
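
A minimal sketch of how the two routes could be compared, assuming PHP 7
or later with the mbstring extension loaded. The helper round_trips() is a
hypothetical name introduced here for illustration, and the encoding names
are written in mbstring's spelling ("SJIS-win", "eucJP-win"). The output
for "\xfa\xb1" may depend on the mbstring version, so none is claimed here.

```php
<?php
// Hypothetical helper (not from the thread): convert SJIS-win bytes to
// the encoding named by $via and back, then report whether the original
// bytes survive the round trip.
function round_trips(string $sjisBytes, string $via): bool
{
    $converted = mb_convert_encoding($sjisBytes, $via, "SJIS-win");
    $restored  = mb_convert_encoding($converted, "SJIS-win", $via);
    return $restored === $sjisBytes;
}

// "\x93\xfa" is a kanji from JIS X 0208, which both EUC variants cover.
var_dump(round_trips("\x93\xfa", "EUC-JP"));
var_dump(round_trips("\x93\xfa", "eucJP-win"));

// "\xfa\xb1" is the byte pair from this thread, in the IBM extensions.
var_dump(round_trips("\xfa\xb1", "EUC-JP"));
var_dump(round_trips("\xfa\xb1", "eucJP-win"));
```

If the last two results differ, the character is lost on the EUC-JP route
only, which would match the explanation above.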

Moriyoshi

"Muhammad Rofiq" <rofiq <email protected>> wrote:

>
> "Moriyoshi Koizumi" <moriyoshi <email protected>> wrote in message
> news:20030509222519YPfmB& <email protected>
> > "Muhammad Rofiq" <rofiq <email protected>> wrote:
> >
> > > $result = mb_convert_encoding($value, "EUC-JP", "SJIS-WIN");
> > >
> > > and when I revert back to SJIS-WIN using
> > >
> > > $original = mb_convert_encoding($result, "EUC-JP", "SJIS-WIN");
> >
> > Should this be
> >
> > $original = mb_convert_encoding($result, "SJIS-WIN", "EUC-JP");
> >
> > since it's reverse conversion?
>
> Oops, sorry, that is what I meant; I forgot to switch them while
> copy-and-pasting here.
>
> BTW, I did switch them in my PHP program, and mojibake still appears
> with the kanji I mentioned before.

-- 
PHP Internationalization Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php