Date: 05/14/03
- Previous message: Moriyoshi Koizumi: "Re: [PHP-I18N] UTF-8 string validity detection"
- In reply to: Moriyoshi Koizumi: "Re: [PHP-I18N] Problem with specific kanji"
- Next in thread: Muhammad Rofiq: "[PHP-I18N] Re: Problem with specific kanji"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
Thank you very much for this depth explanation, this really save my day.
So far all I know is the 3 encoding SJIS, EUC-JP, and ISO-2022-JP, this
encoding knowledge with -WIN suffix used in PHP is new for me. I'm curious
that this -WIN suffix only in PHP while in other programming term (eg.
MySQL), it's EUC-JP encoding already cover all charset that PHP's EUC-JP and
EUCJP-WIN has.
thank you
regards,
"Moriyoshi Koizumi" <moriyoshi <email protected>> wrote in message
news:200305140131331%FJ+j <email protected>
> Hi,
>
> Your problem is caused by the incompatiblity of character sets between
> EUC-JP and SJIS-WIN, which could lead users to confusion; the SJIS-WIN
> (aka CP932) character set has more characters than EUC-JP and the
> character mapped to "\xfa\xb1" doesn't belong to a commonly available
> character set.
>
> In mbstring, SJIS is defined to consist of two character sets:
> [JISX0201-1976] and [JISX0208-1983] and SJIS-WIN: [JISX0201-1976]
> [JISX0208-1983] [IBM extensions] [NEC extensions] and [Microsoft
> extensions], where "\xfa\b1" belongs to [IBM extensions].
>
> The actual counterpart of Shift_JIS is EUC-JP, and that of SJIS-WIN is
> "EUCJP-WIN".
>
> Moriyoshi
>
>
> "Muhammad Rofiq" <rofiq <email protected>> wrote:
>
> >
> > "Moriyoshi Koizumi" <moriyoshi <email protected>> wrote in message
> > news:20030509222519YPfmB& <email protected>
> > > "Muhammad Rofiq" <rofiq <email protected>> wrote:
> > >
> > > > $result = mb_convert_encoding($value, "EUC-JP", "SJIS-WIN");
> > > >
> > > > and when I revert back to SJIS-WIN using
> > > >
> > > > $original = mb_convert_encoding($result, "EUC-JP", "SJIS-WIN");
> > >
> > > Should this be
> > >
> > > $original = mb_convert_encoding($result, "SJIS-WIN", "EUC-JP");
> > >
> > > since it's reverse conversion?
> >
> > ups, sorry, i mean that way, i forget to switch it while doing
copy-n-paste
> > here.
> >
> > BTW I did switching it in my PHP program, and still moji-bake appears
with
> > kanji I mentioned before.
>
-- PHP Internationalization Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php
- Previous message: Moriyoshi Koizumi: "Re: [PHP-I18N] UTF-8 string validity detection"
- In reply to: Moriyoshi Koizumi: "Re: [PHP-I18N] Problem with specific kanji"
- Next in thread: Muhammad Rofiq: "[PHP-I18N] Re: Problem with specific kanji"
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

