
UTF-8 - character encoding

Unicode covers almost every existing character set. The preferred encoding form for the Unicode character set is UTF-8. It offers compatibility with ASCII, resistance to data corruption, and efficient, simple processing. But first things first.

Encoding forms

Computers do not work with numbers only as abstract mathematical objects. They work with fixed-size units of storage and data handling: bytes and 32-bit words. An encoding standard has to take this into account when deciding how to represent character numbers.

In computer systems, integers are stored in memory cells of 8 bits (1 byte), 16 bits, or 32 bits. Each Unicode encoding form defines which sequence of memory cells represents the integer that corresponds to a given character. The standard has three Unicode encoding forms, built on 8-, 16-, and 32-bit units. They are known accordingly as UTF-8, UTF-16, and UTF-32. The name UTF stands for Unicode Transformation Format. Each of the three encoding forms has advantages in particular applications, and each is an equally valid way of representing Unicode characters.

Any of these encoding forms can represent every character in the Unicode standard. In other words, they are fully compatible, so solutions that use different encoding forms for different reasons can still exchange data. Each encoding can be converted unambiguously to either of the other two without loss of data.
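The lossless conversion described above can be checked with a short Python sketch. It relies only on Python's built-in codecs; the sample string is an arbitrary choice made for this illustration and is not taken from the article.

```python
# Round-trip a string through the three Unicode encoding forms.
# The sample contains characters that need 1, 2, 3 and 4 bytes in UTF-8.
text = "A\u00e9\u6f22\U0001F600"

utf8 = text.encode("utf-8")
utf16 = utf8.decode("utf-8").encode("utf-16-le")
utf32 = utf16.decode("utf-16-le").encode("utf-32-le")
restored = utf32.decode("utf-32-le")

# True if nothing was lost along the way.
print(restored == text)
```

The "-le" suffix only fixes the byte order so that no byte order mark is written; it does not change the encoding form itself.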

The non-overlap principle

Every Unicode encoding form was designed with non-overlap in mind. By contrast, Windows code page 932 builds characters from one or two code bytes. The length of the sequence depends on the first byte, so the values of lead bytes in two-byte sequences never coincide with the values of single bytes. However, the values of single bytes and trail bytes can coincide. This means, for example, that a search for the character "D" (code 44) can wrongly match the second byte of the two-byte sequence for the character "Д" (code 84 44). To decide which interpretation is correct, the program has to take the preceding bytes into account.
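The false match can be reproduced with a small Python sketch. It assumes Python's "cp932" codec as a stand-in for Windows code page 932; the variable names are illustrative only.

```python
# Windows code page 932 (Python codec name "cp932") versus UTF-8.
cyrillic_de = "\u0414"  # the Cyrillic letter Д

cp932_bytes = cyrillic_de.encode("cp932")
utf8_bytes = cyrillic_de.encode("utf-8")

# In cp932 the trail byte is 0x44, the same value as ASCII "D",
# so a naive byte search reports a match that is not really there.
false_match_cp932 = b"D" in cp932_bytes

# In UTF-8 every byte of a multi-byte sequence is 0x80 or above,
# so an ASCII byte can never appear inside another character.
false_match_utf8 = b"D" in utf8_bytes

print(cp932_bytes.hex(), false_match_cp932)  # 8444 True
print(utf8_bytes.hex(), false_match_utf8)    # d094 False
```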

The situation is worse when lead bytes and trail bytes overlap. To resolve the ambiguity, the program then has to scan backward until it reaches the start of the text or an unambiguous code sequence. This is not only inefficient; it also gives no protection against errors, because a single corrupted byte can make the rest of the text unreadable.

The Unicode transformation formats avoid this problem, because the values of lead, trail, and single storage units never coincide. This makes all Unicode encodings suitable for searching and comparison: a match can never be a false result produced by different parts of two character codes lining up. Because these encoding forms follow the non-overlap principle, they differ from other multi-byte East Asian encodings.

Another aspect of non-overlap in the Unicode encodings is that every character has well-defined boundaries. There is no need to scan an indefinite number of preceding characters. This property is sometimes called self-synchronization. Corruption of one code unit damages only one character, and the surrounding characters stay intact. In the 8-bit transformation format, if a pointer lands on a byte of the form 10xxxxxx (in binary), finding the start of the character takes between one and three backward steps.
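The backward scan can be sketched in Python as follows. The function name char_start is hypothetical, and the sketch assumes the input is well-formed UTF-8.

```python
def char_start(data: bytes, index: int) -> int:
    """Return the index of the first byte of the character containing data[index]."""
    steps = 0
    # Continuation bytes have the bit pattern 10xxxxxx.
    # In well-formed UTF-8 at most three of them follow a lead byte.
    while index > 0 and (data[index] & 0xC0) == 0x80 and steps < 3:
        index -= 1
        steps += 1
    return index


# "a" (1 byte), the euro sign (3 bytes), an emoji (4 bytes)
data = "a\u20ac\U0001F600".encode("utf-8")
starts = [char_start(data, i) for i in range(len(data))]
print(starts)  # [0, 1, 1, 1, 4, 4, 4, 4]
```

Whatever byte the pointer starts on, the loop stops after at most three steps, which is the bound stated in the text.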

Equal status

The Unicode Consortium fully supports all three encoding forms. It is important not to set UTF-8 against Unicode: all of the transformation formats are equally legitimate forms of the character encoding defined by the Unicode standard.

Byte orientation

To represent a character, UTF-32 needs one 32-bit code unit, whose value equals the Unicode code point. UTF-16 needs one or two 16-bit units. UTF-8 uses up to 4 bytes.
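As a rough illustration of these counts, the Python sketch below reports how many code units a character takes in each encoding form. The helper code_units and the sample characters are assumptions made for this example.

```python
def code_units(ch: str) -> tuple[int, int, int]:
    """Return the number of code units for ch in (UTF-8, UTF-16, UTF-32)."""
    return (
        len(ch.encode("utf-8")),           # 8-bit units
        len(ch.encode("utf-16-le")) // 2,  # 16-bit units
        len(ch.encode("utf-32-le")) // 4,  # 32-bit units
    )


# A, e with acute accent, euro sign, an emoji outside the 16-bit range
for ch in "A\u00e9\u20ac\U0001F600":
    print(f"U+{ord(ch):04X}", code_units(ch))
```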

UTF-8 was designed for compatibility with byte-oriented, ASCII-based systems. Most existing software and established practice in information technology have long relied on representing characters as a sequence of bytes. Many protocols depend on the ASCII values staying unchanged, and either use or deliberately avoid particular control characters. A simple way to adapt Unicode to such situations is to represent Unicode characters with an 8-bit encoding in which every ASCII character or control character keeps its single-byte ASCII value. UTF-8 exists for exactly this purpose.

Variable length

UTF-8 is a variable-length encoding built from 8-bit storage units. The high bits of each byte show which part of a sequence that byte belongs to. One range of values is reserved for the first element of a code sequence and another for the elements that follow. This is what guarantees that the encoding does not overlap.

ASCII

UTF-8 fully supports the ASCII codes (0x00-0x7F). Unicode characters U+0000-U+007F are converted to the single bytes 0x00-0x7F in UTF-8, which makes them indistinguishable from ASCII. To avoid ambiguity, the values 0x00-0x7F are never used in any byte other than the single-byte representation of these characters. Non-ideographic characters outside ASCII, in the range U+0080-U+07FF, are encoded as two-byte sequences. Characters in the range U+0800-U+FFFF are represented by three bytes, and code points above U+FFFF require four bytes.
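The ranges above map directly onto a hand-written encoder. The following Python sketch is for illustration only (Python's own str.encode already does this); the function name utf8_encode is hypothetical, and the result is checked against the built-in codec.

```python
def utf8_encode(cp: int) -> bytes:
    """Encode one Unicode code point as UTF-8 (written out for illustration)."""
    if 0xD800 <= cp <= 0xDFFF or not 0 <= cp <= 0x10FFFF:
        raise ValueError("not a Unicode scalar value")
    if cp <= 0x7F:      # U+0000-U+007F: one byte, identical to ASCII
        return bytes([cp])
    if cp <= 0x7FF:     # U+0080-U+07FF: two bytes
        return bytes([0xC0 | (cp >> 6), 0x80 | (cp & 0x3F)])
    if cp <= 0xFFFF:    # U+0800-U+FFFF: three bytes
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    return bytes([0xF0 | (cp >> 18),    # above U+FFFF: four bytes
                  0x80 | ((cp >> 12) & 0x3F),
                  0x80 | ((cp >> 6) & 0x3F),
                  0x80 | (cp & 0x3F)])


for cp in (0x41, 0x414, 0x20AC, 0x1F600):
    # Cross-check against Python's built-in encoder.
    assert utf8_encode(cp) == chr(cp).encode("utf-8")
    print(f"U+{cp:04X} -> {utf8_encode(cp).hex(' ')}")
```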

Areas of application

UTF-8 is generally the preferred encoding in HTML and similar protocols.

XML became the first standard to fully support UTF-8, and standards organizations recommend it as well. The problem of supporting non-ASCII characters in URLs was settled when the W3C consortium and the IETF engineering group agreed to encode all URLs exclusively in UTF-8.

Compatibility with ASCII eases the transition to new software. Most text editors work with UTF-8, including JEdit, Emacs, BBEdit, Eclipse, and Notepad in the Windows operating system. No other Unicode encoding form can boast this level of tool support.

An advantage of the encoding is that it consists of a sequence of bytes. UTF-8 strings are easy to handle in C and other programming languages. It is also the only encoding form that requires neither a byte order mark (BOM) nor an encoding declaration in XML.

Self-synchronization

In an environment that processes characters as 8-bit units, UTF-8 has the following advantages over other multi-byte character sets:

  • The first byte of a code sequence carries information about the sequence length. This makes forward searching more efficient.
  • The start of a character is easy to find, because the lead byte is confined to a fixed range of values.
  • Byte values do not overlap.
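The first point in the list can be sketched in Python: the lead byte alone tells a forward scanner how far to jump. The function names below are hypothetical, and the sketch assumes well-formed UTF-8 input rather than validating every continuation byte.

```python
def sequence_length(lead: int) -> int:
    """Length in bytes of the UTF-8 sequence that starts with byte `lead`."""
    if lead < 0x80:
        return 1  # 0xxxxxxx: ASCII
    if 0xC2 <= lead <= 0xDF:
        return 2  # 110xxxxx
    if 0xE0 <= lead <= 0xEF:
        return 3  # 1110xxxx
    if 0xF0 <= lead <= 0xF4:
        return 4  # 11110xxx
    raise ValueError(f"0x{lead:02X} is not a valid lead byte")


def split_characters(data: bytes) -> list[bytes]:
    """Split well-formed UTF-8 into per-character byte sequences."""
    out, i = [], 0
    while i < len(data):
        n = sequence_length(data[i])
        out.append(data[i:i + n])
        i += n  # jump straight to the next lead byte
    return out


parts = split_characters("a\u0414\u20ac\U0001F600".encode("utf-8"))
print([p.hex() for p in parts])  # ['61', 'd094', 'e282ac', 'f09f9880']
```

A continuation byte (0x80-0xBF) passed as a lead byte raises ValueError, which reflects the fixed, separate value ranges mentioned in the list.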

Comparative advantages

UTF-8 is a compact encoding. However, East Asian text (Chinese, Japanese, and Korean written with Chinese characters) takes 3 bytes per character. UTF-8 is also slower to process than the other encoding forms. Sorting UTF-8 strings in binary order gives the same result as sorting by Unicode code point.
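The sorting claim, and the 3-byte cost for a Chinese character, can be checked with the Python sketch below. The word list is an arbitrary sample chosen for this illustration; UTF-16 is included only as a contrast.

```python
# z, e with acute accent, a CJK ideograph, a halfwidth symbol near the top
# of the 16-bit range, an emoji above U+FFFF, and a.
words = ["z", "\u00e9", "\u6f22", "\uffee", "\U0001F600", "a"]

by_code_point = sorted(words)  # Python compares str values by code point
by_utf8_bytes = sorted(words, key=lambda s: s.encode("utf-8"))
by_utf16_bytes = sorted(words, key=lambda s: s.encode("utf-16-be"))

print(by_utf8_bytes == by_code_point)   # True
print(by_utf16_bytes == by_code_point)  # False: surrogates sort below U+FFEE

# Storage cost of one CJK ideograph in bytes.
print(len("\u6f22".encode("utf-8")), len("\u6f22".encode("utf-16-be")))  # 3 2
```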

The character encoding scheme

A character encoding scheme consists of a character encoding form plus the way its code units are serialized into bytes. To identify the encoding scheme, the Unicode standard provides for an initial byte order mark (BOM).

Byte order is a concern only for encoding forms whose code units are larger than one byte. UTF-8 has no byte-order (endianness) problem, because its code unit is a single byte. Using a BOM with this encoding form is neither required nor recommended. A BOM can still appear in text that was converted from other encodings that use a byte order mark, or it can serve as a signature for UTF-8. It is the 3-byte sequence EF BB BF (hexadecimal).
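A minimal Python sketch for detecting and removing the UTF-8 signature is shown below. The helper name strip_utf8_bom is hypothetical; codecs.BOM_UTF8 and the "utf-8-sig" codec are part of Python's standard library.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 signature, if one is present."""
    if data.startswith(codecs.BOM_UTF8):  # the bytes EF BB BF
        return data[len(codecs.BOM_UTF8):]
    return data


with_bom = codecs.BOM_UTF8 + "hello".encode("utf-8")
print(codecs.BOM_UTF8.hex(" "))        # ef bb bf
print(strip_utf8_bom(with_bom))        # b'hello'
print(with_bom.decode("utf-8-sig"))    # hello
```

Decoding with "utf-8-sig" drops the signature when present and behaves like plain UTF-8 when it is absent, so it is a convenient choice when the origin of a file is unknown.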

How to set the UTF-8 encoding

In HTML, the UTF-8 encoding is set with the following code in the head section:

<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>

In PHP, UTF-8 is set with the header() function at the start of the file, after the error reporting level has been set:

<?php
error_reporting(-1);
header('Content-Type: text/html; charset=utf-8');

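When connecting to a MySQL database, UTF-8 is set as follows. The mysql_set_charset() call originally shown here belongs to the old mysql extension, which was removed in PHP 7.0, so the mysqli equivalent is used instead:

<?php
// $link is assumed to be an already open mysqli connection.
// 'utf8mb4' covers all of UTF-8; MySQL's 'utf8' stores at most 3 bytes per character.
mysqli_set_charset($link, 'utf8mb4');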

In a CSS file, the UTF-8 character encoding is declared as follows:

@charset "utf-8";

When saving files of any type, choose UTF-8 without a BOM; otherwise the site may not work correctly (in PHP, for example, a BOM is sent as output before any header() call). To do this in Dreamweaver, choose the menu item "Modify - Page Properties - Title/Encoding" and change the encoding to UTF-8. Then reload the page, clear the "Include Unicode Signature (BOM)" checkbox, and apply the changes. If any text on the page or in the database was entered in a different encoding, it has to be re-entered or re-encoded. When working with regular expressions, be sure to use the u modifier.

A file can also be saved as UTF-8 in Windows Notepad. Choose the menu item "File - Save As...", select the required encoding, and save the file as UTF-8.

In the Notepad++ text editor, if the encoding is anything other than UTF-8, use the menu item "Convert to UTF-8 without BOM" to change the encoding, then save the file.

No alternative

In a global setting, where political and linguistic borders are fading, character sets tied to a particular region are of little use. Unicode is the only character set that supports every localization. UTF-8 is an example of a proper implementation of Unicode, because it:

  • is supported by a wide range of tools, including through its compatibility with ASCII;
  • is resistant to data corruption;
  • is simple and efficient to process;
  • is platform independent.

With the arrival of UTF-8, debates about which character encoding form or character set is better have become pointless.

Copyright © 2018 mi.birmiss.com.