Showing posts with label Unicode. Show all posts

Monday 8 September 2014

Unicode SVG gives correct applet


漢字 · かんじ

Aule Kanji Pages · Kanji Recog Pages

Exposing the Unicode values has resulted in a correction to my code-as-data ...



Can you see where that would have been ?



Wednesday 3 September 2014

Unicode sorted kanji Conning Halpern



Unicode-sorted kanji of Conning KLC with Halpern KLD id


This is my new Curl applet with a Unicode-sort version of the Conning and Halpern kanji with Kodansha KLC mapped to Kodansha KLD :



The RichText edit widget is a scratch work pad for building sets of kanji to review on any given day.


Tuesday 5 August 2014

kanji index for our KanjiRecog page 2



The 1,189 kanji in our TEXT PAGE 2 for KanjiRecog are

丁七万三上下不与世丙中丸丹主久之乗九也乱了予争事二五井亜交京人仁仇今介仕他付仙代令以仮仲件任企伊会伝伯伴伸似位住佐体何余佛作佳併使例侍侑供価侵係俊保信修俳俵俺倉個倍倒候倣倫健側偶傑備傳傷働像僕僚億優元兄充兆先光免児入全八公六共兵具典内円冊再冒冗写冠凝凡処凱出刀分切刊初判別利到制刷刻則前剣創劇力功加助努労効勇勉動勘務勝募勤勲化北匠区十千午半卒卓協南単卜印原厳去参又及友反収取受叙口古句可台史右号司合吉同名向君否含呂告周味呼命和哀品員唄唐商問啓喜喬営嗜嗣嘆嘉四回団困図固国國園土在圭地坂坊型垣城執基堀堂堯報場塔塗塩境墓増墨壊士声売変夏夕外多夢大天太夫失奈契女奴好如妥妨妻委姿娘娯婚嫌子字存孝季学孫宅宇守安完官宙定宝実客室宮宴家容宿寄寒寛實審寶寺対封専射将尊尋導小少尚尽尾局居屋屍展属履山岡岩岸峰島崇崎崑崩嵐嶺川州巡巣工左巧巨差巳巻市希帝師席帰常幅幡平年幸幹幼広底店府度座庫康廃廉延建弁式弓引弘弟弥張強当形彦彫彰影役彼待後従得復徳徴徹心必忍志応忠念怒怖思性恋恐恩息恵悔悠悩悪悼情惚惠想意愛感態慰憎憶懐成我戚戦戯戸戻房所手才打扱批承技投折抜抵担拒招拠括拾持指挙挟挫振授掛採接控推描提摩撃撮擦支改放政敏敗教敬数敵敷文斎料断新方於施旅旋族旗既日旧旨早旬旺昇昌明星映春昭是時晃晋晩景晴智暁暇暮暴曜曰曲更書曾最月有朋服朗望朝期木未末本杉村条来東松板林枚果枝枠枷柳査栄校根格桂桃案桑條棄棒棚森植検椿楠業極楼楽概榎構様樋模権横樹橋機櫃欠次欣欲歌止正武歳歴死残殴段殺殿母毎比民気水汁求江池汰決沙没沢河油治沼沿況泉法波注泳洋津活派浅浜浦浩浪浴海消涯淀淑淡深淳混添清済減渡湯満準溝滝演漢潜潤潮潰澤激濡瀬点為無然焼煙照煮熊熱燃爆父版牛牧物特犬状狂独猪献獄獅獣玉王玲珍現理瑞璧瓶甚生産甥用田由申男町画界番異疑痴療癌発登白百的益盗盛盟監目直相省看県眞真眠眼着督矢知石砂研砕砦破確磨示礼社神祭福秀私秋科秘秦称移税稚種稲稿穂究空突窓窮立章端竹笑笠第筆等筑答策算管節築篤籍米粂粵精系紀約納純級素細紳紹終組経結絞絡給統絵絶絹継続綴緊総緒線締編練縁繁繋繰續置署罵羅美群義翌習翻翼考者聖聞聰職肇肉肖育背能脂脇脚脱脳腕腰膳臨自興舎舘舞舟航般船良色芥花芳芸若苦英茅茉茶草荏荒荘莉菅菊菜華萌萩萬落葉著蒙蓮蔵薦薩薫藝藤蘭虎虹蛛蜘蝦蝶蟇衆行術街衛衝衣表衰裁装裏裕補裳製複西要見規視覚覧親観角解触言計訊記訣訪設許訳訴証評詞詩話該詳誉誌認誕誘語誤説読課調談論諦諸諾謎講謝識議譲護谷豆豊豚象貞負財販貫責貴買貸費賀資賛賞質購赤走起超越趣足跡路踏躍身車軋軍軒転載轟轢辰農辺辻込辿近返迫述追退送逃通逝造連週進逸遂遅遇運過道達遠遣適遭選遺邦邸郎郡部郷都配酒酔酷醉醜里重野量金釜針鈴鉄銀鋭録鍵鎌鑑長門閉開間関閲闘阿附降限院陣除陳陶陽隆隊階際隠雄雅集雑離難雨雪電露青静非面音響頁頃項順須頓頭頻頼題額顔顕願類風飛食飯飲養館香馬駿騎験骨高鰤鶏鷲黄黎黒龍

These kanji you may need to look up outside the usual books :

佛侑傳國堯實寶屍崑惠曰曾枷條櫃甥粂粵續聰茉莉蛛蜘蟇裳訣軋轢辿醉鰤

But remember : the point of the exercise is recognition as an aid to reading. Once you are used to "reducing" a text with "Replace with nothing" edits, you may want to grab large pieces of text, remove all of the kana and most of the kanji you have mastered, and then work down through what remains, removing what you recognize until you are forced to identify kanji by their features and your own mnemonics.
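The "reducing" step can also be scripted. What follows is only a minimal Python sketch of the idea, not the editor workflow described above; the function name `reduce_text` and its `mastered` parameter are made up for illustration. It removes the hiragana and katakana blocks and any kanji you list as mastered, and leaves everything else (including punctuation) in place.

```python
import re

# Hiragana (U+3040-U+309F) and katakana (U+30A0-U+30FF) blocks.
KANA = re.compile(r"[\u3040-\u309F\u30A0-\u30FF]")

def reduce_text(text, mastered=""):
    """Strip kana and already-mastered kanji, leaving the rest to work through."""
    known = set(mastered)
    without_kana = KANA.sub("", text)
    return "".join(ch for ch in without_kana if ch not in known)

# Hypothetical sample sentence: kana are removed, and so is the mastered kanji 字.
print(reduce_text("漢字を読みます", mastered="字"))
```

Adding each newly recognized kanji to `mastered` and running it again mimics the repeated "Replace with nothing" passes.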



Wednesday 18 September 2013

250 Essential Kanji for Everyday Use, Bk1


Book 1 of the Tuttle series "250 Essential Kanji" has the oddity ( in my edition ) of not only having no Unicode value for a kanji but also no Tuttle flashcard number.

Take page 121, for example.

The kanji are entries 139, 140 and 141.

These are 証 確 and 認.

Their Tuttle flashcard numbers are 754, 805 and 978.

If you are interested in an Android app to accompany the book, drop a note at info AT aule-browser.com and I'll convert some Curl RIA desktop code into Curl CAEDE code and we'll have that app.

The app would show you that these kanji are

証  ショウ / あかし  8A3C
確  カク·コウ / たし.か·たし.かめる  78BA
認  ニン / みと.める·したた.める  8A8D

If you know the 4-digit hex UTF-16 value of a kanji, I have a simple app that gives you the character itself, which you can then cut-n-paste into a web page, a URL or a Word document – or into a Find field on a Japanese page encoded in EUC-JP, Shift JIS or whichever other encoding is confounding your search. One ring of code to bind them all.
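For readers without the app, the same lookup is a one-liner in most languages. This is a small Python sketch, not the Curl code mentioned above; `kanji_from_hex` is a name chosen here for illustration. It turns a hex value into the character and shows that the one character has different byte sequences in UTF-8, EUC-JP and Shift JIS, which is why a pasted character can still be found on a legacy-encoded page once the browser has decoded it.

```python
def kanji_from_hex(hex_value):
    """Return the character for a hexadecimal code point such as '8A3C'."""
    return chr(int(hex_value, 16))

for hex_value in ("8A3C", "78BA", "8A8D"):
    kanji = kanji_from_hex(hex_value)
    # Same character, three different byte sequences.
    print(hex_value, kanji,
          kanji.encode("utf-8").hex(),
          kanji.encode("euc_jp").hex(),
          kanji.encode("shift_jis").hex())
```

For kanji in the Basic Multilingual Plane, the 4-digit value is both the code point and the single UTF-16 code unit, so the two can be used interchangeably here.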

Btw, their Kodansha Essential Kanji entries are 1672, 1409 and 1690.

And which two of the above kanji form the verb kakuninsuru ?  Sooo desu ne !




Saturday 24 August 2013

kana kanji font フォント


I am searching diligently for a Japanese Unicode web font ( フォント ) with fine ( light or extra-light ) kana (hiragana, katakana and furigana) but strongly stroked kanji, so that in a phrase with only a few kanji the kanji characters will stand out.

The alternative is to use extensive markup or to script based on the binary value of the characters.

What approach will I take for web languages which have string types or values, but no char type?
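One possible approach, sketched in Python (which itself has strings but no char type): treat each character as a one-character string and classify it by code point. The names `script_of` and `runs` are invented for this sketch, and the ranges cover only the basic hiragana, katakana and CJK Unified Ideographs blocks.

```python
def script_of(ch):
    """Classify a one-character string by its code point."""
    cp = ord(ch)
    if 0x3040 <= cp <= 0x309F:
        return "hiragana"
    if 0x30A0 <= cp <= 0x30FF:
        return "katakana"
    if 0x4E00 <= cp <= 0x9FFF:
        return "kanji"
    return "other"

def runs(text):
    """Group consecutive characters of the same script: [(script, substring), ...]."""
    out = []
    for ch in text:
        kind = script_of(ch)
        if out and out[-1][0] == kind:
            out[-1] = (kind, out[-1][1] + ch)
        else:
            out.append((kind, ch))
    return out

print(runs("漢字のフォント"))
```

Each run could then be wrapped in markup with its own font weight, which is the "script based on the value of the characters" alternative to hand-written markup.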







Tuesday 30 July 2013

Shiki spring kigo haiku 1899


Here is a snapshot of the Curl web browser applet with Shiki’s spring 〔春 はる〕 kigo haiku from 1899 in UTF-8 char-encoding.

Above you see the kigo index to the left and the pop-up menu to copy characters which can then be sought using CTRL-f and the Find menu.




Wednesday 24 July 2013

TEI Japanese HTML character encoding


Is there any longer a good reason for funded text initiatives on the web NOT to be UTF-8 Unicode ?

TEI at http://etext.lib.virginia.edu/japanese/hyakunin/frames/index/hyaku3euc.html#euc2 has a Japanese poetry page with a click-driven "swiping frames" metaphor - one of which is a view showing the kanji for haiku, waka or tanka in the char-encoding CHARSET=x-euc-jp 

Oi vey!

CHARSET=x-euc-jp ?!? No, not HTML from a server in Kyoto. The server is in VA. In collaboration with academics in PA.
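For what it is worth, the conversion itself is not the obstacle. A minimal Python sketch, assuming the page bytes are valid EUC-JP (the sample string below is a stand-in, not text fetched from that site):

```python
def euc_jp_to_utf8(raw):
    """Decode EUC-JP bytes and re-encode them as UTF-8."""
    return raw.decode("euc_jp").encode("utf-8")

sample = "百人一首".encode("euc_jp")   # stand-in for bytes read from a legacy page
converted = euc_jp_to_utf8(sample)
print(converted.decode("utf-8"))
```

A real page would also need its CHARSET declaration changed to match.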

There is an  anti-pattern  documented for this IT phenomenon in projects within organizations exempt from accountability and competitive pressure or managerial consequences ( and perhaps 2 or more anti-patterns specific to this web design and its survival on this academic web site.)

Ah, the peaceful, never over-loaded servers of  e-text  initiatives in academe ! Concurrent user load ? Not a worry.




Wednesday 24 April 2013

Wakan tool kanji tip


So I have copied a Japanese web page to my clipboard and now have its kanji displayed in Wakan using the "Character" menu choice for the clipboard content.  How do I now get a copy of any one of those unique and sorted kanji for a text search elsewhere ?

TIP : I display the UNICODE on the Wakan character details so I copy that 4-hex value and enter it into a CTRL-f search at http://kanji.aule-browser.com/henshall-sorted-urlencoded.html and very often get a hit and just copy that kanji from its row.

My next stop now that I have the kanji is often jisho.org

Why not just go to jisho and search by Unicode ? Because if I must do several kanji, I find that having the web page linked above open in a browser tab is simply faster - most often ... and the Henshall number is in plain view.




Saturday 20 April 2013

Kodansha Essential kanji 200 to 300


Here are some of the most frequent kanji with a Kodansha Essential id and mapped to Unicode in CSV format


ROW,FREQ,KANJI,UNICODE,RR500,Henshall,KE-id
201,247,両,4E21,R016,H0411,KE0015
202,271,争,4E89,R021,H0529,KE0274
203,212,件,4EF6,R000,H0660,KE0066
204,217,任,4EFB,R000,H0764,KE0069
205,278,企,4F01,R000,H1120,KE0048
206,276,位,4F4D,R000,H0421,KE0072
207,270,住,4F4F,R040,H0310,KE0078
208,219,使,4F7F,R042,H0287,KE0089
209,250,価,4FA1,R000,H0626,KE0086
210,232,係,4FC2,R000,H0268,KE0268
211,208,信,4FE1,R045,H0513,KE0513
212,216,側,5074,R000,H0535,KE0119
213,275,再,518D,R060,H0679,KE0162
214,214,別,5225,R069,H0579,KE0188
215,203,利,5229,R068,H0596,KE0189
216,260,勢,52E2,R000,H0518,KE0227
217,224,半,534A,R082,H0195,KE0243
218,201,参,53C2,R000,H0490,KE0263
219,284,口,53E3,R091,H0020,KE0276
220,262,台,53F0,R093,H0166,KE0280
221,243,各,5404,R000,H0438,KE0281
222,235,品,54C1,R104,H0382,KE0328
223,213,団,56E3,R000,H0749,KE0339
224,211,在,5728,R118,H0684,KE0349
225,241,基,57FA,R000,H0641,KE0354
226,231,増,5897,R121,H0741,KE0388
227,202,売,58F2,R123,H0192,KE0397
228,238,変,5909,R125,H0581,KE0401
229,244,始,59CB,R140,H0288,KE0439
230,230,官,5B98,R000,H0441,KE0469
231,264,容,5BB9,R156,H0802,KE0485
232,287,少,5C11,R164,H0143,KE0512
233,286,局,5C40,R167,H0262,KE0524
234,245,島,5CF6,R171,H0358,KE0543
235,299,工,5DE5,R174,H0113,KE0550
236,293,常,5E38,R000,H0718,KE0566
237,263,広,5E83,R182,H0114,KE0581
238,300,建,5EFA,R188,H0473,KE0600
239,251,式,5F0F,R189,H0295,KE0602
240,218,引,5F15,R190,H0077,KE0605
241,265,必,5FC5,R200,H0568,KE0767
242,226,応,5FDC,R000,H0622,KE0768
243,235,情,60C5,R000,H0719,KE0819
244,233,感,611F,R000,H0246,KE0792
245,221,所,6240,R217,H0312,KE0843
246,239,打,6253,R000,H0335,KE0854
247,236,投,6295,R000,H0357,KE0862
248,257,挙,6319,R000,H0458,KE0849
249,254,提,63D0,R000,H0753,KE0915
250,288,放,653E,R000,H0391,KE0935
251,295,料,6599,R229,H0599,KE0951
252,226,昨,6628,R241,H0486,KE0986
253,282,有,6709,R252,H0385,KE1003
254,248,朝,671D,R257,H0175,KE1009
255,253,村,6751,R261,H0052,KE1073
256,258,果,679C,R267,H0627,KE1052
257,294,校,6821,R268,H0021,KE1091
258,281,格,683C,R000,H0633,KE1088
259,206,案,6848,R270,H0418,KE1060
260,290,検,691C,R000,H0663,KE1101
261,222,次,6B21,R279,H0292,KE1120
262,269,歳,6B73,R000,H1294,KE1131
263,229,死,6B7B,R285,H0286,KE1133
264,223,水,6C34,R291,H0040,KE1151
265,220,求,6C42,R000,H0455,KE1155
266,296,沢,6CA2,R000,H1552,KE1164
267,280,流,6D41,R302,H0409,KE1200
268,261,減,6E1B,R000,H0667,KE1219
269,267,演,6F14,R000,H0621,KE1239
270,274,無,7121,R310,H0796,KE1280
271,215,物,7269,R314,H0387,KE1290
272,234,特,7279,R315,H0760,KE1293
273,298,状,72B6,R000,H0717,KE1296
274,240,男,7537,R325,H0054,KE1334
275,292,町,753A,R326,H0057,KE1335
276,283,疑,7591,R000,H0835,KE1347
277,246,直,76F4,R339,H0349,KE1377
278,279,真,771F,R341,H0514,KE1385
279,205,知,77E5,R343,H0169,KE1394
280,252,確,78BA,R000,H0634,KE1409
281,237,示,793A,R000,H0695,KE1413
282,242,私,79C1,R352,H0876,KE1428
283,289,税,7A0E,R000,H0727,KE1438
284,209,策,7B56,R000,H0339,KE1473
285,256,終,7D42,R369,H0306,KE1526
286,204,組,7D44,R000,H0160,KE1529
287,277,置,7F6E,R375,H0545,KE1466
288,273,能,80FD,R000,H0766,KE1018
289,291,藤,85E4,R000,H000,KE0000
290,297,裁,88C1,R000,H0872,KE1624
291,259,西,897F,R402,H0252,KE1638
292,228,計,8A08,R408,H0105,KE1658
293,272,談,8AC7,R000,H0543,KE1698
294,227,論,8AD6,R421,H0996,KE1700
295,255,運,904B,R444,H0231,KE0704
296,285,過,904E,R446,H0629,KE0705
297,207,道,9053,R443,H0188,KE0710
298,210,集,96C6,R473,H0309,KE1851
299,268,電,96FB,R478,H0180,KE1862
300,249,革,9769,R000,H0821,KE1877

Tuesday 16 April 2013

101 to 200 frequent kanji

The following maps frequent newspaper kanji against those in Les 500 Kanji (with the addition of Unicode values.)

freq 101 - 200
ROW,FREQ,KANJI,UNICODE,RR#
101,115,七,4E03,R007
102,101,不,4E0D,R014
103,135,世,4E16,R015
104,180,予,4E88,R020
105,178,交,4EA4,R023
106,126,以,4EE5,R032
107,103,作,4F5C,R039
108,146,,4FDD,R000
109,192,元,5143,R050
110,173,先,5148,R052
111,118,公,516C,R055
112,174,共,5171,R056
113,152,初,521D,R067
114,197,,5224,R000
115,108,,5236,R000
116,130,,52A0,R000
117,111,,52D9,R000
118,185,,52DD,R000
119,153,北,5317,R077
120,137,区,533A,R078
121,195,千,5343,R080
122,154,午,5348,R110
123,121,,5354,R000
124,172,,539F,R000
125,191,反,53CD,R088
126,122,取,53D6,R089
127,136,受,53D7,R090
128,177,名,540D,R096
129,182,向,5411,R099
130,188,,544A,R000
131,124,和,548C,R102
132,167,,5831,R000
133,139,多,591A,R129
134,151,女,5973,R136
135,187,,59D4,R000
136,144,安,5B89,R148
137,133,家,5BB6,R155
138,114,小,5C0F,R163
139,131,山,5C71,R169
140,181,川,5DDD,R172
141,128,平,5E73,R179
142,170,府,5E9C,R184
143,110,度,5EA6,R185
144,112,強,5F37,R193
145,175,,5F97,R000
146,157,心,5FC3,R309
147,132,思,601D,R206
148,104,性,6027,R203
149,116,成,6210,R214
150,119,持,6301,R223
151,155,指,6307,R223
152,159,,652F,R000
153,147,,6539,R000
154,166,教,6559,R225
155,148,数,6570,R227
156,190,文,6587,R228
157,169,書,66F8,R244
158,117,期,671F,R258
159,102,来,6765,R262
160,184,,67FB,R000
161,156,,6A29,R000
162,127,機,6A5F,R277
163,143,正,6B63,R282
164,113,気,6C17,R290
165,109,治,6CBB,R295
166,171,活,6D3B,R300
167,164,,6D3E,R000
168,200,海,6D77,R298
169,168,済,6E08,R305
170,165,,70B9,R000
171,161,産,7523,R321
172,107,用,7528,R322
173,199,画,753B,R327
174,158,界,754C,R328
175,163,百,767E,R335
176,105,的,7684,R336
177,140,県,770C,R340
178,160,第,7B2C,R362
179,162,結,7D50,R370
180,125,,7D71,R000
181,141,続,7D9A,R372
182,129,,7DCF,R000
183,196,考,8003,R382
184,106,要,8981,R403
185,176,,89E3,R000
186,149,記,8A18,R409
187,145,,8A2D,R000
188,134,話,8A71,R413
189,198,認,8A8D,R419
190,179,,8CC7,R000
191,189,,8ECD,R000
192,194,近,8FD1,R435
193,142,進,9032,R442
194,123,都,90FD,R453
195,193,重,91CD,R457
196,120,野,91CE,R458
197,150,院,9662,R469
198,183,,969B,R000
199,186,面,9762,R481
200,138,,9818,R000
Values of R000 above ( a total of 26 ) indicate that the kanji was not found in the book Les 500 Kanji.  One should note that a Japanese newspaper is more likely to have mention of politics and military matters than an introductory reader.

Monday 15 April 2013

Raimbault Rouillé 100 of 500 kanji


The 100 most frequent newspaper kanji do not all appear in Les 500 Kanji by Isabelle Raimbault and Nathalie Rouillé.

Here is one such listing with Unicode values added :

ROW,FREQ,KANJI,UNICODE,RR500#
001,2,一,4E00,R1
002,14,三,4E09,R3
003,35,上,4E0A,R12
004,97,下,4E0B,R13
005,11,中,4E2D,R17
006,95,主,4E3B,R18
007,55,九,4E5D,R9
008,18,事,4E8B,R22
009,9,二,4E8C,R2
010,31,五,4E94,R5
011,74,京,4EAC,R24
012,5,人,4EBA,R25
013,49,今,4ECA,R27
014,66,代,4EE3,R28
015,4,会,4F1A,R36
016,88,体,4F53,R41
017,39,,515A,R000
018,56,入,5165,R54
019,75,全,5168,R35
020,92,八,516B,R8
021,93,六,516D,R6
022,44,内,5185,R59
023,69,円,5186,R58
024,13,出,51FA,R64
025,24,分,5206,R65
026,27,前,524D,R70
027,62,力,529B,R71
028,73,動,52D5,R75
029,89,化,5316,R76
030,8,十,5341,R10
031,41,合,5408,R98
032,15,同,540C,R97
033,54,員,54E1,R105
034,64,問,554F,R107
035,47,四,56DB,R4
036,50,回,56DE,R110
037,3,国,56FD,R113
038,40,地,5730,R117
039,52,場,5834,R120
040,81,外,5916,R128
041,7,大,5927,R131
042,72,子,5B50,R144
043,63,学,5B66,R147
044,48,定,5B9A,R151
045,68,実,5B9F,R152
046,34,対,5BFE,R162
047,42,市,5E02,R176
048,6,年,5E74,R180
049,91,当,5F53,R165
050,26,後,5F8C,R196
051,99,意,610F,R210
052,76,戦,6226,R215
053,60,手,624B,R219
054,17,政,653F,R224
055,51,新,65B0,R230
056,46,方,65B9,R231
057,1,日,65E5,R234
058,57,明,660E,R236
059,16,時,6642,R243
060,82,最,6700,R246
061,23,月,6708,R251
062,10,本,672C,R260
063,37,東,6771,R265
064,43,業,696D,R273
065,84,,6C0F,R000
066,28,民,6C11,R289
067,71,,6C7A,R000
068,100,法,6CD5,R293
069,85,現,73FE,R319
070,86,理,7406,R318
071,29,生,751F,R320
072,90,田,7530,R323
073,32,発,767A,R333
074,76,目,76EE,R338
075,45,,76F8,R000
076,21,社,793E,R348
077,58,立,7ACB,R359
078,61,米,7C73,R364
079,94,,7D04,R000
080,79,経,7D4C,R368
081,38,者,8005,R382
082,19,自,81EA,R387
083,20,行,884C,R398
084,77,表,8868,R400
085,22,見,898B,R404
086,83,言,8A00,R407
087,87,調,8ABF,R420
088,25,,8B70,R000
089,80,通,901A,R438
090,30,連,9023,R440
091,57,,9078,R000
092,36,部,90E8,R452
093,53,金,91D1,R459
094,12,長,9577,R462
095,59,開,958B,R466
096,33,間,9593,R465
097,70,関,95A2,R467
098,96,題,984C,R484
099,98,首,9996,R493
100,65,高,9AD8,R497
A trivial number of differences are due to R et R starting with the first ten numbers from 1 to 10. But each R000 above (a total of 7) is a kanji not found in Les 500 Kanji but listed as one of the top 100 newspaper kanji.
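The R000 count is easy to check mechanically. Here is a short Python sketch using the standard `csv` module on a handful of rows in the column layout of the listing above; `missing_from_book` is a name invented for this example, and the full table would be read the same way.

```python
import csv
import io

# A few rows in the layout of the listing; the KANJI field may be empty,
# since the code point alone is enough to recover the character.
SAMPLE = """ROW,FREQ,KANJI,UNICODE,RR500#
001,2,一,4E00,R1
012,5,人,4EBA,R25
017,39,,515A,R000
065,84,,6C0F,R000
100,65,高,9AD8,R497
"""

def missing_from_book(csv_text):
    """Return the code points of rows whose RR500# column is R000."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row["UNICODE"] for row in reader if row["RR500#"] == "R000"]

missing = missing_from_book(SAMPLE)
print(len(missing), [chr(int(cp, 16)) for cp in missing])
```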

The above list was compiled using the Wakan Windows clipboard kanji viewer.




Thursday 7 March 2013

Curl Unicode parse for kanji


This small desktop utility displays a kanji given a Unicode utf-16 hex value.

The source is a text file that is easy to edit.

The utility is light enough to add to a Linux run in RAM off the SSD or an SD card of a netbook.

Given a Unicode value in a paper source, I often want to quickly get a clipboard copy of that kanji into an e-text without opening a browser or an e-dict.

 
 

 
 

 
 

Tuesday 12 February 2013

mountains leveled


These mountain kanji are also from our Henshall sorted by UCS page :


  5C71    24  mountain · %e5%b1%b1
  5C90  1121  branch off  ·  fork in road, scene, arena, theater · %e5%b2%90
  5CA9   249  boulder  ·  rock, cliff · %e5%b2%a9
  5CAC  1840  headland  ·  cape, spit, promontory · %e5%b2%ac
  5CB3  1082  point  ·  peak, mountain · %e5%b2%b3
  5CB8   248  beach · %e5%b2%b8
  5CE0  1663  mountain peak  ·  mountain pass, climax, crest, (kokuji) · %e5%b3%a0
  5CE1  1164  gorge  ·  ravine · %e5%b3%a1
  5CF0  1799  summit  ·  peak · %e5%b3%b0
  5CF6   358  island · %e5%b3%b6
  5D07  1465  adore  ·  respect, revere, worship · %e5%b4%87
  5D0E  1297  promontory  ·  cape, spit · %e5%b4%8e
  5D29  1801  crumble  ·  die, demolish, level · %e5%b4%a9
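The url-encoded column is just the UTF-8 bytes of each kanji written as percent escapes. A small Python sketch of that relationship, using the standard `urllib.parse` module (`url_encoded` is a name chosen for this example, and the lowercasing only matches the style of the listing):

```python
from urllib.parse import quote, unquote

def url_encoded(hex_value):
    """Percent-encode the UTF-8 bytes of the character at a hex code point."""
    return quote(chr(int(hex_value, 16))).lower()

print(url_encoded("5C71"))      # first row of the listing
print(unquote("%e5%b3%b6"))     # and back again from an encoded value
```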




Wednesday 16 January 2013

kanji from tree to plank


Below is the Unicode ordering of common kanji from tree to plank.

木   \u6728    H69  tree  ·  wood
未   \u672A   H794  un-  ·  not yet, hitherto, still, even now, sign of the ram, 1-3PM, 8th sign of Chinese zodiac
末   \u672B   H587  end  ·  close, tip, powder, posterity
本   \u672C    H70  book  ·  present, main, origin, true, real, counter for long cylindrical things
札   \u672D  H1304  tag  ·  paper money, counter for bonds, placard, bid
朱   \u6731  H1346  vermilion  ·  cinnabar, scarlet, red, bloody
朴   \u6734  H1819  crude  ·  simple, plain, docile
机   \u673A   H832  desk  ·  table
朽   \u673D  H1150  decay  ·  rot, remain in seclusion
杉   \u6749  H1467  cedar  ·  cryptomeria
材   \u6750   H485  lumber  ·  log, timber, wood, talent
村   \u6751    H52  village  ·  town
束   \u675F  H1535  bundle  ·  sheaf, ream, tie in bundles, govern, manage, control
条   \u6761   H716  article  ·  clause, item, stripe, streak
来   \u6765   H217  come  ·  due, next, cause, become
杯   \u676F  H1685  counter for cupfuls  ·  wine glass, glass, toast
東   \u6771   H184  east
松   \u677E  H1394  pine tree
板   \u677F   H373  plank  ·  board, plate, stage

The numbers Hnnnn are for Henshall's A Guide to Remembering … and the point is to construct a mnemonic tale with all 19 characters … preferably one you can write down later with the kanji correct and legible. The '\u' numbers are the Unicode UTF-16 values in hex.

Here is a web page with an image for the above.

The image from that page is here.




Monday 31 December 2012

flower Unicode kanji


These are kanji in UCS order from a sorted kanji web page. This segment of the Henshall basic kanji covers many of the jōyō kanji for the "grass" radical, or bushu 艸 艹.


芋   828B  1011  potato · %e8%8a%8b
芝   829D  1335  turf  ·  lawn · %e8%8a%9d
花   82B1     9  flower · %e8%8a%b1
芳   82B3  1791  perfume  ·  balmy, favorable, fragrant · %e8%8a%b3
芸   82B8   470  technique  ·  art, craft, performance, acting, trick, stunt · %e8%8a%b8
芽   82BD   434  bud  ·  sprout, spear, germ · %e8%8a%bd
苗   82D7  1740  seedling  ·  sapling, shoot · %e8%8b%97
若   82E5   886  young  ·  if, perhaps, possibly, low number, immature · %e8%8b%a5
苦   82E6   264  suffering  ·  trial, worry, hardship, feel bitter, scowl · %e8%8b%a6
英   82F1   426  England  ·  English · %e8%8b%b1
茂   8302  1850  overgrown  ·  grow thick, be luxuriant · %e8%8c%82
茎   830E  1194  stalk  ·  stem · %e8%8c%8e
茶   8336   171  tea · %e8%8c%b6
草   8349   162  grass  ·  weeds, herbs, pasture, write, draft · %e8%8d%89
荒   8352  1253  laid waste  ·  rough, rude, wild · %e8%8d%92
荘   8358  1515  villa  ·  inn, cottage, feudal manor · %e8%8d%98
荷   8377   239  baggage  ·  shoulder-pole load, bear or shoulder, load, cargo, freight · %e8%8d%b7
菊   83CA  1141  chrysanthemum · %e8%8f%8a
菌   83CC  1177  germ  ·  fungus, bacteria · %e8%8f%8c
菓   83D3  1047  candy  ·  cakes, fruit · %e8%8f%93
菜   83DC   483  vegetable  ·  side dish, greens · %e8%8f%9c
華   83EF  1046  splendor  ·  flower, petal, shine, luster, ostentatious, showy, gay, gorgeous · %e8%8f%af
落   843D   408  fall  ·  drop, come down · %e8%90%bd
葉   8449   405  leaf  ·  plane, lobe, needle, blade, spear, flat things cntr, fragment, piece · %e8%91%89
著   8457   937  renowned  ·  publish, write, remarkable, phenomenal, put on, don, wear, arrival, finish (race), suits of clothing cntr, literary work · %e8%91%97
葬   846C  1523  interment  ·  bury, shelve · %e8%91%ac
蒸   84B8   904  steam  ·  heat, sultry, foment, get musty · %e8%92%b8
蓄   84C4  1579  amass  ·  keeping a concubine, phonograph · %e8%93%84
蔵   8535   923  storehouse  ·  hide, own, have, possess · %e8%94%b5
薄   8584  1699  dilute  ·  thin, weak (tea) · %e8%96%84
薦   85A6  1499  recommend  ·  mat, advise, encourage, offer · %e8%96%a6
薪   85AA  1445  fuel  ·  firewood, kindling · %e8%96%aa
薫   85AB  1192  send forth fragrance  ·  fragrant, be scented, smoke (tobacco) · %e8%96%ab
薬   85AC   398  medicine  ·  chemical, enamel, gunpowder, benefit · %e8%96%ac
藩   85E9  1721  clan  ·  enclosure · %e8%97%a9
藻   85FB  1531  seaweed  ·  duckweed · %e8%97%bb

This selection is intended to demonstrate that it can be handy to have basic kanji in Unicode sort order.
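Producing that order needs no special tooling. In Python, for instance, `sorted()` compares strings by code point, so a plain sort of a handful of kanji already gives the Unicode order. The sample string is arbitrary and this is only a sketch:

```python
kanji = "薬花茶草葉菜"

# sorted() compares one-character strings by code point.
for ch in sorted(kanji):
    print(ch, format(ord(ch), "04X"))
```

Because the "grass" radical kanji sit close together in the CJK block, the sorted output groups them much as a radical index would.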

re: bushu is 部首


Henshall Japanese kanji (3 pages, revised)


I have revised the layout of several pages of the complete Henshall kanji set of 1,945.
  1. Sorted by UCS with url encoded value
  2. Sorted by Henshall book entry number
  3. Sorted by Unicode UTF-16
A typical row from the last would be

  4F8B   605  example  ·  custom, usage, precedent




Tuesday 25 September 2012

tell tale poem poetry

Some kanji with UCS (Unicode) and Henshall numbers ; these can be used in the Search at aule-browser.

言, 8A00,  274, say ことば

告, 544A,  481, revelation, tell, inform, announce

句, 53E5,  655, phrase, clause, sentence, passage, paragraph, counter for haiku  俳句 .

吟, 541F, 1182, versify, singing, recital

文, 6587,   68, sentence, literature, style, art, decoration, figures, plan, literary radical (no. 67)

訓, 8A13,  656, instruction, Japanese character reading, explanation, read

楷, 6977, 2126, square character style, correctness

字, 5B57,   28, character, letter, word, section of village

章, 7AE0,  318, badge, chapter, composition, poem, design

詠, 8A60, 1016, recitation, poem, song, composing

詩, 8A69,  291, poem, poetry

詞, 8A5E,  879, part of speech, words, poetry

誌, 8A8C,  880, document, records

編, 7DE8,  785, compilation, knit, plait, braid, twist, editing, completed poem, part of a book

賦, 8CE6, 1758, levy, ode, prose, poem, tribute, installment

首, 9996,  139, neck, counter for songs and poems

本, 672C,   70, book, present, main, origin, true, real, counter for long cylindrical things

帳, 5E33,  347, notebook, account book, album, curtain, veil, net, tent

課, 8AB2,  433, chapter, lesson, section, department, division, counter for chapters (of a book)

語, 8A9E,  112, word, speech, language

辞, 8F9E,  500, resign, word, term, expression

式, 5F0F,  295, style, ceremony, rite, function, method, system, form, expression

識, 8B58,  698, discriminating, know, write

軸, 8EF8, 1330, axis, pivot, stem, stalk, counter for book scrolls

行, 884C,  118, going, journey  詩の行 line of a poem

久, 4E45,  647, long time, old story

話, 8A71,  221, tale, talk  ·  compare http://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=8BDD

記, 8A18,   95, scribe, account, narrative

修, 4FEE,  704, discipline, conduct oneself well, study, master

線, 7DDA,  329, line, track

Tuesday 7 August 2012

岸 河原


From one kanji  to a compound 河原 via a known kanji  and I finally "grok" them both !

see also:
遠島
火山列島
 
at edict2 in utf-8 PLAIN HTML (page may take 30 seconds to load - no scripts)

Be sure your browser view has character encoding set to utf-8 if you have an issue.

Split HTML pages (6 ? 7 ?) in lovely free HanaMinA font later today.