Showing posts with label plain HTML. Show all posts
Showing posts with label plain HTML. Show all posts

Thursday, 25 October 2012

kanjidic2 Japanese


I have revised my snapshot of the kanji with a Japanese reading in kanjidic2 as you can see at http://www.aule-browser.com/kanji/kanjidic2-all-revised.html.

This will result in some changes in other plain HTML kanji pages and in several Curl web and desktop app's as well. The present list is 12,335 characters extracted from the XML – IFF they have at least one ON- or KUN-reading.

The error arose from a missing character in some pages and edict2 app's. This is not as common a character as the top 3,000 or so - but it is in the English wiktionary at http://en.wiktionary.org/wiki/暝#Japanese.




Thursday, 11 October 2012

kanji by UCS

Here are some the the first, basic Japanese kanji sorted by their UCS value (UTF-16 codepoint)


一  丁  七  万  丈  三  上  下  不  与  世  丘  丙  両  並  中  丸  丹  主  久  乏  乗  乙  九  乱  乳  乾  事  二  亜  享  京  亭  人  仁  今

Is a particular pattern evident or helpful ?

I am in the process of adding a UCS - to- kanji -to- urlencoded - utf-8 page over at kanji.aule-browser.com which uses the Curl web content language (only a few lines of declarative script and a wee bit of procedural script required.)

UPDATE : that page with the HTML urlencoding for each character is at http://www.aule-browser.com/kanji/henshall-sorted-urlencoded.html .

A simpler plain HTML page is http://www.aule-browser.com/kanji/henshall-sorted-by-unicode.html .

Another safe, plain HTML page with no scripts, images, ads or other nuisance has the 1,945 Hernshall basic Japanese kanji sorted as they appear in the book - by their so-called Henshall number - is at http://www.aule-browser.com/kanji/henshall-sorted-by-id.html.

By viewing the page source in your browser you can see that there are no script or image elements to worry about - so you can safely copy this HTML text to your local machine to edit as you see fit.

The HTML text was generating using a Curl applet running off-line and parsing the Kanjidic2 XML.



Sunday, 9 September 2012

joyo files

I have added indexing to the HTML+Curl page for edict2 kanji when selecting a JOYO kanji at the HTML+Curl home for the Plain HTML pages for viewing the edict2 Japanese-English dictionary. The first widget is in file order (1 to 9), the second in UCS order and the third is yet to be modified (frequent kanji.)

Here is a screenshot:



Select the page by clicking on the HTML+Curl link if you have the Surge RTE browser plugin installed (available at www.curl.com ).

It still requires a tweak when viewed on a small screen.  That should not an issue when generated as an Android or iOS app using Curl's CAEDE framework.

Saturday, 8 September 2012

kanji SpinControl

I have added two kanji Spin widgets or SpinControl's at the HTML+Curl edict2 Japanese-English dictionary views page.

You can see them at the top in this screenprint:






The first spinner is for the jōyō kanji and the second holds the kanjidic2 kanji with a "frequency" value. The latter can be seen in the Curl app at http://www.aule-browser.com/kanji/kanjidic2-freq-curl.html.

Some of these pages will soon appear as Android and iOS app's thanks to the Curl CAEDE project.

Tuesday, 7 August 2012

Edict2 in plain HTML (no scripts, frames etc)

http://kanji.aule-browser.com/edict2-kanji-sliced-1-9.html

Edict2 rendered as 9 kanji pages and 3 katakana pages in plain HTML-only. These pages offer the ability to browse this extensive Japanese-English dictionary on-line.





Below is an example page:
That page is using the free HanMinA font which adds a few seconds to the initial download.


Look-up widgets to follow for HTML+JavaScript and HTML+Curl pages.


岸 河原


From one kanji  to a compound 河原 via a known kanji  and I finally "grok" them both !

see also:
遠島
火山列島
 
at edict2 in utf-8 PLAIN HTML (page may take 30 seconds to load - no scripts)

Be sure your browser view has character encoding set to utf-8 if you have an issue.

Split HTML pages (6 ? 7 ?) in lovely free HanaMinA font later today.