Skip to main content

Text chopping and glueing.


 

 

 

# !/bin/sh
# the next line restarts using tclsh \
exec tclsh "$0" "$@"

package require Gnocl

namespace eval text {}

## split text into enumerated list of paragraphs
# @param    txt        block of text to split
# @returns    enumerated list of paragraphs.

proc text::paras { txt } {
    set i 0
    foreach line [split $txt \n] {
        if { [string is space $line] } {
            incr i
        } else {
            dict append res $i "$line "
        }  
    }

    return $res
}

## split text into blocks based upon puncutation marks
# @param    txt        block of text to split
# @param    marks    valid list of punctuation marks
# @returns    enumerated list of blocks.

proc text::blocks {txt marks} {
    set i 0
    foreach ch [split $txt ""] {
        if { [string first $ch $marks] != -1 } { incr i}
        dict append res $i $ch
        if { [string first $ch $marks] != -1 } { incr i}
    }
    return $res
}

## concatenate enumerated list of text blocks
# @param    str
# @returns    concated text

proc text::recombine {str} {
    for {set k 0} {$k < [dict size $str]} {incr k} {
        append res [dict get $str $k]
    }
    return $res
}

## convenience wrapper around namespace procs
# @param    cmd
# @param    args
# @returns    formatted string
proc text {cmd args} {
    # check for valid command
    if { [lsearch [namespace eval ::text:: info procs] $cmd] < 0 } {
        set distanceToTop [info level]
        for {set i 0} {$i < $distanceToTop} {incr i} {
            set callerlevel [expr {$distanceToTop - $i}]
            append res [info level $callerlevel]\n
        }
        puts stderr "Error! No such command $cmd:\n$res"
        exit 0 }
    
    # call required command
    return [text::$cmd {*}$args]
}


#~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
# demo
#~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~


set txt(EN) \
{It was the best of times, it was the worst of times, it was the age of
wisdom, it was the age of foolishness, it was the epoch of belief, it
was the epoch of incredulity, it was the season of Light, it was the
season of Darkness, it was the spring of hope, it was the winter of
despair, we had everything before us, we had nothing before us, we were
all going direct to Heaven, we were all going direct the other way--in
short, the period was so far like the present period, that some of its
noisiest authorities insisted on its being received, for good or for
evil, in the superlative degree of comparison only.

There were a king with a large jaw and a queen with a plain face, on the
throne of England; there were a king with a large jaw and a queen with
a fair face, on the throne of France. In both countries it was clearer
than crystal to the lords of the State preserves of loaves and fishes,
that things in general were settled for ever.

It was the year of Our Lord one thousand seven hundred and seventy-five.
Spiritual revelations were conceded to England at that favoured period,
as at this. Mrs. Southcott had recently attained her five-and-twentieth
blessed birthday, of whom a prophetic private in the Life Guards had
heralded the sublime appearance by announcing that arrangements were
made for the swallowing up of London and Westminster. Even the Cock-lane
ghost had been laid only a round dozen of years, after rapping out its
messages, as the spirits of this very year last past (supernaturally
deficient in originality) rapped out theirs. Mere messages in the
earthly order of events had lately come to the English Crown and People,
from a congress of British subjects in America: which, strange
to relate, have proved more important to the human race than any
communications yet received through any of the chickens of the Cock-lane
brood.}

set txt(ZH) \
{大哉智度!萬聖資通,咸宗以成也。地合日照,無法不周,不恃不處,累彼有名,既外有名,亦病無形,兩忘玄莫,喟然無主,此智之紀也。

夫永壽莫美乎上乾,而齊之殤子;神偉莫美於凌虛,而同之[仁-二+肙]滯;至德莫大乎真人,而比之朽種;高妙莫大乎世雄,而喻之幻夢。

由此論之,亮為眾聖宗矣。何者?執道御有,卑高有差,此有為之域耳;非據真如、遊法性、冥然無名也。據真如、遊法性、冥然無名者,智度之奧室也。

名教遠想者,智度之蘧廬也。然在乎證者,莫不[貝*賓]其生無而惶胘;存乎邇者,莫不忿其蕩冥而誕誹。

道動必反,優劣致殊,眩誹不其宜乎!

不其宜乎!}


set marks(EN) [list .,\;:!?()]
set marks(ZH) [list ,。(;)?!]

set buff [text::paras $txt(EN)]

set blocks [text::blocks [dict get $buff 1] $marks(EN) ]

puts [dict get $buff 1]\n~~~~~
puts $blocks\n~~~~~
puts [text::recombine $blocks]\n~~~~~

# ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
set paras [text paras $txt(ZH)]
set blocs [text blocks [dict get $paras 1] $marks(ZH) ]
puts $blocs
puts [text recombine $blocs]
text xx
 

Comments

Popular posts from this blog

gnocl::calendar

Given this module some attention today. Added some of the more package wide options to the module and created customised handler for setting the month. (For some odd reason months are are counted 0-11 whereas days are 1-31.) There's still a little more to do to this one including the addition of code to store diary details. Here's the working test script to show the range of options at work. The percentage substitution string item %e explores something that I've been toying with, the name of the signal/event that initiated the call. Ok, a script can keep its own internal trace but who knows, it might prove useful. #--------------- # calendarTest.tcl #--------------- # Author:   William J Giddings # Date:     07/05/09 #--------------- #!/bin/sh # the next line restarts using tclsh \ exec tclsh "$0" "$@" #--------------- package require Gnocl set cal [gnocl::calendar] $cal configure -day 8 -month 7 -year 1956 $cal configure -rowHeight 1 -colWidth 1 $ca...

Gnocl Dashboard

Over the past few programming sessions I've been working on producing a central point, a dashboard, around which it's possible to see the various Gnocl widgets and commands in operation. In many ways like the demo script which shipped with the earlier releases of Gnocl but offers much more. The introspection functionality provides details of the various options and sub-commands of each Gnocl procedure which are displayed under the associated tab. Sample scripts are included for each item which offers newcomers a clearer insight into how make the most of what's on offer.

Getting Widget Style Properties

Until the move over to Gtk4, Gnocl is still built against the Gtk 2.21 libraries. One of the inconveniences of Gtk is getting and setting widget style settings which are considered to be set globally by the desktop style settings and not for the programmer to tinker around with. Needless to say, there are times when different defaults are preferred, largely to draw the users attention to 'something a bit different'. The function gtk_widget_modify_font  is a convenience function to set the widget basefont as shown in this snippet from the button.c module,  if ( options[baseFontIdx].status == GNOCL_STATUS_CHANGED ) { GtkWidget *label; label = gnoclFindChild ( GTK_WIDGET ( para->button ), GTK_TYPE_LABEL ); PangoFontDescription *font_desc = pango_font_description_from_string ( Tcl_GetString ( options[baseFontIdx].val.obj ) ); gtk_widget_modify_font ( GTK_WIDGET ( label ), font_desc ); pango_font_description_free ( font_desc ); } Unfortunately, there's no d...