Today I created a GDBM database for the Stanford TAP knowledge base. Some statistics: 216,000 input lines -- 10 MB; 19 minutes; 27,000 persons; 120 Properties; 532...
FYI The hardest part, filtering out the concepts that need attention from approx. 50,000 total concepts, is done automatically by the mke commands do hotop od...
FYI I am making very rapid progress in connecting the last 50 nogenus concepts. I am using mke assertions and hierarchy editing commands: species isa genus; ...
Some general thoughts on ke/mke data structures: 1) the current mke problem a) RAM efficient -- only one GDBM table per relation -- no tables per concept b) I...
FYI This is how I created the 4 subhierarchy databases that you wanted. # online: save hierarchy database $ cd mke/db/usecs/entity $ mkzip E18 # download and...
FYI I'm dusting off my old ke implementation of relations. Definition includes key, e.g. gs is relation with arraymode = gdbm, arraykey = "$1_$2", format =...
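A minimal sketch of the composite-key idea, assuming that arraykey = "$1_$2" means the first and second arguments of the relation joined by an underscore. A plain Python dict stands in for the GDBM table here; make_key, put, get, and the sample values are hypothetical names chosen for illustration and are not part of ke.

```python
# Hypothetical illustration only: a dict stands in for a GDBM table.

def make_key(arg1: str, arg2: str) -> str:
    """Build a key following the "$1_$2" pattern (assumed meaning)."""
    return f"{arg1}_{arg2}"

def put(table: dict, arg1: str, arg2: str, value: str) -> None:
    """Store a value under the composite key for (arg1, arg2)."""
    table[make_key(arg1, arg2)] = value

def get(table: dict, arg1: str, arg2: str, default=None):
    """Fetch the value for (arg1, arg2), or default if absent."""
    return table.get(make_key(arg1, arg2), default)

# Sample data is made up for the example.
gs_table = {}
put(gs_table, "animal", "dog", "example value")
```

One table per relation with a composite key keeps the number of open tables small, at the cost of building the key string on every access.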
Looking good. No GDBM crash. hogenus table is only 13 MB. Access time is about 1 sec. Dick McCullough http://mkrmke.org ... From: Richard H. McCullough To:...
There's a clear GDBM garbage collection problem. Can Unicon tell GDBM to collect garbage? Here are some numbers. I reran TAP with hoKeyType = concept. cd...
After a little experimentation, I chose to run my collect every 25,000 lines. Processed http://mkrmke.net/knowledge/tap/newtap.mkr successfully: 216362 lines...
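The periodic collect described above can be sketched roughly as follows. This is an assumption-laden illustration in Python, not the Unicon code used by mke: process_lines, handle, and collect are hypothetical names, and collect is whatever compaction call the storage layer offers (with Python's dbm.gnu module that could be the table's reorganize() method).

```python
from typing import Callable, Iterable

def process_lines(lines: Iterable[str],
                  handle: Callable[[str], None],
                  collect: Callable[[], None],
                  interval: int = 25_000) -> int:
    """Process every line, calling collect() after each `interval` lines.

    Returns the number of lines processed.
    """
    count = 0
    for count, line in enumerate(lines, start=1):
        handle(line)
        if count % interval == 0:
            collect()  # hypothetical garbage-collection hook
    return count

# Stand-in callbacks: record lines seen and when collect() fired.
seen = []
collections = []
total = process_lines((f"line {i}" for i in range(10)),
                      seen.append,
                      lambda: collections.append(len(seen)),
                      interval=4)
```

The interval is a trade-off: collecting more often keeps the files smaller but adds run time, so the value is best chosen by experiment, as was done here.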
Hi Sandro This is a brief addendum to the information in my Aug/10/2009 email. I have emphasized self-documentation in my new mke command, even including a list of...
FYI ODBC is now correctly configured on my Windows XP desktop, as verified by this test program. Unicon open() aborts if user,password arguments are not present. ...
FYI I added a new option mke --collect which does a collect on every GDBM table. For the TAP KB, the size of the database .zip file was reduced from 52 MB to...
This new help file http://mkrmke.net/help/gdbmtable.txt documents the GDBM tables used by the mke program. Note that most tables have + and - values associated...
This new help file http://mkrmke.net/help/RegularExpression.txt documents the regular expression match used by the hofind command of mke. Dick McCullough ...
mke now includes the individual or concept distinction. isu/isp relate individual unit and genus; iss/isg relate species and genus; isa/isc relate individual...
FYI I split the opencyc-latest.owl file into 8 parts so I could use the U. of Manchester syntax converter. Results are in http://mkrmke.net/knowledge/opencyc/....
Exact matches are faster & simpler than regular expression matches. I decided to use separate commands for them: do hofind od concept done; -- exact match do...
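To illustrate why the two kinds of match differ in cost, here is a small Python sketch. It assumes a keyed table of concept names (a dict stands in for the GDBM table); find_exact, find_regex, and the sample concepts are hypothetical and do not reproduce the mke commands themselves.

```python
import re

def find_exact(table: dict, name: str) -> list:
    """Exact match: a single keyed lookup, no scan of the table."""
    return [name] if name in table else []

def find_regex(table: dict, pattern: str) -> list:
    """Regular expression match: every key has to be tested."""
    rx = re.compile(pattern)
    return [key for key in table if rx.search(key)]

# Made-up sample data for the example.
concepts = {"dog": 1, "dogfish": 2, "cat": 3}

exact_hits = find_exact(concepts, "dog")
regex_hits = find_regex(concepts, "^dog")
```

The exact lookup touches one key, while the regular expression version walks the whole table, which is the reason for keeping the two as separate commands.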
It is difficult to use this file. Whenever I want to check something, I am faced with a name so long I cannot possibly remember it. I can only copy/paste the...
After correcting several bugs in owl2mkr and mke, the processing times look very different. I also had to do GDBM garbage collections more often -- collect on...
Regarding GESS -- the mke online program serves the following functions: For each view [database] 1) stores propositions in GDBM tables 2) provides numerous...
I have merged all GDBM files into the opencyc/latest database, which can be viewed at http://mkrmke.net/cgi/mke.php The final .zip file for the database is...
What is the meaning of a comma in a concept name? example from opencyc/latest view rhm@rhm8200 /home/knowledge/opencyc $ mke ke$ do homatch od ^rdfs:label...
Sorry, false alarm. My output procedure, hoprint, created a comma-separated list of concepts that have the same rdfs:label. Dick McCullough http://mkrmke.org ...
Since there is a one-to-one correspondence between concepts and their cyc:CycAnnotations_v1#label attributes, I have modified mke to display output using the...