[Z3-zemantic] Re: Zemantic 0.5 released
Michel Pelletier
michel at dialnetwork.com
Tue Jul 19 20:04:28 CEST 2005
On Tue, 2005-07-19 at 12:29 +0200, Olivier Grisel wrote:
> Michel Pelletier wrote:
>
> > Nothing on this particular subject yet. More recently I've been working
> > on RDFS entailment in rdflib which is a priority right now,
>
> Interesting. What kind of 'entailement' are you working on ? RDF
> validation against a schema?
RDF and RDFS entailment as described in the W3C document 'RDF Semantics'
at:
http://www.w3.org/TR/rdf-mt/
It's a very technical document, but in particular see section 7.
Basically there exist through proof a set of rules and standard RDFS
axiomatic triples that are a consequence of RDF and RDFS's model. For
example, if you know that X is a rdfs:subProperty of Y, then you can
'entail' that both X and Y's rdf:type is rdfs:Property. Much of the
theory beyond that is beyond me. ;)
Many of the rules can be applied recursively and some even infinitely so
it's a tricky issue to solve. I'm working on a general framework in
rdflib that will allow users to plug in entailment policies that are
specific to their application. There will be provided a simple "poor
man's" policy that does the basics like property and class entailment.
Libraries like this will assist agents and search languages to derive
new information from a graph that may allow them to solve a particular
goal or find certain query results.
> Yes that what I thought. But the use of some semantic base (a RDF model
> of some kind of english dictionary) would allow us to use to do semantic
> inference on the content of the indexed document with a knowledge of
> synonyms, antonyms, ... This would require to some kind of structured
> thesaurus. The OpenOffice project has some GPL thesaurus that may help
> to build this. Wordnet is probably more advanced, but not free ... oh
> wait, it looks like it's BSD-like now: http://wordnet.princeton.edu/license
> So Wordnet is probably the way to go.
I think this is a great place to start and would love to help out on
that. BTW if it's important I'd like to point out that NLTK is GPL.
> We would also probably need to define some kind of semantic distance
> between the predicates of our common sense ontology and then be able to
> do appromatively correct queries on the model using a threshold on that
> distance. I'm not really aware of the research results on this kind of
> problems. Should you have any pointer on the topic, please feel free to
> share them :)
You lost me on this one. If there is an existing ontology then we
should just skip devising the common sense ontology.
> Yes, I guess we first need to work on some toy examples to work out what
> works and what's pure utopia. I will first have a look at the NLTK
> tutorials to get a better picture of what it actually does. BTW, do you
> think we should use NLTK-lite that looks more pythonic than the
> tradional NLTK?
Good ideas all, I've never heard of NLTK-lite, but NLTK is written in
pure python, no? What could be less pythonic than that? ;)
-Michel
More information about the Z3-zemantic
mailing list