Search the web
Sign In
New User? Sign Up
yars-dev · YARS RDF DB
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Want your group to be featured on the Yahoo! Groups website? Add a group photo to Flickr.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Very Big Repository   Message List  
Reply | Forward Message #20 of 53 |
Re: [yars-dev] Very Big Repository

Hi Marcel,

On Sat, Jun 17, 2006 at 09:58:53PM -0000, marcelferrante wrote:
> Hi everybody. I'm new in this maillist so be patient if I am repeting
> anwsers.
>
> 1 - Beyond the Berkeley DB is there some other data structure used for
> manipulate a very big repository of rdf graphs ?
> a) In information retrieval area is used inverted files and suffix
> arrays. Who groups are studyind that ?

we use inverted files (Lucene) for indexing the strings in an RDF graph.
For the index on the triples/quads, I don't immediately see how
inverted files could be used here.

> 2 - There are other groups studyind RDF persistent storage. How the
> YARS Group manager the relationship with them? (Sesame, RDF store,
> RDFdb, RAP, JENA)

We use B+ trees for a complete index on the graph topology; Redland uses
hash tables for the most common access patterns; Jena and Sesame use
RDBMS as backend storage systems; Sesame also has a native B tree
implementation; Kowari uses AVL trees.

In my experience, Sesame's native B tree implementation is quite
scalable. However, BerkeleyDB has sophisticated caching and locking,
and should be more appropriate in a multi-threaded environment.
I don't have any experience with Kowari, but it might be worth a try.

> 3 - What is the start of art for RDF persistent storage for very very
> big rdf graphs repository in your opinion?

Our version of DBLP has more than 11 million triples, and query response
times are well below 2 sec.

What's very very big rdf graphs? What datasets are you looking at?

Regards,
Andreas.



Mon Jun 19, 2006 12:31 pm

andreasharth
Offline Offline
Send Email Send Email

Forward
Message #20 of 53 |
Expand Messages Author Sort by Date

Hi everybody. I'm new in this maillist so be patient if I am repeting anwsers. 1 - Beyond the Berkeley DB is there some other data structure used for ...
marcelferrante
Offline Send Email
Jun 17, 2006
9:59 pm

Hi Marcel, ... we use inverted files (Lucene) for indexing the strings in an RDF graph. For the index on the triples/quads, I don't immediately see how ...
Andreas Harth
andreasharth
Offline Send Email
Jun 19, 2006
12:26 pm
Advanced

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help