Search the web
Sign In
New User? Sign Up
billiontriples · The Billion Triples Challenge
? Already a member? Sign in to Yahoo!

Yahoo! Groups Tips

Did you know...
Show off your group to the world. Share a photo of your group with us.

Best of Y! Groups

   Check them out and nominate your group.
Having problems with message search? Fill out this form to ensure your group is one of the first to be migrated to the new message search system.

Messages

  Messages Help
Advanced
Messages 40 - 72 of 140   Oldest  |  < Older  |  Newer >  |  Newest
Messages: Simplify | Expand   (Group by Topic) Author Sort by Date ^
40
Dear All, In the past few days we had talked to several of you about providing data for the billion triples challenge. I would like to start a brief discussion...
Peter Mika
serendipity588
Offline Send Email
Feb 1, 2008
5:22 pm
41
Dear All, We are looking for persons or organizations who would like to offer their help in hosting the Billion Triples data set. This is an important part of...
Peter Mika
serendipity588
Offline Send Email
Feb 1, 2008
5:23 pm
42
Hi Peter, I vote for option # 2, Jans...
jans.aasman
jannesaasman
Offline Send Email
Feb 1, 2008
6:10 pm
43
My two cents: In the spirit of RDF, why not provide a 'directory' triple file that has resources identifying each file and provides timestamps, provenance etc...
N. Sivaramakrishnan
k2_181
Offline Send Email
Feb 1, 2008
6:16 pm
44
Hi, I'm new to this discussion list. I will introduce myself, I'm Marc-Alexandre Nolin from the Bio2RDF project (http://bio2rdf.org). His the billions triples...
Marc-Alexandre Nolin
marc_alexand...
Offline Send Email
Feb 1, 2008
6:30 pm
45
Hi, we can provide one such data hosting within Sindice. let us know when/how/what and we'll get it running. Giovanni ... important ... host...
gtummarello
Offline Send Email
Feb 1, 2008
6:31 pm
46
Hi, ... triples ... Turtle ... as already discussed, I'd prefer this solution. Filenames in the ZIP archive are the url-encoded URI of the file. Actually,...
andreasharth
Offline Send Email
Feb 1, 2008
7:17 pm
47
... Jim & Peter, As we do with DBpedia[1][2], we are happy to be one of hopefully numerous RDF data store providers for this effort. Count OpenLink Software in...
Kingsley Idehen
kidehen
Offline Send Email
Feb 1, 2008
10:37 pm
48
What licensing terms will the data be issued under? I encourage this project to adopt the ODC Public Domain Dedication and Licence, a licence that Talis and...
Ian Davis
ianalchemy
Offline Send Email
Feb 2, 2008
3:25 pm
49
Following some inquiries, i'd like to clarify that its not the main Sindice infrastructure providing a sparql endpoint (e.g. over the entire dataset), its just...
gtummarello
Offline Send Email
Feb 2, 2008
10:13 pm
50
Ian, good point, we will work hard to make sure all the data is freely sharable and displayable, having a good license that makes that clear would make a lot...
Jim Hendler
james.hendler
Offline Send Email
Feb 6, 2008
5:58 pm
51
Hi Andreas, I like this solution as well, the only thing I'm slightly worried about now is what happens when you unzip a large number of files. My extended ...
Peter Mika
serendipity588
Offline Send Email
Feb 7, 2008
4:48 pm
52
Hi Peter, ... from my experience, file systems will have trouble at some point when there are too many files around. Thus, we avoid writing individual files ...
Andreas Harth
andreasharth
Offline Send Email
Feb 7, 2008
6:29 pm
53
Hi list, ... I quite like this last solution for one, very selfish reason: this is very similar to the way the cache of Watson is organized. For example, ...
M.Daquin
mathieu_daquin
Offline Send Email
Feb 7, 2008
7:32 pm
54
http://www.sdforum.org/index.cfm?fuseaction=Page.viewPage <http://www.sdforum.org/index.cfm?fuseaction=Page.viewPage&pageId=656&parent ID=483&nodeID=1>...
Jeff Pollock
jeff_pollock
Offline Send Email
Feb 8, 2008
5:30 pm
56
Dear All, After some long and careful consideration, we have made the decision not to invent our own format for exchanging data but to rely on an existing ...
Peter Mika
serendipity588
Offline Send Email
Feb 27, 2008
3:15 pm
57
Hi Peter, I'm not entirely sure what you are going to give us access to. You (if everything goes right at Yahoo) will give us access to a 100 G crawl in...
jans.aasman
jannesaasman
Offline Send Email
Feb 27, 2008
4:32 pm
58
Hi Jans, The plan is to have the entire dataset available for download in the WARC format as a set of files. (Some users may have limitations storing files...
Peter Mika
serendipity588
Offline Send Email
Feb 27, 2008
4:42 pm
59
thanks for the clarification, jans...
Jans Aasman
jannesaasman
Offline Send Email
Feb 27, 2008
9:03 pm
60
Hello Peter, Do we have any codes written in Jena? - Amit ... the ... storing ... crawls. ... do if ... HTTP ... access ... access to a ... on ... an existing ...
crossthelimit
Online Now Send Email
Feb 28, 2008
3:49 pm
61
Hi Amit, No, I don't as I'm not familiar with Jena. But basically the MeasurableInputStream that you get as a result of the response.contentAsStream() call on...
Peter Mika
serendipity588
Offline Send Email
Feb 28, 2008
3:56 pm
62
Thnx for the info. - Amit ... that you ... download in ... limitations ... of ... response. The ... need to ... the ... GB. ... us ... based ... the ... on ......
crossthelimit
Online Now Send Email
Feb 28, 2008
4:10 pm
63
** our apologies if you receive multiple copies of this message ** ================================================================== CALL FOR PAPERS ESWC 2008...
Giovanni Tummarello
gtummarello
Offline Send Email
Feb 29, 2008
12:11 pm
64
ccsptutorial.info is sites for certification http://ccsptutorial.info/...
help.ittutor
Offline Send Email
Mar 1, 2008
11:31 am
65
REMINDER Subject: Are Scalable Graph Data Applications Possible? A Look at C-Store, Java, and Data Grid Approaches to Semantic Web Applications ...
Jeff Pollock
jeff_pollock
Offline Send Email
Mar 3, 2008
6:31 pm
67
All- Peter feels that we now have the collection and distribution of the triples underway, which means he gets to make me do some work finally... My role at...
Jim Hendler
james.hendler
Offline Send Email
Mar 26, 2008
5:23 pm
68
Here's my views: Triple Store: the big problem with semantic web, no matter how big promises it makes, is the amount of triples that can be stored and dealt...
crossthelimit
Online Now Send Email
Mar 27, 2008
11:27 am
69
Dear Jim, dear all May I propose another measuring criteria or facet of the challenge: Can a user interactively do something useful with the data? I think that...
avi.bernstein
Offline Send Email
Mar 27, 2008
7:40 pm
70
Yes, good point Avi! Sent from my iPhone...
Jim Hendler
james.hendler
Offline Send Email
Mar 27, 2008
8:37 pm
72
Hi All, The CFP and the dataset for the Billion Triples Challenge have been posted at [1]. Please let us know of any immediate problems you see with accessing...
Peter Mika
serendipity588
Offline Send Email
May 6, 2008
11:36 am
Messages 40 - 72 of 140   Oldest  |  < Older  |  Newer >  |  Newest
Advanced
Add to My Yahoo!      XML What's This?

Copyright © 2009 Yahoo! Inc. All rights reserved.
Privacy Policy - Terms of Service - Guidelines - Help