visualizing generated clusters by projecting to lower dimensions?
Message #23 of 120
Re: [simpoint] visualizing generated clusters by projecting to lower dimensions?

Great, thanks!  I'll let you know if either approach works and what the differences between the two are.

Additionally, since Excel can't create XYZ bubble charts, do you know of any easy-to-use, free software that can plot data in three dimensions?  Ideally the output would look professional enough to be used in industry publications.

Thanks,
Robert


On 4/26/06, Jeremy Lau <jl@...> wrote:
> I'd like to know how to take an existing clustering and without
> changing the clusters themselves I would like to project the cluster
> centerpoints onto lower dimensions (3 or 2 ideally) to help visualize
> the size and relative proximity of clusters chosen.  Is there a way to
> set the dimensionality used in the random linear projection without
> altering the clusters chosen?  Or alternatively, is there a way to
> load an existing vector file and reproject to lower dimensions?

You can load a vector file and reproject it to lower dimensions with
SimPoint. Re-run SimPoint with "-k 1 -iters 1" to effectively disable
clustering, use the "-dim" option to project to 2 or 3 dimensions, and
use the "-saveVectorsTxtFmt" option to save the projected vectors. You
can then combine these low-dimensional reprojected vectors with the
cluster assignments from your initial SimPoint run to visualize how
vectors are assigned to clusters.
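
As a rough sketch of that re-run, the argument list could be assembled in Python as below. Only the "-k", "-iters", "-dim" and "-saveVectorsTxtFmt" options come from the message above; the binary name `simpoint`, the `-loadFVFile` input option and the file names in the usage line are assumptions for illustration, as is the idea that "-saveVectorsTxtFmt" takes an output file name. Check all of them against your own SimPoint build before running anything.

```python
import shlex


def reprojection_command(input_args, dim, out_path, binary="simpoint"):
    """Build the argument list for a visualization-only SimPoint run.

    input_args: whatever options you normally use to load your vector file
                (supplied by the caller; not specified in the thread).
    dim:        target dimensionality, 2 or 3 for plotting.
    out_path:   hypothetical output file for the projected vectors.
    """
    if dim not in (2, 3):
        raise ValueError("dim should be 2 or 3 for plotting")
    return [
        binary,
        *input_args,
        "-k", "1",          # a single cluster ...
        "-iters", "1",      # ... and a single iteration: clustering is effectively off
        "-dim", str(dim),   # project to 2 or 3 dimensions
        "-saveVectorsTxtFmt", out_path,  # assumed to take an output file name
    ]


# Usage: the input option and file names here are assumptions, not from the thread.
cmd = reprojection_command(["-loadFVFile", "trace.bb"], 3, "vectors3d.txt")
print(shlex.join(cmd))
```

The function only builds the command; passing the list to `subprocess.run` would be one way to execute it once the options are confirmed.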

This technique can't be used to reproject the cluster centers, since
the original clustering run used a different projection matrix, and
cluster centers are defined in the projected space. It should be
possible (although I haven't tried) to save the projected vectors and
centers from the original clustering run, and then use the technique
above to project the already-projected vectors and centers to 2 or 3
dimensions.

So with the first idea, the original vectors are projected twice, in
two independent runs: once to a higher dimensionality for clustering,
and once to a lower dimensionality for visualization. With the second
idea, the original vectors are projected once to a higher
dimensionality for clustering, and the resulting projected vectors are
then projected again for visualization.

-jeremy
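
The difference between the two ideas above can be illustrated with a small, self-contained sketch. This is not SimPoint's code: the matrix entries drawn uniformly from [-1, 1], the sizes (50, 15 and 3 dimensions), the alternating stand-in labels and the use of a plain unweighted mean as the cluster center are all assumptions made for illustration. It shows that under the second idea, a center saved from the clustering run can be carried into the plotting space along with its vectors, because the second projection is linear.

```python
import random


def random_projection(rows, cols, rng):
    """Random linear projection matrix (entries in [-1, 1], an assumption)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def project(vectors, matrix):
    """Multiply each row vector by the projection matrix."""
    cols = len(matrix[0])
    return [
        [sum(v[i] * matrix[i][j] for i in range(len(v))) for j in range(cols)]
        for v in vectors
    ]


def mean(vectors):
    """Component-wise mean of a list of equal-length vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


rng = random.Random(0)
original = [[rng.random() for _ in range(50)] for _ in range(20)]

# Clustering run: project 50 -> 15 dimensions; centers live in this space.
clustered_space = project(original, random_projection(50, 15, rng))
labels = [i % 2 for i in range(20)]  # stand-in for real cluster assignments
center0 = mean([v for v, l in zip(clustered_space, labels) if l == 0])

# Idea 1: a separate 50 -> 3 projection of the original vectors.
# center0 has 15 components, so it cannot be passed through this matrix.
idea1_vectors = project(original, random_projection(50, 3, rng))

# Idea 2: project the already-projected vectors AND the center, 15 -> 3.
second = random_projection(15, 3, rng)
idea2_vectors = project(clustered_space, second)
idea2_center0 = project([center0], second)[0]

# By linearity, the projected center is the mean of its projected members.
member_mean = mean([v for v, l in zip(idea2_vectors, labels) if l == 0])
print(idea2_center0)
print(member_mean)
```

The two printed vectors agree up to floating-point rounding, which is what keeps vectors and centers consistent in the plot under the second idea.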



Thu Apr 27, 2006 12:34 am

morton2001
Messages in this thread, sorted by date:

morton2001, Apr 24, 2006 1:10 am
Hello all, and thanks for your efforts on this project. I'd like to know how to take an existing clustering and without changing the clusters themselves I...

Jeremy Lau (aoxommoxoa), Apr 27, 2006 12:07 am
... You can load a vector file and reproject it to lower dimensions with SimPoint. Just re-run SimPoint with "-k 1 -iters 1" to effectively disable clustering,...

Robert Morton (morton2001), Apr 27, 2006 12:38 am
Great, thanks! I'll let you know if either approach works and what the differences between the two are. Additionally, since Excel can't create XYZ bubble...

Jeremy Lau (aoxommoxoa), Apr 27, 2006 1:11 am
... We used Matlab for the 3D projected-vector graphs you can find in some of our publications. Gnuplot works too, but the results aren't as pretty. -jeremy...