SallyHamouda posted on 03-10-2010 4:04 AM
Hi,
Thanks so much for this great work.
I have been using the GMM in Infer.NET for clustering. It works fine for small datasets with a small number of features. But when I tried it on a dataset
of 2000 points, each with 30 features, and 5 clusters, I got an
OUT OF MEMORY EXCEPTION.
To solve this problem I used shared variables: I shared the
weights, means and precisions between data chunks.
The problem is that those variables have Dirichlet, Gaussian and Wishart distributions, and shared variables in Infer.NET seem to allow only Gaussian and Gamma. When I changed
them all to Gaussians, I got the following exception:
Unhandled Exception: System.InvalidCastException: Unable to cast object of type 'MicrosoftResearch.Infer.Distributions.DistributionRefArray`2[MicrosoftResearch.Infer.Distributions.VectorGaussian,MicrosoftResearch.Infer.Maths.Vector]' to type 'MicrosoftResearch.Infer.Distributions.VectorGaussian'.
I also have a problem because the number of features is relatively large, and the shared variable examples on the Infer.NET site
only cover a GMM with one component and one feature.
So can you please tell me whether it is possible to do clustering in the way described above using Infer.NET, and how
I could use shared variables for a mixture of multiple Gaussians when the data points have a large number of features?
Thanks for your support.