1 Unsupervised RandomForest

rf1 <- randomForest(sdat, proximity = TRUE)
kern <- rf1$prox

pc1 <- prcomp(kern)
el1 <- getElbows(pc1$sdev, plot = TRUE)[1]

el2 <- getElbows(pc1$sdev, plot = FALSE)[2]

#edat <- scale(pc1$x[,1:el1], center = TRUE, scale = TRUE)
edat <- pc1$x[, 1:el1]

pairs(edat, col = gaba + 1, pch = 20, cex = 0.5)

1.1 Kernel/Proximity Matrix in heatmap

image(kern, col = viridis(255))

2 Results

2.1 1-d Heatmap

2.2 Location meda_plots

2.3 Outliers as given by randomForest

2.4 Correlation Matrix

2.5 Cumulative Variance with Elbows

2.6 Paired Hex-binned plot

2.7 Hierarchical GMM Classifications

2.8 Hierarchical GMM Dendrogram

2.9 Stacked Means

2.10 Cluster Means

3 Restricting hGMM to \(K = 2\)

Here we are restricting hierarchical GMM to only go through on level. We are comparing the cluster results to the gaba labels.

set.seed(3144)
h2 <- hmc(edat, maxDepth = 2, ccol = ccol)
h2lab <- viridis(max(h2$dat$labels$col))
h2col <- h2$dat$labels$col

3.1 K = 2 stacked means plot

p1 <- stackM(h2, ccol = "black", centered = TRUE, depth = 1)
p2 <- stackMraw(sdat, h2$dat$labels$L1, centered = TRUE, depth = 2, ccol = ccol)
grid.arrange(p1,p2, nrow=1)

3.2 Pairs plot colored by true gaba classification

cols <- c("black", "magenta")[gabaID$gaba+1]
acols <- alpha(cols, 0.35)
#pairs(h2$dat$data, pch = 19, cex = 0.7, col = acols)
plot(h2$dat$data, col = acols, pch = c(19,3)[gaba+1], cex = c(0.5,1)[gaba+1])

pairs(sdat, col = acols, pch = c(19,3)[gaba+1], cex = c(0.5,1)[gaba+1])

3.3 Pairs plot colored by hGMM cluster classification

acols2 <- alpha(h2lab[h2$dat$labels$col], 0.45)
par(bg = "gray45")
plot(h2$dat$data, pch = c(3,20)[gaba + 1], cex = 1, col = acols2)

pairs(sdat, pch = 19, cex = 0.7, col = acols2)

dev.off()

## null device 
##           1

4 Permutation test for ARI

p0 <- mclust::adjustedRandIndex(pred, gaba)
perms <- foreach(i = 1:1.5e4, .combine = c) %dopar% {
  set.seed(i*2)
  mclust::adjustedRandIndex(sample(pred), gaba)
}
pv0 <- sum(c(perms,p0) >= p0)/length(perms)

hist(perms, xlim = c(min(perms), p0 + 0.25*p0),
     main = "permutation test of ARI values", probability = TRUE)
#hist(perms, probability = TRUE)
abline(v = p0, col = 'red')

t1

##        truth
## pred    FALSE TRUE
##   FALSE   375   33
##   TRUE    333   52

5 Summary Table

measurment	value
Misclassification Rate	0.4615385
Accuracy	0.5384615
Sensitivity	0.6117647
Specificity	0.529661
Precision	0.1350649
Recall	0.6117647
ARI	0.00492
\(p\)-value for ARI	0.010333
F1-score	0.2212766
TP	52
FP	333
TN	375
FN	33

6 NDVIZ links

6.1 True Positive Links

True Positives
200 186 16
236 178 14
237 191 12
231 163 25
230 164 28
233 160 29
244 175 16
215 174 39
205 196 8
197 213 24
236 199 23
231 195 24
231 214 18
231 217 22
231 208 25
198 237 28
218 228 23
254 235 14
228 249 14
211 239 36
246 249 32
307 182 31
276 173 32
292 166 35
294 170 36
296 161 42
303 178 32
300 178 36
311 219 10
259 250 30
292 226 30
297 225 27
298 255 16
309 254 16
271 198 32
274 195 36
285 214 40
313 220 42
303 239 36
219 286 8
200 269 22
206 263 42
210 263 33
235 266 42
261 257 20
286 275 20
300 257 24
298 283 24
310 275 16
293 264 32
291 269 37
315 265 32

6.2 False Positive Links

False Positives
195 167 37
196 170 34
190 245 28
202 194 8
203 193 11
198 223 10
197 193 20
197 194 31
227 215 12
238 215 18
195 227 7
215 238 17
238 217 39
301 163 11
257 161 40
262 191 40
305 175 32
272 192 14
308 193 10
301 204 18
293 192 29
295 221 24
300 235 11
316 198 35
257 236 32
308 228 33
243 275 10
224 285 22
202 264 38
243 274 38
264 256 20
288 271 11
304 270 31

6.3 False Negative Links

False Negatives
190 181 22
191 161 43
188 171 40
191 190 32
190 183 42
197 157 15
222 155 13
201 156 22
205 158 27
228 156 20
233 154 30
235 156 31
251 159 29
207 180 6
203 182 10
207 183 9
193 188 12
202 190 10
217 189 13
214 169 24
206 179 20
200 187 21
192 180 27
202 178 31
196 189 26
210 188 19
213 185 31
234 189 12
237 188 14
234 174 19
227 172 30
251 165 19
241 169 16
231 176 18
235 179 29
227 190 26
229 186 30
230 191 29
253 177 24
195 156 35
194 158 34
194 154 38
198 154 38
205 155 32
204 159 42
219 158 32
225 155 35
254 154 37
192 160 41
222 163 38
218 169 42
221 184 34
218 184 36
237 166 34
237 161 40
252 171 32
230 177 32
228 183 38
230 186 38
247 176 36
250 179 32
241 185 36
189 207 8
189 205 22
191 217 36
191 231 36
190 253 42
199 206 12
205 216 11
206 216 23
195 218 24
220 213 18
218 216 22
227 207 11
250 207 10
249 220 10
249 195 19
248 193 21
247 206 24
236 217 26
247 212 17
251 219 23
242 215 26
250 208 24
249 215 24
210 233 8
216 233 9
206 245 14
215 247 13
192 224 20
195 232 25
210 231 22
216 237 21
218 235 26
195 253 19
200 252 27
208 244 20
231 236 8
255 227 13
235 254 9
240 241 10
254 240 14
226 226 19
233 236 26
254 224 23
247 234 24
236 251 19
254 241 28
199 193 34
195 198 34
200 198 33
204 193 36
222 198 38
198 216 39
213 220 38
226 198 36
230 204 38
238 197 41
237 207 43
251 194 38
254 200 32
251 192 41
253 201 40
250 206 40
228 215 39
230 221 37
251 212 40
200 227 37
194 232 33
208 228 33
210 232 34
210 247 35
216 250 34
223 247 40
211 249 40
229 225 39
245 238 40
236 244 35
234 240 40
237 249 43
240 254 34
241 248 43
258 156 6
265 159 6
283 158 22
297 157 8
315 157 9
310 157 20
309 156 25
266 168 13
286 165 6
262 177 8
269 172 19
258 171 28
282 175 22
278 173 29
258 182 16
266 182 21
256 186 18
262 185 18
264 188 22
264 176 28
276 183 16
281 181 28
275 184 28
311 168 14
300 191 8
299 187 12
299 163 26
301 163 31
288 170 28
311 159 34
256 160 34
261 167 33
272 166 32
274 173 42
265 182 43
276 191 35
294 161 32
295 172 36
300 164 42
309 175 34
292 177 36
288 186 34
265 206 6
266 196 14
269 206 8
267 203 12
268 223 10
257 197 28
283 196 30
256 218 17
263 212 30
266 221 28
272 208 21
286 213 30
282 216 28
294 205 14
300 205 15
309 192 6
305 196 10
301 209 6
301 209 6
296 212 8
301 216 10
299 197 31
313 193 21
308 208 18
305 223 17
311 216 20
264 231 6
285 224 7
260 244 7
260 247 11
256 242 14
264 250 8
275 243 10
284 244 8
275 252 10
261 234 20
265 239 19
258 255 28
278 254 18
286 254 24
300 245 14
312 245 8
312 226 17
309 227 29
306 234 29
313 237 24
312 233 30
290 241 22
294 243 28
302 248 24
296 250 30
310 248 26
258 203 32
259 205 40
267 204 40
287 198 40
258 220 38
296 195 38
291 204 39
300 215 42
309 218 36
272 230 43
274 236 43
265 250 37
274 242 36
276 245 36
275 251 38
277 251 36
281 248 42
289 239 32
300 232 35
292 232 41
315 230 40
299 240 34
314 245 34
315 241 37
191 274 29
188 279 31
188 256 32
191 285 37
198 266 8
194 262 22
202 258 28
220 271 20
221 268 22
211 263 26
222 262 27
201 273 22
196 279 29
200 277 28
222 279 21
214 282 18
255 266 9
233 281 15
251 285 6
253 285 8
228 263 20
240 257 20
252 262 18
242 260 26
231 274 16
227 285 19
232 277 31
247 284 16
255 283 27
249 284 31
199 260 43
216 256 34
213 265 38
217 265 40
196 274 37
204 281 40
209 284 36
244 262 38
248 268 39
243 258 41
254 271 40
234 276 37
233 280 36
285 259 6
271 274 10
263 284 11
285 276 13
286 278 14
276 281 12
267 260 17
280 260 26
272 266 29
284 282 30
294 268 8
304 272 13
313 281 10
291 263 22
301 259 19
292 263 28
298 258 24
309 263 20
317 264 20
310 272 18
258 262 42
272 260 36
276 263 39
302 261 42
316 256 36
307 267 42
293 280 32
297 275 40
291 281 42
314 272 32

RORB Data: pseudo-centroids from annotation polygons using Gaussian weighted average over 11x11x11 cubes ≈ 0.61 microns^3 followed by Unsupervised Random Forest.

2018-02-28

1 Unsupervised RandomForest

1.1 Kernel/Proximity Matrix in heatmap

2 Results

2.1 1-d Heatmap

2.2 Location meda_plots

2.3 Outliers as given by randomForest

2.4 Correlation Matrix

2.5 Cumulative Variance with Elbows

2.6 Paired Hex-binned plot

2.7 Hierarchical GMM Classifications

2.8 Hierarchical GMM Dendrogram

2.9 Stacked Means

2.10 Cluster Means

3 Restricting hGMM to \(K = 2\)

3.1 K = 2 stacked means plot

3.2 Pairs plot colored by true gaba classification

3.3 Pairs plot colored by hGMM cluster classification

4 Permutation test for ARI

5 Summary Table

6 NDVIZ links

6.1 True Positive Links

6.2 False Positive Links

6.3 False Negative Links