Recover one figure from public paper.
Material and methods
-
Data set:
GSE122083
/GSM3454528
-
R package:
Seurat
andigraph
Step 1. QC
-
Download data
-
Load data
-
Remove duplicated genes and Select max expression
|
|
- Normalizing the data
|
|
- Transform with log2()
|
|
Step 2. Identification of highly variable features (top5000 genes) for PCA analysis with Seurat
package
-
Seurat is a toolkit for quality control, analysis, and exploration of single cell RNA sequencing data.
-
Setup the Seurat Object
|
|
- PCA with
prcomp()
|
|
Step 3. k-means cluster
|
|
Step 4. KNN visualized
|
|
|
|
- Create igraph
|
|
|
|
- Define function(D,k) to enable to try different k
|
|
- Plot with different k values
|
|
In summary
-
PCA and KNN are common methods, herein, they are standard ways to implement.
-
Keep different parameters could get different results.