ggplot2 heatmap: how to preserve the label order?

When you call factor(x) without a levels argument, the levels default to the sorted unique values, i.e. levels = sort(unique(x)).

You can override this behavior by setting levels = unique(x).

For example:

set.seed(1)
x = sample(letters, 100, replace = TRUE)
head(x, 5)

[1] "g" "j" "o" "x" "f"

levels(factor(x))

[1] "a" "b" "c" "d" "e" "f" "g" "h" "i" "j" "k" "l" "m" "n" "o" "p" "q" "r" "s"
[20] "t" "u" "v" "w" "x" "y" "z"

levels(factor(x, levels = unique(x)))

[1] "g" "j" "o" "x" "f" "y" "r" "q" "b" "e" "u" "m" "s" "z" "d" "k" "a" "w" "i"
[20] "p" "v" "c" "n" "t" "l" "h"

You can see that setting levels = unique(x) keeps the levels in the order in which each value first occurs in the data.
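
Since the output above depends on the random seed, here is a smaller sketch with fixed values that shows the same thing. The vector cog and its contents are made up for illustration; only factor(), unique() and levels() from base R are used.

```r
# Hypothetical category labels, in the order we want them to appear
cog <- c("K", "J", "C", "J", "A", "K")

f_sorted  <- factor(cog)                        # default: levels are sorted
f_ordered <- factor(cog, levels = unique(cog))  # levels in order of first appearance

levels(f_sorted)   # "A" "C" "J" "K"
levels(f_ordered)  # "K" "J" "C" "A"
```

ggplot2 lays out a discrete axis in the order of the factor levels, which is why fixing the levels also fixes the label order in the heatmap.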


If you want to keep the order exactly as it appears in the CSV file (here foo is the data frame read from the file and foomelt is its melted version):

foomelt$COG <- factor(foomelt$COG, levels = unique(as.character(foo[[1]])))
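
To show where that line fits, here is a rough end-to-end sketch. Everything about the data is an assumption: the inline CSV text, the column names s1 and s2, and the hand-built long format (used instead of a melt call so that the example only needs base R). The plot is only built if ggplot2 is installed.

```r
# Stand-in for the CSV file; column names and values are made up
foo <- read.csv(text = "COG,s1,s2
K,1,4
J,2,5
A,3,6")

# Long format built by hand (a melt call would give a similar shape)
foomelt <- data.frame(
  COG      = rep(as.character(foo[[1]]), times = ncol(foo) - 1),
  variable = rep(names(foo)[-1], each = nrow(foo)),
  value    = unlist(foo[-1], use.names = FALSE)
)

# Fix the level order to the row order of the file
foomelt$COG <- factor(foomelt$COG, levels = unique(as.character(foo[[1]])))
levels(foomelt$COG)  # "K" "J" "A"

# Build the heatmap only if ggplot2 is available
if (requireNamespace("ggplot2", quietly = TRUE)) {
  p <- ggplot2::ggplot(foomelt,
                       ggplot2::aes(x = variable, y = COG, fill = value)) +
    ggplot2::geom_tile()
  # print(p)
}
```

Note that on a discrete y axis the first level is drawn at the bottom, so if you want the first CSV row at the top of the heatmap, wrapping the levels in rev() should do it.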

Have you tried reordering the factor levels before plotting? For example:

foomelt$COG <- factor(foomelt$COG, levels = levels(foomelt$COG)[c(2, 1, 3:8)])

(I can't try it right now, so I can't be sure that it works)
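
For what it's worth, the same index-based reordering can be checked on a toy factor. The factor f below is hypothetical and has four levels, so the index vector is c(2, 1, 3:4) rather than c(2, 1, 3:8).

```r
# Toy factor with four levels: "a" "b" "c" "d"
f <- factor(c("a", "b", "c", "d"))

# Swap the first two levels, keep the rest in place
f2 <- factor(f, levels = levels(f)[c(2, 1, 3:4)])

levels(f2)        # "b" "a" "c" "d"
as.character(f2)  # the values themselves are unchanged: "a" "b" "c" "d"
```

One caveat: the index vector has to cover every level. Any level left out of it is dropped, and the corresponding values become NA.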
