1 pg undersized health warn in rook ceph on single node cluster (minikube)


As mentioned in the question, you should change your CRUSH failure domain type to osd, which means your data will be replicated between OSDs rather than between hosts. By default the failure domain is host, and when you have only one host there are no other hosts to replicate your data to, so your pg will always stay undersized.

You should set osd crush chooseleaf type = 0 in your ceph.conf before you create your monitors and OSDs.

This will replicate your data between OSDs rather than hosts.
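As a rough sketch, the ceph.conf change could look like the lines below (placing it under [global] is an assumption; adjust to wherever your deployment keeps its global options). If ceph is deployed through rook, the same option would normally be delivered through rook's configuration override mechanism rather than by editing ceph.conf on the node directly.

[global]
# replicate across OSDs instead of hosts (chooseleaf type 0 = osd, 1 = host)
osd crush chooseleaf type = 0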


I came across this problem installing ceph using rook (v1.5.7) with a single data-bearing host having multiple OSDs.

The install shipped with a default CRUSH rule, replicated_rule, which uses host as the failure domain:

$ ceph osd crush rule dump replicated_rule
{
    "rule_id": 0,
    "rule_name": "replicated_rule",
    "ruleset": 0,
    "type": 1,
    "min_size": 1,
    "max_size": 10,
    "steps": [
        {
            "op": "take",
            "item": -1,
            "item_name": "default"
        },
        {
            "op": "chooseleaf_firstn",
            "num": 0,
            "type": "host"
        },
        {
            "op": "emit"
        }
    ]
}

I had to find the pool name associated with the pg that was "undersized"; luckily, in a default rook-ceph install, there's only one:

$ ceph osd pool ls
device_health_metrics
$ ceph pg ls-by-pool device_health_metrics
PG   OBJECTS  DEGRADED  ...  STATE
1.0        0         0  ...  active+undersized+remapped

And to confirm the pg is using the default rule:

$ ceph osd pool get device_health_metrics crush_rule
crush_rule: replicated_rule

Instead of modifying the default CRUSH rule, I opted to create a new replicated rule, this time specifying the osd (aka device) bucket type (docs: CRUSH map Types and Buckets), while keeping the default CRUSH root of default:

# osd crush rule create-replicated <name> <root> <type> [<class>]
$ ceph osd crush rule create-replicated replicated_rule_osd default osd
$ ceph osd crush rule dump replicated_rule_osd
{
    "rule_id": 1,
    "rule_name": "replicated_rule_osd",
    "ruleset": 1,
    "type": 1,
    "min_size": 1,
    "max_size": 10,
    "steps": [
        {
            "op": "take",
            "item": -1,
            "item_name": "default"
        },
        {
            "op": "choose_firstn",
            "num": 0,
            "type": "osd"
        },
        {
            "op": "emit"
        }
    ]
}

And then assigning the new rule to the existing pool:

$ ceph osd pool set device_health_metrics crush_rule replicated_rule_osd
set pool 1 crush_rule to replicated_rule_osd
$ ceph osd pool get device_health_metrics crush_rule
crush_rule: replicated_rule_osd

Finally confirming pg state:

$ ceph pg ls-by-pool device_health_metrics
PG   OBJECTS  DEGRADED  ...  STATE
1.0        0         0  ...  active+clean
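Note that any pools created later will still pick up the host-based replicated_rule by default, so the same reassignment applies to them too. As a sketch (the pool name replicapool below is only a placeholder for whatever additional pool exists in your cluster):

# assign the OSD-level rule to any other pool that reports undersized pgs
$ ceph osd pool set replicapool crush_rule replicated_rule_osd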