1 2sysfs - _The_ filesystem for exporting kernel objects. 3 4Patrick Mochel <mochel@osdl.org> 5Mike Murphy <mamurph@cs.clemson.edu> 6 7Revised: 16 August 2011 8Original: 10 January 2003 9 10 11What it is: 12~~~~~~~~~~~ 13 14sysfs is a ram-based filesystem initially based on ramfs. It provides 15a means to export kernel data structures, their attributes, and the 16linkages between them to userspace. 17 18sysfs is tied inherently to the kobject infrastructure. Please read 19Documentation/kobject.txt for more information concerning the kobject 20interface. 21 22 23Using sysfs 24~~~~~~~~~~~ 25 26sysfs is always compiled in if CONFIG_SYSFS is defined. You can access 27it by doing: 28 29 mount -t sysfs sysfs /sys 30 31 32Directory Creation 33~~~~~~~~~~~~~~~~~~ 34 35For every kobject that is registered with the system, a directory is 36created for it in sysfs. That directory is created as a subdirectory 37of the kobject's parent, expressing internal object hierarchies to 38userspace. Top-level directories in sysfs represent the common 39ancestors of object hierarchies; i.e. the subsystems the objects 40belong to. 41 42Sysfs internally stores a pointer to the kobject that implements a 43directory in the kernfs_node object associated with the directory. In 44the past this kobject pointer has been used by sysfs to do reference 45counting directly on the kobject whenever the file is opened or closed. 46With the current sysfs implementation the kobject reference count is 47only modified directly by the function sysfs_schedule_callback(). 48 49 50Attributes 51~~~~~~~~~~ 52 53Attributes can be exported for kobjects in the form of regular files in 54the filesystem. Sysfs forwards file I/O operations to methods defined 55for the attributes, providing a means to read and write kernel 56attributes. 57 58Attributes should be ASCII text files, preferably with only one value 59per file. It is noted that it may not be efficient to contain only one 60value per file, so it is socially acceptable to express an array of 61values of the same type. 62 63Mixing types, expressing multiple lines of data, and doing fancy 64formatting of data is heavily frowned upon. Doing these things may get 65you publicly humiliated and your code rewritten without notice. 66 67 68An attribute definition is simply: 69 70struct attribute { 71 char * name; 72 struct module *owner; 73 umode_t mode; 74}; 75 76 77int sysfs_create_file(struct kobject * kobj, const struct attribute * attr); 78void sysfs_remove_file(struct kobject * kobj, const struct attribute * attr); 79 80 81A bare attribute contains no means to read or write the value of the 82attribute. Subsystems are encouraged to define their own attribute 83structure and wrapper functions for adding and removing attributes for 84a specific object type. 85 86For example, the driver model defines struct device_attribute like: 87 88struct device_attribute { 89 struct attribute attr; 90 ssize_t (*show)(struct device *dev, struct device_attribute *attr, 91 char *buf); 92 ssize_t (*store)(struct device *dev, struct device_attribute *attr, 93 const char *buf, size_t count); 94}; 95 96int device_create_file(struct device *, const struct device_attribute *); 97void device_remove_file(struct device *, const struct device_attribute *); 98 99It also defines this helper for defining device attributes: 100 101#define DEVICE_ATTR(_name, _mode, _show, _store) \ 102struct device_attribute dev_attr_##_name = __ATTR(_name, _mode, _show, _store) 103 104For example, declaring 105 106static DEVICE_ATTR(foo, S_IWUSR | S_IRUGO, show_foo, store_foo); 107 108is equivalent to doing: 109 110static struct device_attribute dev_attr_foo = { 111 .attr = { 112 .name = "foo", 113 .mode = S_IWUSR | S_IRUGO, 114 }, 115 .show = show_foo, 116 .store = store_foo, 117}; 118 119Note as stated in include/linux/kernel.h "OTHER_WRITABLE? Generally 120considered a bad idea." so trying to set a sysfs file writable for 121everyone will fail reverting to RO mode for "Others". 122 123For the common cases sysfs.h provides convenience macros to make 124defining attributes easier as well as making code more concise and 125readable. The above case could be shortened to: 126 127static struct device_attribute dev_attr_foo = __ATTR_RW(foo); 128 129the list of helpers available to define your wrapper function is: 130__ATTR_RO(name): assumes default name_show and mode 0444 131__ATTR_WO(name): assumes a name_store only and is restricted to mode 132 0200 that is root write access only. 133__ATTR_RO_MODE(name, mode): fore more restrictive RO access currently 134 only use case is the EFI System Resource Table 135 (see drivers/firmware/efi/esrt.c) 136__ATTR_RW(name): assumes default name_show, name_store and setting 137 mode to 0644. 138__ATTR_NULL: which sets the name to NULL and is used as end of list 139 indicator (see: kernel/workqueue.c) 140 141Subsystem-Specific Callbacks 142~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 144When a subsystem defines a new attribute type, it must implement a 145set of sysfs operations for forwarding read and write calls to the 146show and store methods of the attribute owners. 147 148struct sysfs_ops { 149 ssize_t (*show)(struct kobject *, struct attribute *, char *); 150 ssize_t (*store)(struct kobject *, struct attribute *, const char *, size_t); 151}; 152 153[ Subsystems should have already defined a struct kobj_type as a 154descriptor for this type, which is where the sysfs_ops pointer is 155stored. See the kobject documentation for more information. ] 156 157When a file is read or written, sysfs calls the appropriate method 158for the type. The method then translates the generic struct kobject 159and struct attribute pointers to the appropriate pointer types, and 160calls the associated methods. 161 162 163To illustrate: 164 165#define to_dev(obj) container_of(obj, struct device, kobj) 166#define to_dev_attr(_attr) container_of(_attr, struct device_attribute, attr) 167 168static ssize_t dev_attr_show(struct kobject *kobj, struct attribute *attr, 169 char *buf) 170{ 171 struct device_attribute *dev_attr = to_dev_attr(attr); 172 struct device *dev = to_dev(kobj); 173 ssize_t ret = -EIO; 174 175 if (dev_attr->show) 176 ret = dev_attr->show(dev, dev_attr, buf); 177 if (ret >= (ssize_t)PAGE_SIZE) { 178 printk("dev_attr_show: %pS returned bad count\n", 179 dev_attr->show); 180 } 181 return ret; 182} 183 184 185 186Reading/Writing Attribute Data 187~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 189To read or write attributes, show() or store() methods must be 190specified when declaring the attribute. The method types should be as 191simple as those defined for device attributes: 192 193ssize_t (*show)(struct device *dev, struct device_attribute *attr, char *buf); 194ssize_t (*store)(struct device *dev, struct device_attribute *attr, 195 const char *buf, size_t count); 196 197IOW, they should take only an object, an attribute, and a buffer as parameters. 198 199 200sysfs allocates a buffer of size (PAGE_SIZE) and passes it to the 201method. Sysfs will call the method exactly once for each read or 202write. This forces the following behavior on the method 203implementations: 204 205- On read(2), the show() method should fill the entire buffer. 206 Recall that an attribute should only be exporting one value, or an 207 array of similar values, so this shouldn't be that expensive. 208 209 This allows userspace to do partial reads and forward seeks 210 arbitrarily over the entire file at will. If userspace seeks back to 211 zero or does a pread(2) with an offset of '0' the show() method will 212 be called again, rearmed, to fill the buffer. 213 214- On write(2), sysfs expects the entire buffer to be passed during the 215 first write. Sysfs then passes the entire buffer to the store() method. 216 A terminating null is added after the data on stores. This makes 217 functions like sysfs_streq() safe to use. 218 219 When writing sysfs files, userspace processes should first read the 220 entire file, modify the values it wishes to change, then write the 221 entire buffer back. 222 223 Attribute method implementations should operate on an identical 224 buffer when reading and writing values. 225 226Other notes: 227 228- Writing causes the show() method to be rearmed regardless of current 229 file position. 230 231- The buffer will always be PAGE_SIZE bytes in length. On i386, this 232 is 4096. 233 234- show() methods should return the number of bytes printed into the 235 buffer. 236 237- show() should only use sysfs_emit() or sysfs_emit_at() when formatting 238 the value to be returned to user space. 239 240- store() should return the number of bytes used from the buffer. If the 241 entire buffer has been used, just return the count argument. 242 243- show() or store() can always return errors. If a bad value comes 244 through, be sure to return an error. 245 246- The object passed to the methods will be pinned in memory via sysfs 247 referencing counting its embedded object. However, the physical 248 entity (e.g. device) the object represents may not be present. Be 249 sure to have a way to check this, if necessary. 250 251 252A very simple (and naive) implementation of a device attribute is: 253 254static ssize_t show_name(struct device *dev, struct device_attribute *attr, 255 char *buf) 256{ 257 return scnprintf(buf, PAGE_SIZE, "%s\n", dev->name); 258} 259 260static ssize_t store_name(struct device *dev, struct device_attribute *attr, 261 const char *buf, size_t count) 262{ 263 snprintf(dev->name, sizeof(dev->name), "%.*s", 264 (int)min(count, sizeof(dev->name) - 1), buf); 265 return count; 266} 267 268static DEVICE_ATTR(name, S_IRUGO, show_name, store_name); 269 270 271(Note that the real implementation doesn't allow userspace to set the 272name for a device.) 273 274 275Top Level Directory Layout 276~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 278The sysfs directory arrangement exposes the relationship of kernel 279data structures. 280 281The top level sysfs directory looks like: 282 283block/ 284bus/ 285class/ 286dev/ 287devices/ 288firmware/ 289net/ 290fs/ 291 292devices/ contains a filesystem representation of the device tree. It maps 293directly to the internal kernel device tree, which is a hierarchy of 294struct device. 295 296bus/ contains flat directory layout of the various bus types in the 297kernel. Each bus's directory contains two subdirectories: 298 299 devices/ 300 drivers/ 301 302devices/ contains symlinks for each device discovered in the system 303that point to the device's directory under root/. 304 305drivers/ contains a directory for each device driver that is loaded 306for devices on that particular bus (this assumes that drivers do not 307span multiple bus types). 308 309fs/ contains a directory for some filesystems. Currently each 310filesystem wanting to export attributes must create its own hierarchy 311below fs/ (see ./fuse.txt for an example). 312 313dev/ contains two directories char/ and block/. Inside these two 314directories there are symlinks named <major>:<minor>. These symlinks 315point to the sysfs directory for the given device. /sys/dev provides a 316quick way to lookup the sysfs interface for a device from the result of 317a stat(2) operation. 318 319More information can driver-model specific features can be found in 320Documentation/driver-api/driver-model/. 321 322 323TODO: Finish this section. 324 325 326Current Interfaces 327~~~~~~~~~~~~~~~~~~ 328 329The following interface layers currently exist in sysfs: 330 331 332- devices (include/linux/device.h) 333---------------------------------- 334Structure: 335 336struct device_attribute { 337 struct attribute attr; 338 ssize_t (*show)(struct device *dev, struct device_attribute *attr, 339 char *buf); 340 ssize_t (*store)(struct device *dev, struct device_attribute *attr, 341 const char *buf, size_t count); 342}; 343 344Declaring: 345 346DEVICE_ATTR(_name, _mode, _show, _store); 347 348Creation/Removal: 349 350int device_create_file(struct device *dev, const struct device_attribute * attr); 351void device_remove_file(struct device *dev, const struct device_attribute * attr); 352 353 354- bus drivers (include/linux/device.h) 355-------------------------------------- 356Structure: 357 358struct bus_attribute { 359 struct attribute attr; 360 ssize_t (*show)(struct bus_type *, char * buf); 361 ssize_t (*store)(struct bus_type *, const char * buf, size_t count); 362}; 363 364Declaring: 365 366static BUS_ATTR_RW(name); 367static BUS_ATTR_RO(name); 368static BUS_ATTR_WO(name); 369 370Creation/Removal: 371 372int bus_create_file(struct bus_type *, struct bus_attribute *); 373void bus_remove_file(struct bus_type *, struct bus_attribute *); 374 375 376- device drivers (include/linux/device.h) 377----------------------------------------- 378 379Structure: 380 381struct driver_attribute { 382 struct attribute attr; 383 ssize_t (*show)(struct device_driver *, char * buf); 384 ssize_t (*store)(struct device_driver *, const char * buf, 385 size_t count); 386}; 387 388Declaring: 389 390DRIVER_ATTR_RO(_name) 391DRIVER_ATTR_RW(_name) 392 393Creation/Removal: 394 395int driver_create_file(struct device_driver *, const struct driver_attribute *); 396void driver_remove_file(struct device_driver *, const struct driver_attribute *); 397 398 399Documentation 400~~~~~~~~~~~~~ 401 402The sysfs directory structure and the attributes in each directory define an 403ABI between the kernel and user space. As for any ABI, it is important that 404this ABI is stable and properly documented. All new sysfs attributes must be 405documented in Documentation/ABI. See also Documentation/ABI/README for more 406information. 407