• Home
  • Line#
  • Scopes#
  • Navigate#
  • Raw
  • Download
1
2sysfs - _The_ filesystem for exporting kernel objects.
3
4Patrick Mochel	<mochel@osdl.org>
5Mike Murphy <mamurph@cs.clemson.edu>
6
7Revised:    16 August 2011
8Original:   10 January 2003
9
10
11What it is:
12~~~~~~~~~~~
13
14sysfs is a ram-based filesystem initially based on ramfs. It provides
15a means to export kernel data structures, their attributes, and the
16linkages between them to userspace.
17
18sysfs is tied inherently to the kobject infrastructure. Please read
19Documentation/kobject.txt for more information concerning the kobject
20interface.
21
22
23Using sysfs
24~~~~~~~~~~~
25
26sysfs is always compiled in if CONFIG_SYSFS is defined. You can access
27it by doing:
28
29    mount -t sysfs sysfs /sys
30
31
32Directory Creation
33~~~~~~~~~~~~~~~~~~
34
35For every kobject that is registered with the system, a directory is
36created for it in sysfs. That directory is created as a subdirectory
37of the kobject's parent, expressing internal object hierarchies to
38userspace. Top-level directories in sysfs represent the common
39ancestors of object hierarchies; i.e. the subsystems the objects
40belong to.
41
42Sysfs internally stores a pointer to the kobject that implements a
43directory in the kernfs_node object associated with the directory. In
44the past this kobject pointer has been used by sysfs to do reference
45counting directly on the kobject whenever the file is opened or closed.
46With the current sysfs implementation the kobject reference count is
47only modified directly by the function sysfs_schedule_callback().
48
49
50Attributes
51~~~~~~~~~~
52
53Attributes can be exported for kobjects in the form of regular files in
54the filesystem. Sysfs forwards file I/O operations to methods defined
55for the attributes, providing a means to read and write kernel
56attributes.
57
58Attributes should be ASCII text files, preferably with only one value
59per file. It is noted that it may not be efficient to contain only one
60value per file, so it is socially acceptable to express an array of
61values of the same type.
62
63Mixing types, expressing multiple lines of data, and doing fancy
64formatting of data is heavily frowned upon. Doing these things may get
65you publicly humiliated and your code rewritten without notice.
66
67
68An attribute definition is simply:
69
70struct attribute {
71        char                    * name;
72        struct module		*owner;
73        umode_t                 mode;
74};
75
76
77int sysfs_create_file(struct kobject * kobj, const struct attribute * attr);
78void sysfs_remove_file(struct kobject * kobj, const struct attribute * attr);
79
80
81A bare attribute contains no means to read or write the value of the
82attribute. Subsystems are encouraged to define their own attribute
83structure and wrapper functions for adding and removing attributes for
84a specific object type.
85
86For example, the driver model defines struct device_attribute like:
87
88struct device_attribute {
89	struct attribute	attr;
90	ssize_t (*show)(struct device *dev, struct device_attribute *attr,
91			char *buf);
92	ssize_t (*store)(struct device *dev, struct device_attribute *attr,
93			 const char *buf, size_t count);
94};
95
96int device_create_file(struct device *, const struct device_attribute *);
97void device_remove_file(struct device *, const struct device_attribute *);
98
99It also defines this helper for defining device attributes:
100
101#define DEVICE_ATTR(_name, _mode, _show, _store) \
102struct device_attribute dev_attr_##_name = __ATTR(_name, _mode, _show, _store)
103
104For example, declaring
105
106static DEVICE_ATTR(foo, S_IWUSR | S_IRUGO, show_foo, store_foo);
107
108is equivalent to doing:
109
110static struct device_attribute dev_attr_foo = {
111	.attr = {
112		.name = "foo",
113		.mode = S_IWUSR | S_IRUGO,
114	},
115	.show = show_foo,
116	.store = store_foo,
117};
118
119Note as stated in include/linux/kernel.h "OTHER_WRITABLE?  Generally
120considered a bad idea." so trying to set a sysfs file writable for
121everyone will fail reverting to RO mode for "Others".
122
123For the common cases sysfs.h provides convenience macros to make
124defining attributes easier as well as making code more concise and
125readable. The above case could be shortened to:
126
127static struct device_attribute dev_attr_foo = __ATTR_RW(foo);
128
129the list of helpers available to define your wrapper function is:
130__ATTR_RO(name): assumes default name_show and mode 0444
131__ATTR_WO(name): assumes a name_store only and is restricted to mode
132                 0200 that is root write access only.
133__ATTR_RO_MODE(name, mode): fore more restrictive RO access currently
134                 only use case is the EFI System Resource Table
135                 (see drivers/firmware/efi/esrt.c)
136__ATTR_RW(name): assumes default name_show, name_store and setting
137                 mode to 0644.
138__ATTR_NULL: which sets the name to NULL and is used as end of list
139                 indicator (see: kernel/workqueue.c)
140
141Subsystem-Specific Callbacks
142~~~~~~~~~~~~~~~~~~~~~~~~~~~~
143
144When a subsystem defines a new attribute type, it must implement a
145set of sysfs operations for forwarding read and write calls to the
146show and store methods of the attribute owners.
147
148struct sysfs_ops {
149        ssize_t (*show)(struct kobject *, struct attribute *, char *);
150        ssize_t (*store)(struct kobject *, struct attribute *, const char *, size_t);
151};
152
153[ Subsystems should have already defined a struct kobj_type as a
154descriptor for this type, which is where the sysfs_ops pointer is
155stored. See the kobject documentation for more information. ]
156
157When a file is read or written, sysfs calls the appropriate method
158for the type. The method then translates the generic struct kobject
159and struct attribute pointers to the appropriate pointer types, and
160calls the associated methods.
161
162
163To illustrate:
164
165#define to_dev(obj) container_of(obj, struct device, kobj)
166#define to_dev_attr(_attr) container_of(_attr, struct device_attribute, attr)
167
168static ssize_t dev_attr_show(struct kobject *kobj, struct attribute *attr,
169                             char *buf)
170{
171        struct device_attribute *dev_attr = to_dev_attr(attr);
172        struct device *dev = to_dev(kobj);
173        ssize_t ret = -EIO;
174
175        if (dev_attr->show)
176                ret = dev_attr->show(dev, dev_attr, buf);
177        if (ret >= (ssize_t)PAGE_SIZE) {
178                printk("dev_attr_show: %pS returned bad count\n",
179                                dev_attr->show);
180        }
181        return ret;
182}
183
184
185
186Reading/Writing Attribute Data
187~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
188
189To read or write attributes, show() or store() methods must be
190specified when declaring the attribute. The method types should be as
191simple as those defined for device attributes:
192
193ssize_t (*show)(struct device *dev, struct device_attribute *attr, char *buf);
194ssize_t (*store)(struct device *dev, struct device_attribute *attr,
195                 const char *buf, size_t count);
196
197IOW, they should take only an object, an attribute, and a buffer as parameters.
198
199
200sysfs allocates a buffer of size (PAGE_SIZE) and passes it to the
201method. Sysfs will call the method exactly once for each read or
202write. This forces the following behavior on the method
203implementations:
204
205- On read(2), the show() method should fill the entire buffer.
206  Recall that an attribute should only be exporting one value, or an
207  array of similar values, so this shouldn't be that expensive.
208
209  This allows userspace to do partial reads and forward seeks
210  arbitrarily over the entire file at will. If userspace seeks back to
211  zero or does a pread(2) with an offset of '0' the show() method will
212  be called again, rearmed, to fill the buffer.
213
214- On write(2), sysfs expects the entire buffer to be passed during the
215  first write. Sysfs then passes the entire buffer to the store() method.
216  A terminating null is added after the data on stores. This makes
217  functions like sysfs_streq() safe to use.
218
219  When writing sysfs files, userspace processes should first read the
220  entire file, modify the values it wishes to change, then write the
221  entire buffer back.
222
223  Attribute method implementations should operate on an identical
224  buffer when reading and writing values.
225
226Other notes:
227
228- Writing causes the show() method to be rearmed regardless of current
229  file position.
230
231- The buffer will always be PAGE_SIZE bytes in length. On i386, this
232  is 4096.
233
234- show() methods should return the number of bytes printed into the
235  buffer. This is the return value of scnprintf().
236
237- show() must not use snprintf() when formatting the value to be
238  returned to user space. If you can guarantee that an overflow
239  will never happen you can use sprintf() otherwise you must use
240  scnprintf().
241
242- store() should return the number of bytes used from the buffer. If the
243  entire buffer has been used, just return the count argument.
244
245- show() or store() can always return errors. If a bad value comes
246  through, be sure to return an error.
247
248- The object passed to the methods will be pinned in memory via sysfs
249  referencing counting its embedded object. However, the physical
250  entity (e.g. device) the object represents may not be present. Be
251  sure to have a way to check this, if necessary.
252
253
254A very simple (and naive) implementation of a device attribute is:
255
256static ssize_t show_name(struct device *dev, struct device_attribute *attr,
257                         char *buf)
258{
259	return scnprintf(buf, PAGE_SIZE, "%s\n", dev->name);
260}
261
262static ssize_t store_name(struct device *dev, struct device_attribute *attr,
263                          const char *buf, size_t count)
264{
265        snprintf(dev->name, sizeof(dev->name), "%.*s",
266                 (int)min(count, sizeof(dev->name) - 1), buf);
267	return count;
268}
269
270static DEVICE_ATTR(name, S_IRUGO, show_name, store_name);
271
272
273(Note that the real implementation doesn't allow userspace to set the
274name for a device.)
275
276
277Top Level Directory Layout
278~~~~~~~~~~~~~~~~~~~~~~~~~~
279
280The sysfs directory arrangement exposes the relationship of kernel
281data structures.
282
283The top level sysfs directory looks like:
284
285block/
286bus/
287class/
288dev/
289devices/
290firmware/
291net/
292fs/
293
294devices/ contains a filesystem representation of the device tree. It maps
295directly to the internal kernel device tree, which is a hierarchy of
296struct device.
297
298bus/ contains flat directory layout of the various bus types in the
299kernel. Each bus's directory contains two subdirectories:
300
301	devices/
302	drivers/
303
304devices/ contains symlinks for each device discovered in the system
305that point to the device's directory under root/.
306
307drivers/ contains a directory for each device driver that is loaded
308for devices on that particular bus (this assumes that drivers do not
309span multiple bus types).
310
311fs/ contains a directory for some filesystems.  Currently each
312filesystem wanting to export attributes must create its own hierarchy
313below fs/ (see ./fuse.txt for an example).
314
315dev/ contains two directories char/ and block/. Inside these two
316directories there are symlinks named <major>:<minor>.  These symlinks
317point to the sysfs directory for the given device.  /sys/dev provides a
318quick way to lookup the sysfs interface for a device from the result of
319a stat(2) operation.
320
321More information can driver-model specific features can be found in
322Documentation/driver-api/driver-model/.
323
324
325TODO: Finish this section.
326
327
328Current Interfaces
329~~~~~~~~~~~~~~~~~~
330
331The following interface layers currently exist in sysfs:
332
333
334- devices (include/linux/device.h)
335----------------------------------
336Structure:
337
338struct device_attribute {
339	struct attribute	attr;
340	ssize_t (*show)(struct device *dev, struct device_attribute *attr,
341			char *buf);
342	ssize_t (*store)(struct device *dev, struct device_attribute *attr,
343			 const char *buf, size_t count);
344};
345
346Declaring:
347
348DEVICE_ATTR(_name, _mode, _show, _store);
349
350Creation/Removal:
351
352int device_create_file(struct device *dev, const struct device_attribute * attr);
353void device_remove_file(struct device *dev, const struct device_attribute * attr);
354
355
356- bus drivers (include/linux/device.h)
357--------------------------------------
358Structure:
359
360struct bus_attribute {
361        struct attribute        attr;
362        ssize_t (*show)(struct bus_type *, char * buf);
363        ssize_t (*store)(struct bus_type *, const char * buf, size_t count);
364};
365
366Declaring:
367
368static BUS_ATTR_RW(name);
369static BUS_ATTR_RO(name);
370static BUS_ATTR_WO(name);
371
372Creation/Removal:
373
374int bus_create_file(struct bus_type *, struct bus_attribute *);
375void bus_remove_file(struct bus_type *, struct bus_attribute *);
376
377
378- device drivers (include/linux/device.h)
379-----------------------------------------
380
381Structure:
382
383struct driver_attribute {
384        struct attribute        attr;
385        ssize_t (*show)(struct device_driver *, char * buf);
386        ssize_t (*store)(struct device_driver *, const char * buf,
387                         size_t count);
388};
389
390Declaring:
391
392DRIVER_ATTR_RO(_name)
393DRIVER_ATTR_RW(_name)
394
395Creation/Removal:
396
397int driver_create_file(struct device_driver *, const struct driver_attribute *);
398void driver_remove_file(struct device_driver *, const struct driver_attribute *);
399
400
401Documentation
402~~~~~~~~~~~~~
403
404The sysfs directory structure and the attributes in each directory define an
405ABI between the kernel and user space. As for any ABI, it is important that
406this ABI is stable and properly documented. All new sysfs attributes must be
407documented in Documentation/ABI. See also Documentation/ABI/README for more
408information.
409