- Busybox TODO
- Stuff that needs to be done. This is organized by who plans to get around to
- doing it eventually, but that doesn't mean they "own" the item. If you want to
- do one of these, bounce an email off the person it's listed under to see if they
- have any suggestions how they plan to go about it, and to minimize conflicts
- between your work and theirs. But otherwise, all of these are fair game.
- Rob Landley suggested this:
- Implement bb_realpath() that can handle NULL on non-glibc.
- Remove obsolete _() wrapper crud for internationalization we don't do.
- Figure out where we need utf8 support, and add it.
- sh
- The command shell situation is a mess. We have two different
- shells that don't really share any code, and the "standalone shell" doesn't
- work all that well (especially not in a chroot environment), due to apps not
- being reentrant.
- Do a SUSv3 audit
- Look at the full Single Unix Specification version 3 (available online at
- "http://www.opengroup.org/onlinepubs/009695399/nfindex.html") and
- figure out which of our apps are compliant, and what we're missing that
- we might actually care about.
- Even better would be some kind of automated compliance test harness that
- exercises each command line option and the various corner cases.
- Internationalization
- How much internationalization should we do?
- The low hanging fruit is UTF-8 character set support. We should do this.
- (Vodz pointed out the shell's cmdedit as needing work here. What else?)
- We also have lots of hardwired English text messages. Consolidating this
- into some kind of message table not only makes translation easier, but
- also allows us to consolidate redundant (or close) strings.
- We probably don't want to be bloated with locale support. (Not unless we
- can cleanly export it from our underlying C library without having to
- concern ourselves with it directly. Perhaps a few specific things like a
- config option for "date" are low hanging fruit here?)
- What level should things happen at? How much do we care about
- internationalizing the text console when X11 and xterms are so much better
- at it? (There's some infrastructure here we don't implement: The
- "unicode_start" and "unicode_stop" shell scripts need "vt-is-UTF8" and a
- --unicode option to loadkeys. That implies a real loadkeys/dumpkeys
- implementation to replace loadkmap/dumpkmap. Plus messing with console font
- loading. Is it worth it, or do we just say "use X"?)
- Individual compilation of applets.
- It would be nice if busybox had the option to compile to individual applets,
- for people who want an alternate implementation less bloated than the gnu
- utils (or simply with less political baggage), but without it being one big
- executable.
- Turning libbb into a real dll is another possibility, especially if libbb
- could export some of the other library interfaces we've already more or less
- got the code for (like zlib).
- buildroot - Make a "dogfood" option
- Busybox 1.1 will be capable of replacing most gnu packages for real world
- use, such as developing software or in a live CD. It needs wider testing.
- Busybox should now be able to replace bzip2, coreutils, e2fsprogs, file,
- findutils, gawk, grep, inetutils, less, modutils, net-tools, patch, procps,
- sed, shadow, sysklogd, sysvinit, tar, util-linux, and vim. The resulting
- system should be self-hosting (I.E. able to rebuild itself from source
- code). This means it would need (at least) binutils, gcc, and make, or
- equivalents.
- It would be a good "eating our own dogfood" test if buildroot had the option
- of using a "make allyesconfig" busybox instead of all of the above
- packages. Anything that's wrong with the resulting system, we can fix. (It
- would be nice to be able to upgrade busybox to be able to replace bash and
- diffutils as well, but we're not there yet.)
- One example of an existing system that does this already is Firmware Linux:
- http://www.landley.net/code/firmware
- initramfs
- Busybox should have a sample initramfs build script. This depends on
- bbsh, mdev, and switch_root.
- mkdep
- Write a mkdep that doesn't segfault if there's a directory it doesn't
- have permission to read, isn't based on manually editing the output of
- lex and yacc, doesn't make such a mess under include/config, etc.
- Group globals into unions of structures.
- Go through and turn all the global and static variables into structures,
- and have all those structures be in a big union shared between processes,
- so busybox uses less bss. (This is a big win on nommu machines.) See
- sed.c and mdev.c for examples.
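- A minimal sketch of the union idea. The struct fields, the bb_globals name
- and the SED_G/MDEV_G macros are made up for illustration and are not taken
- from sed.c or mdev.c:

```c
#include <stddef.h>

/* Hypothetical per-applet state; the field names are illustrative only. */
struct sed_globals {
    int  be_quiet;
    char pattern_space[256];
};

struct mdev_globals {
    int  root_major;
    int  root_minor;
    char device_path[128];
};

/* Only one applet runs per process, so their state can overlap.
 * The union occupies the size of its largest member in bss,
 * not the sum of all members. */
static union {
    struct sed_globals  sed;
    struct mdev_globals mdev;
} bb_globals;

/* Each applet reaches its own view through a short macro. */
#define SED_G  (bb_globals.sed)
#define MDEV_G (bb_globals.mdev)

/* Bytes of bss saved compared with two separate global structs. */
static size_t bss_saved(void)
{
    return sizeof(struct sed_globals) + sizeof(struct mdev_globals)
           - sizeof(bb_globals);
}
```

- An applet would then write SED_G.be_quiet instead of touching a file-scope
- static, and the saving grows with every applet folded into the union.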
- Go through bugs.busybox.net and close out all of that somehow.
- This one's open to everybody, but I'll wind up doing it...
- Bernhard Reutner-Fischer <busybox@busybox.net> suggests looking at these:
- New debug options:
- -Wlarger-than-127
- Cleanup any big users
- Collate BUFSIZ IOBUF_SIZE MY_BUF_SIZE PIPE_PROGRESS_SIZE BUFSIZE PIPESIZE
- make bb_common_bufsiz1 configurable, size wise.
- make pipesize configurable, size wise.
- Use bb_common_bufsiz1 throughout applets!
- As yet unclaimed:
- ----
- diff
- Make sure we handle empty files properly:
- From the patch man page:
- you can remove a file by sending out a context diff that compares
- the file to be deleted with an empty file dated the Epoch. The
- file will be removed unless patch is conforming to POSIX and the
- -E or --remove-empty-files option is not given.
- ---
- patch
- Should have simple fuzz factor support to apply patches at an offset which
- shouldn't take up too much space.
- And while we're at it, a new patch filename quoting format is apparently
- coming soon: http://marc.theaimsgroup.com/?l=git&m=112927316408690&w=2
- ---
- stty / catv
- stty's visible() function and catv's guts are identical. Merge them into
- an appropriate libbb function.
- ---
- struct suffix_mult
- Several duplicate users of: grep -r "1024\*1024" * -B2 -A1
- Merge to a single size_suffixes[] in libbb.
- Users: head tail od_bloaty hexdump and (partially as it wouldn't hurt) svlogd
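- A rough sketch of what one shared table could look like. The struct layout,
- the suffix values and the parse_size() helper are assumptions for
- illustration, not the existing libbb interface:

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

struct suffix_mult {
    const char   *suffix;
    unsigned long mult;
};

/* One shared table instead of a private copy in each applet. */
static const struct suffix_mult size_suffixes[] = {
    { "b", 512 },
    { "k", 1024 },
    { "m", 1024 * 1024 },
    { NULL, 0 }
};

/* Hypothetical helper: parse "<number>[suffix]" using the shared table.
 * Returns 0 on success and stores the scaled value in *result. */
static int parse_size(const char *str, unsigned long *result)
{
    char *end;
    unsigned long value;
    const struct suffix_mult *sm;

    assert(str != NULL && result != NULL);
    value = strtoul(str, &end, 10);
    if (end == str)
        return -1;                      /* no digits at all */
    if (*end == '\0') {
        *result = value;                /* plain number, no suffix */
        return 0;
    }
    for (sm = size_suffixes; sm->suffix; sm++) {
        if (strcmp(end, sm->suffix) == 0) {
            *result = value * sm->mult; /* overflow is not checked here */
            return 0;
        }
    }
    return -1;                          /* unknown suffix */
}
```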
- ---
- tail
- ./busybox tail -f foo.c~ TODO
- should not print fmt=header_fmt for subsequent date >> TODO; i.e. only
- fmt+ if another (not the current) file did change
- Architectural issues:
- bb_close() with fsync()
- We should have a bb_close() in place of normal close, with a CONFIG_ option
- to not just check the return value of close() for an error, but fsync().
- Close can't reliably report anything useful because if write() accepted the
- data then it either went out to the network or it's in cache or a pipe
- buffer. Either way, there's no guarantee it'll make it to its final
- destination before close() gets called, so there's no guarantee that any
- error will be reported.
- You need to call fsync() if you care about errors that occur after write(),
- but that can have a big performance impact. So make it a config option.
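- One possible shape for such a wrapper, as a sketch only. The
- ENABLE_FSYNC_ON_CLOSE name is a stand-in for whatever CONFIG_ option would be
- chosen, and ignoring EINVAL is an assumption about how pipes and sockets
- should be treated:

```c
#include <errno.h>
#include <unistd.h>

/* Assumption: stands in for a CONFIG_ option; the name is hypothetical. */
#ifndef ENABLE_FSYNC_ON_CLOSE
#define ENABLE_FSYNC_ON_CLOSE 1
#endif

/* Sketch of a close() replacement that can surface late write errors.
 * Returns 0 on success, -1 if fsync() or close() reported a failure. */
static int bb_close(int fd)
{
    int result = 0;

    if (ENABLE_FSYNC_ON_CLOSE && fsync(fd) != 0) {
        /* Descriptors that cannot be synced (pipes, sockets) typically
         * report EINVAL; that is not treated as a data error here. */
        if (errno != EINVAL)
            result = -1;
    }
    /* Always close, even if fsync() failed, so the fd is not leaked. */
    if (close(fd) != 0)
        result = -1;
    return result;
}
```

- Because ENABLE_FSYNC_ON_CLOSE is a compile-time constant, the fsync() call
- disappears entirely when the option is off.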
- ---
- Unify archivers
- Lots of archivers have the same general infrastructure. The directory
- traversal code should be factored out, and the guts of each archiver could
- be some setup code and a series of callbacks for "add this file",
- "add this directory", "add this symlink" and so on.
- This could clean up tar and zip, and make it cheaper to add cpio and ar
- write support, and possibly even cheaply add things like mkisofs or
- mksquashfs someday, if they become relevant.
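- The callback idea might look roughly like this. All names here
- (archive_ops, archive_add_path, the count_* toy archiver) are hypothetical,
- and the sketch classifies a single path rather than walking a tree:

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <stddef.h>
#include <sys/stat.h>

/* Hypothetical callback table that each archiver would fill in. */
struct archive_ops {
    int (*add_file)(const char *path, const struct stat *st, void *ctx);
    int (*add_dir)(const char *path, const struct stat *st, void *ctx);
    int (*add_symlink)(const char *path, const struct stat *st, void *ctx);
};

/* Shared code: classify one path and hand it to the right callback.
 * A full version would also recurse into directories. */
static int archive_add_path(const struct archive_ops *ops,
                            const char *path, void *ctx)
{
    struct stat st;

    assert(ops != NULL && path != NULL);
    if (lstat(path, &st) != 0)
        return -1;
    if (S_ISDIR(st.st_mode))
        return ops->add_dir(path, &st, ctx);
    if (S_ISLNK(st.st_mode))
        return ops->add_symlink(path, &st, ctx);
    if (S_ISREG(st.st_mode))
        return ops->add_file(path, &st, ctx);
    return -1;  /* devices, fifos and sockets are left out of this sketch */
}

/* Toy "archiver" that only counts what it is given. */
struct count_ctx { int files, dirs, symlinks; };

static int count_file(const char *path, const struct stat *st, void *ctx)
{
    (void)path; (void)st;
    ((struct count_ctx *)ctx)->files++;
    return 0;
}

static int count_dir(const char *path, const struct stat *st, void *ctx)
{
    (void)path; (void)st;
    ((struct count_ctx *)ctx)->dirs++;
    return 0;
}

static int count_symlink(const char *path, const struct stat *st, void *ctx)
{
    (void)path; (void)st;
    ((struct count_ctx *)ctx)->symlinks++;
    return 0;
}

static const struct archive_ops count_ops = {
    count_file, count_dir, count_symlink
};
```

- A tar or zip writer would supply its own three callbacks and reuse
- archive_add_path() unchanged.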
- ---
- Text buffer support.
- Several existing applets (sort, vi, less...) read
- a whole file into memory and act on it. Use open_read_close().
- ---
- Memory Allocation
- We have a CONFIG_BUFFER mechanism that lets us select whether to do memory
- allocation on the stack or the heap. Unfortunately, we're not using it much.
- We need to audit our memory allocations and turn a lot of malloc/free calls
- into RESERVE_CONFIG_BUFFER/RELEASE_CONFIG_BUFFER.
- For a start, see e.g. make EXTRA_CFLAGS=-Wlarger-than-64
- And while we're at it, many of the CONFIG_FEATURE_CLEAN_UP #ifdefs will be
- optimized out by the compiler in the stack allocation case (since there's no
- free for an alloca()), and this means that various cleanup loops that just
- call free might also be optimized out by the compiler if written right, so
- we can yank those #ifdefs too, and generally clean up the code.
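- A simplified sketch of how the two macros can hide the stack/heap choice.
- The BUFFERS_GO_ON_STACK selector is an assumed name, plain malloc() is used
- where real code would use xmalloc(), and copy_and_measure() is only a
- made-up caller:

```c
#include <stdlib.h>
#include <string.h>

/* Assumption: stands in for the real config choice. */
#ifndef BUFFERS_GO_ON_STACK
#define BUFFERS_GO_ON_STACK 1
#endif

#if BUFFERS_GO_ON_STACK
# define RESERVE_CONFIG_BUFFER(buffer, len) char buffer[len]
# define RELEASE_CONFIG_BUFFER(buffer)      ((void)0)
#else
/* No NULL check in this sketch; xmalloc() would exit on failure. */
# define RESERVE_CONFIG_BUFFER(buffer, len) char *buffer = malloc(len)
# define RELEASE_CONFIG_BUFFER(buffer)      free(buffer)
#endif

/* Example caller: the same source works with either strategy. */
static size_t copy_and_measure(const char *text)
{
    size_t len;
    RESERVE_CONFIG_BUFFER(scratch, 64);

    strncpy(scratch, text, 63);     /* copy at most 63 bytes */
    scratch[63] = '\0';             /* always terminate */
    len = strlen(scratch);
    RELEASE_CONFIG_BUFFER(scratch); /* a no-op in the stack case */
    return len;
}
```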
- ---
- Switch CONFIG_SYMBOLS to ENABLE_SYMBOLS
- In busybox 1.0 and earlier, configuration was done by CONFIG_SYMBOLS
- that were either defined or undefined to indicate whether the symbol was
- selected in the .config file. They were used with #ifdefs, a la:
-   #ifdef CONFIG_SYMBOL
-     if (other_test) {
-       do_code();
-     }
-   #endif
- In 1.1, we have new ENABLE_SYMBOLS which are always defined (as 0 or 1),
- meaning you can still use them for preprocessor tests by replacing
- "#ifdef CONFIG_SYMBOL" with "#if ENABLE_SYMBOL". But more importantly, we
- can use them as a true or false test in normal C code:
-   if (ENABLE_SYMBOL && other_test) {
-     do_code();
-   }
- (Optimizing away if() statements that resolve to a constant value
- is known as "dead code elimination", an optimization so old and simple that
- Turbo Pascal for DOS did it twenty years ago. Even modern mini-compilers
- like the Tiny C Compiler (tcc) and the Small Device C Compiler (SDCC)
- perform dead code elimination.)
- Right now, busybox.h is #including both "config.h" (defining the
- CONFIG_SYMBOLS) and "bb_config.h" (defining the ENABLE_SYMBOLS). At some
- point in the future, it would be nice to wean ourselves off of the
- CONFIG versions. (Among other things, some defective build environments
- leak the Linux kernel's CONFIG_SYMBOLS into the system's standard #include
- files. We've experienced collisions before.)
- ---
- FEATURE_CLEAN_UP
- This is more an unresolved issue than a to-do item. More thought is needed.
- Normally we rely on exit() to free memory, close files and unmap segments
- for us. This makes most calls to free(), close(), and munmap() optional in
- busybox applets that don't intend to run for very long, and optional stuff
- can be omitted to save size.
- The idea was raised that we could simulate fork/exit with setjmp/longjmp
- for _really_ brainless embedded systems, or speed up the standalone shell
- by not forking. Doing so would require a reliable FEATURE_CLEAN_UP.
- Unfortunately, this isn't as easy as it sounds.
- The problem is, lots of things exit(), sometimes unexpectedly (xmalloc())
- and sometimes reliably (bb_perror_msg_and_die() or show_usage()). This
- jumps out of the normal flow control and bypasses any cleanup code we
- put at the end of our applets.
- It's possible to add hooks to libbb functions like xmalloc() and xopen()
- to add their entries to a linked list, which could be traversed and
- freed/closed automatically. (This would need to be able to free just the
- entries after a checkpoint to be usable for a forkless standalone shell.
- You don't want to free the shell's own resources.)
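- A sketch of the tracked-allocation list with a checkpoint. The names
- tracked_malloc(), cleanup_checkpoint() and cleanup_to() are invented here,
- and only memory is tracked, not file descriptors:

```c
#include <stdlib.h>

/* One list node per tracked allocation, newest first. */
struct tracked {
    struct tracked *prev;
    void           *ptr;
};

static struct tracked *alloc_list;

/* Hypothetical xmalloc() variant that records what it hands out.
 * The size is assumed to be non-zero. */
static void *tracked_malloc(size_t size)
{
    struct tracked *node = malloc(sizeof(*node));
    void *ptr = malloc(size);

    if (!node || !ptr)
        abort();            /* the real xmalloc() would print and exit */
    node->ptr = ptr;
    node->prev = alloc_list;
    alloc_list = node;
    return ptr;
}

/* A checkpoint is simply the list head at some moment in time. */
static struct tracked *cleanup_checkpoint(void)
{
    return alloc_list;
}

/* Free everything allocated after the checkpoint, leaving older
 * entries (for example the shell's own) alone. Returns the count. */
static int cleanup_to(struct tracked *checkpoint)
{
    int freed = 0;

    while (alloc_list && alloc_list != checkpoint) {
        struct tracked *node = alloc_list;
        alloc_list = node->prev;
        free(node->ptr);
        free(node);
        freed++;
    }
    return freed;
}
```

- A forkless standalone shell would take a checkpoint before running an
- applet's main function and call cleanup_to() with it afterwards.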
- Right now, FEATURE_CLEAN_UP is more or less a debugging aid, to make things
- like valgrind happy. It's also documentation of _what_ we're trusting
- exit() to clean up for us. But new infrastructure to auto-free stuff would
- render the existing FEATURE_CLEAN_UP code redundant.
- For right now, exit() handles it just fine.
- Minor stuff:
- watchdog.c could autodetect the timer duration via:
- if(!ioctl (fd, WDIOC_GETTIMEOUT, &tmo)) timer_duration = 1 + (tmo / 2);
- Unfortunately, that needs linux/watchdog.h and that contains unfiltered
- kernel types on some distros, which breaks the build.
- ---
- use bb_error_msg where appropriate: See
- egrep "(printf.*\([[:space:]]*(stderr|2)|[^_]write.*\([[:space:]]*(stderr|2))"
- ---
- use bb_perror_msg where appropriate: See
- egrep "[^_]perror"
- ---
- possible code duplication between ingroup() and is_a_group_member()
- ---
- Move __get_hz() to a better place and (re)use it in route.c, ash.c
- ---
- See grep -r strtod
- A lot of duplication that wants cleanup.
- ---
- in_ether duplicated in network/{interface,ifconfig}.c
- ---
- unify progress_meter. wget, flash_eraseall, pipe_progress, fbsplash, setfiles.
- ---
- support start-stop-daemon -d <chdir-path>
- Code cleanup:
- Replace deprecated functions.
- ---
- vdprintf() -> similar sized functionality
- ---
- (TODO list after discussion 11.05.2009)
- * shrink tc/brctl/ip
- tc/brctl seem like fairly large things to try and tackle in your timeframe,
- and I think people have posted attempts in the past. Adding additional
- options to ip though seems reasonable.
- * add tests for some applets
- * implement POSIX utilities and audit them for POSIX conformance. then
- audit them for GNU conformance. then document all your findings in a new
- doc/conformance.txt file while perhaps implementing some of the missing
- features.
- you can find the latest POSIX documentation (1003.1-2008) here:
- http://www.opengroup.org/onlinepubs/9699919799/
- and the complete list of all utilities that POSIX covers:
- http://www.opengroup.org/onlinepubs/9699919799/idx/utilities.html
- The first step would be to generate a file/matrix of what is already achieved
- (also IPV6)
- * ntpdate/ntpd (see ntpclient and openntp for examples)
- * implement 'at'
- * rpcbind (former portmap) or equivalent
- so that we don't have to use -o nolock on nfs mounts
- * check IPV6 compliance
- * generate a mini example using kernel+busybox only (+libc) for example
- * more support for advanced linux 2.6.x features, see: iotop
- most likely there is more
- * even more support for statistics: mpstat, iostat, powertop....
- Unicode work needed:
- Unicode support uses libc multibyte functions if LOCALE_SUPPORT is on
- (in this case, the code will also support many more encodings),
- or uses a limited subset of re-implemented multibyte functions
- which only understand "one byte == one char" and unicode.
- This is useful if you build against uclibc with locale support disabled.
- Unicode-dependent applets must call check_unicode_in_env() when they
- begin executing.
- Applet code may conditionalize on UNICODE_SUPPORT in order to use
- more efficient code if unicode support is not requested.
- Available functions (if you need more, implement them in libbb/unicode.c
- so that they work without LOCALE_SUPPORT too):
- int bb_mbstrlen(str) - multibyte-aware strlen
- size_t mbstowcs(wdest, src, n)
- size_t wcstombs(dest, wsrc, n)
- size_t wcrtomb(str, wc, wstate)
- int iswspace(wc)
- int iswalnum(wc)
- int iswpunct(wc)
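- To illustrate the kind of libc-free fallback described above, here is a
- sketch of a UTF-8 aware length count. The function name is hypothetical and
- this is not the code in libbb/unicode.c; it assumes well-formed UTF-8 and
- does no validation:

```c
#include <stddef.h>

/* Count characters in a UTF-8 string by counting every byte that is
 * not a continuation byte (continuation bytes look like 10xxxxxx). */
static size_t utf8_strlen_sketch(const char *str)
{
    size_t count = 0;

    for (; *str; str++) {
        if (((unsigned char)*str & 0xC0) != 0x80)
            count++;
    }
    return count;
}
```

- This counts code points, not screen columns, so wide and combining
- characters would still need separate handling for column alignment.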
- Applets which only need to align columns on screen correctly:
- ls - already done, use source as an example
- df
- dumpleases
- lsmod
- Applets which need to account for Unicode chars
- while processing the output:
- [un]expand
- fold
- man
- watch
- cut (-b and -c are currently the same, needs fixing)
- These applets need to ensure that unicode input
- is handled correctly (say, <unicode><backspace> sequence):
- getty, login
- rm -i
- unzip (overwrite prompt)
- Viewers/editors are more difficult (many cases to get right).
- libbb/lineedit.c is an example of how to do it:
- less, most, ed, vi
- awk
- [ef]grep
- sed
- Probably needs some specialized work:
- loadkeys