123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286 |
- __________________________________________________________________________
- This is the Info-ZIP file ZipPorts, last updated on 17 February 1996.
- __________________________________________________________________________
- This document defines a set of rules and guidelines for those who wish to
- contribute patches to Zip and UnZip (or even entire ports to new operating
- systems). The list below is something between a style sheet and a "Miss
- Manners" etiquette guide. While Info-ZIP encourages contributions and
- fixes from anyone who finds something worth changing, we are also aware
- of the fact that no two programmers have the programming style and that
- unrestrained changes by a few dozen contributors would result in hideously
- ugly (and unmaintainable) Frankenstein code. So consider the following an
- attempt by the maintainers to maintain sanity as well as useful code.
- (The first version of this document was called either "ZipRules" or the
- "No Feelthy ..." file and was compiled by David Kirschbaum in consulta-
- tion with Mark Adler, Cave McNewt and others. The current incarnation
- expands upon the original with insights gained from a few more years of
- happy hacking...)
- Summary:
- (0) The Platinum Rule: DON'T BREAK EXISTING PORTS
- (0.1) The Golden Rule: DO UNTO THE CODE AS OTHERS HAVE DONE BEFORE
- (0.2) The Silver Rule: DO UNTO THE LATEST BETA CODE
- (0.3) The Bronze Rule: NO FEELTHY PIGGYBACKS
- (1) NO FEELTHY TABS
- (2) NO FEELTHY CARRIAGE RETURNS
- (3) NO FEELTHY 8-BIT CHARS
- (4) NO FEELTHY LEFT-JUSTIFIED DASHES
- (5) NO FEELTHY FANCY_FILENAMES
- (6) NO FEELTHY NON-ZIPFILES AND NO FEELTHY E-MAIL BETAS
- (7) NO FEELTHY E-MAIL BINARIES
- Explanations:
- (0) The Platinum Rule: DON'T BREAK EXISTING PORTS
- No doubt about it, this is the one which really pisses us off and
- pretty much guarantees that your port or patch will be ignored and/
- or laughed at. Examples range from the *really* severe cases which
- "port" by ripping out all of the existing multi-OS code, to more
- subtle oopers like relying on a local capability which doesn't exist
- on other OSes or in older compilers (e.g., the use of ANSI "#elif"
- or "#pragma" or "##" constructs, C++ comments, GNU extensions, etc.).
- As to the former, use #ifdefs for your new code (see rule 0.3). And
- as to the latter, trust us--there are few things we'd like better
- than to be able to use some of the elegant "new" features out there
- (many of which have been around for a decade or more). But our code
- still compiles on machines dating back even longer, at least in spirit
- --e.g., the AT&T 3B1 family and Dynix/ptx. Until we say otherwise,
- dinosaurs are supported.
- (0.1) The Golden Rule: DO UNTO THE CODE AS OTHERS HAVE DONE BEFORE
- In other words, try to fit into the local style of programming--no
- matter how painful it may be. This includes cosmetic aspects like
- indenting the same amount (both in the main C code and in the in-
- clude files), using braces and comments similarly, NO TABS (see rule
- #1), etc.; but also more substantive things like (for UnZip) putting
- character strings into static (far) variables and using the LoadFar-
- String macros to avoid overflowing limited MS-DOS data segments, and
- using the ugly Info() macro instead of the more usual *printf()
- functions so that dynamic-link-library ports are simpler. NEVER put
- single-OS code (e.g., OS/2) of more than two or three lines into the
- main (generic) modules; those are shared by everybody, and nobody else
- cares about it or wants to see it.
- Note that not only do Zip and UnZip differ in these respects, so do
- individual parts of each program. While it would be nice to have
- global consistency, cosmetic changes are not a high priority; for
- now we'll settle for local consistency--i.e., don't make things any
- worse than they already are.
- Exception (BIG exception): single-letter variable names. Despite
- the prevailing practice in much of Zip and parts of UnZip, and de-
- spite the fact that one-letter variables allow you to pack really
- cool, compact and complicated expressions onto one line, they also
- make the code very difficult to maintain and are therefore *strongly*
- discouraged. Don't ask us who is responsible in the first place;
- while this sort of brain damage is not uncommon among former BASIC
- programmers, it is nevertheless a lifelong embarrassment, and we do
- try to pity the poor sod (that is, when we're not chasing bugs and
- cursing him). :-)
- (0.2) The Silver Rule: DO UNTO THE LATEST BETA CODE
- Few things are as annoying as receiving a large patch which obviously
- represents a lot of time and careful work but which is relative to
- an old version of Info-ZIP code. As wonderful as Larry Wall's patch
- program is at applying context diffs to modified code, we regularly
- make near-global changes and/or reorganize big chunks of the sources
- (particularly in UnZip), and "patch" can't work miracles--big changes
- invariably break any patch which is relative to an old version of the
- code.
- Bottom line: contact the Info-ZIP core team FIRST (via the zip-bugs
- e-mail address) and get up to date with the latest code before begin-
- ning a big new port. And try to *stay* up to date while working on
- your port--at least, as much as possible.
- (0.3) The Bronze Rule: NO FEELTHY PIGGYBACKS
- UnZip is currently ported to something like 12 operating systems
- (a few more or less depending on how one counts), and each of these,
- with the possible exception of VM/CMS, has a unique macro identifying
- it: AMIGA, ATARI_ST, __human68k__, MACOS, MSDOS, MVS, OS2, TOPS20,
- UNIX, VMS, WIN32. Zip is moving in the same direction. New ports
- should NOT piggyback one of the existing ports unless they are sub-
- stantially similar--for example, Minix and Coherent are basically Unix
- and therefore are included in the UNIX macro, but DOS djgpp ports and
- OS/2 emx ports (both of which use the Unix-originated GNU C compiler
- and often have "unix" defined by default) are obviously *not* Unix.
- [The existing MTS port is a special exception; basically only one per-
- son knows what MTS really is, and he's not telling. Presumably it's
- not very close to Unix, but it's not worth arguing about it now.]
- Along the same lines, neither OS/2 nor Human68K is the same as (or
- even close to) MS-DOS. MVS and VM/CMS, on the other hand, are quite
- similar to each other and are therefore combined in most places.
- Bottom line: when adding a new port (e.g., QDOS), create a new macro
- for it ("QDOS"), a new subdirectory ("qdos") and a new source file for
- OS-specific code ("qdos/qdos.c"). Use #ifdefs to fit any OS-specific
- changes into the existing code (e.g., unzpriv.h). If it's close enough
- to an existing port that piggybacking is a temptation, define a new
- "combination macro" (e.g., "CMS_MVS") and replace the old macros as
- required. (This last applies to UnZip, at least; the old preference
- in Zip was fewer macros and long #ifdef lines, so talk to Onno or Jean-
- loup about that.) See also rule 0.1.
- (Note that, for UnZip, new ports need not attempt to deal with all
- features. Among other things, the wildcard-zipfile code in do_wild()
- may be replaced with a supplied dummy version, since opendir/readdir/
- closedir() or the equivalent can be difficult to implement.)
- (1) NO FEELTHY TABS
- Some editors and e-mail systems either have no capability to use
- and/or display tab characters (ASCII 9) correctly, or they use non-
- standard or variable-width tab columns, or other horrors. Some edi-
- tors auto-convert spaces to tabs, after which the blind use of "diff
- -c" results in a huge and mostly useless patch. Yes, *we* know about
- diff's "-b" option, but not everyone does. And yes, we also know this
- makes the source files bigger, even after compression; so be it. If
- we *really* cared that much about the size of the sources, we'd still
- be writing Unix-only utilities.
- Bottom line: use spaces, not tabs.
- Exception: some of the makefiles (the Unix one in particular) require
- tabs as part of the syntax.
- Related utility programs:
- Unix, OS/2 and MS-DOS: expand, unexpand.
- MS-DOS: Buerg's TABS; Toad Hall's TOADSOFT.
- And some editors have the conversion built-in.
- (2) NO FEELTHY CARRIAGE RETURNS
- All source, documentation and other text files shall have Unix style
- line endings (LF only, a.k.a. ctrl-J), not the DOS/OS2/NT CR+LF or Mac
- CR-only line endings.
- Reason: "real programmers" in any environment can convert back and
- forth between Unix and DOS/Mac style. All PC compilers but a few old
- Borland versions can use either Unix or MS-DOS end-of-lines. Buerg's
- LIST (file-display utility) for MS-DOS can use Unix or MS-DOS EOLs.
- Both Zip and UnZip can convert line-endings as appropriate. But Unix
- utilities like diff and patch die a horrible death (or produce horrible
- output) if the target files have CRs.
- Related utilities: flip for Unix, OS/2 and MS-DOS; Unix "tr".
- Exceptions: documentation in pre-compiled binary distributions should
- be in the local (target) format.
- (3) NO FEELTHY 8-BIT CHARS
- Do all your editing in a plain-text ASCII editor. No WordPerfect, MS
- Word, WordStar document mode, or other word processor files, thenkyew.
- No desktop publishing. *Especially* no EBCDIC. No TIFFs, no GIFs, no
- embedded pictures or dancing ladies (too bad, Cave Newt). [Sigh... -CN]
- Reason: compatibility with different consoles. My old XT clone is
- the most limited!
- Exceptions: some Macintosh makefiles apparently require some 8-bit
- characters; the Human68k port uses 8-bit characters for Kanji or Kana
- comments (I think); etc.
- Related utilities: vi, emacs, EDLIN, Turbo C editor, other programmers'
- editors, various word processor -> text conversion utilities.
- (4) NO FEELTHY LEFT-JUSTIFIED DASHES
- Always precede repeated dashes (------) with one or more leading non-
- dash characters: spaces, tabs, pound signs (#), comments (/*), what-
- ever.
- Reason: sooner or later your source file will be e-mailed through an
- undigestifier utility, most of which treat leading dashes as end-of-
- message separators. We'd rather not have your code broken up into a
- dozen separate untitled messages, thank you.
- (5) NO FEELTHY FANCY_FILENAMES
- Assume the worst: that someone on a brain-damaged DOS system has to
- work with everything your magic fingers produced. Keep the filenames
- unimaginative and within MS-DOS limits (i.e., ordinary A..Z, 1..9,
- "-$_!"-type characters, in the 8.3 "filename.ext" format). Mac and
- Unix users, giggle all you want, but no spaces or multiple dots.
- Reason: compatibility with different file systems. MS-DOS FAT is the
- most limited, with the exception of CompuServe (6.3, argh).
- Exceptions: slightly longer names are occasionally acceptable within
- OS-specific subdirectories, but don't do that unless there's a good
- reason for it.
- (6) NO FEELTHY NON-ZIPFILES AND NO FEELTHY E-MAIL BETAS
- Beta testers and developers are in general expected to have both
- ftp capability and the ability to deal with zipfiles. Those without
- should either find a friend who does or else learn about ftp-mailers.
- Reason: the core development team barely has time to work on the
- code, much less prepare oddball formats and/or mail betas out (and
- the situation is getting worse, sigh).
- Exceptions: anyone seriously proposing to do a new port will be
- given special treatment, particularly with respect to UnZip; we
- obviously realize that bootstrapping a completely new port can be
- quite difficult and have no desire to make it even harder due to
- lack of access to the latest code (rule 0.2).
- Public releases of UnZip, on the other hand, will be available in
- two formats: .tar.Z (16-bit compress'd tar) and .zip (either "plain"
- or self-extracting). Zip sources and executables will generally only
- be distributed in .zip format, since Zip is pretty much useless without
- UnZip.
- (7) NO FEELTHY E-MAIL BINARIES
- Binary files (e.g., executables, test zipfiles, etc.) should NEVER
- be mailed raw. Where possible, they should be uploaded via ftp in
- BINARY mode; if that's impossible, Mark's "ship" ASCII-encoder should
- be used; and if that's unavailable, uuencode or xxencode should be
- used. Weirdo NeXTmail, mailtool and MIME formats are also Right Out.
- Files larger than 50KB may need to be broken into pieces for mailing
- (be sure to label them in order!), unless "ship" is used (it can
- auto-split, label and mail files if told to do so). If Down Under
- is involved, files must be broken into under-20KB chunks.
- Reasons: to prevent sounds of gagging mailers from resounding through-
- out the land. To be relatively efficient in the binary->ASCII conver-
- sion. (Yeah, yeah, I know, there's better conversions out there. But
- not as widely known, and they often break on BITNET gateways.)
- Related utilities: ship, uuencode, uudecode, uuxfer20, quux, others.
- Just make sure they don't leave embedded or trailing spaces (that is,
- they should use the "`" character in place of ASCII 32). Otherwise
- mailers are prone to truncate or whatever.
- Greg Roelofs (a.k.a. Cave Newt)
- Info-ZIP UnZip maintainer
- David Kirschbaum
- former Info-ZIP Coordinator
|