coherent/g/usr/bin/gzip/inflate.c - annotate

Return to inflate.c CVS log
Up to [MW Coherent from dump] / coherent / g / usr / bin / gzip
Annotation of coherent/g/usr/bin/gzip/inflate.c, revision 1.1

1.1     ! root        1: /* inflate.c -- Not copyrighted 1992 by Mark Adler
        !             2:    version c10p1, 10 January 1993 */
        !             3: 
        !             4: /* You can do whatever you like with this source file, though I would
        !             5:    prefer that if you modify it and redistribute it that you include
        !             6:    comments to that effect with your name and the date.  Thank you.
        !             7:    [The history has been moved to the file ChangeLog.]
        !             8:  */
        !             9: 
        !            10: /*
        !            11:    Inflate deflated (PKZIP's method 8 compressed) data.  The compression
        !            12:    method searches for as much of the current string of bytes (up to a
        !            13:    length of 258) in the previous 32K bytes.  If it doesn't find any
        !            14:    matches (of at least length 3), it codes the next byte.  Otherwise, it
        !            15:    codes the length of the matched string and its distance backwards from
        !            16:    the current position.  There is a single Huffman code that codes both
        !            17:    single bytes (called "literals") and match lengths.  A second Huffman
        !            18:    code codes the distance information, which follows a length code.  Each
        !            19:    length or distance code actually represents a base value and a number
        !            20:    of "extra" (sometimes zero) bits to get to add to the base value.  At
        !            21:    the end of each deflated block is a special end-of-block (EOB) literal/
        !            22:    length code.  The decoding process is basically: get a literal/length
        !            23:    code; if EOB then done; if a literal, emit the decoded byte; if a
        !            24:    length then get the distance and emit the referred-to bytes from the
        !            25:    sliding window of previously emitted data.
        !            26: 
        !            27:    There are (currently) three kinds of inflate blocks: stored, fixed, and
        !            28:    dynamic.  The compressor deals with some chunk of data at a time, and
        !            29:    decides which method to use on a chunk-by-chunk basis.  A chunk might
        !            30:    typically be 32K or 64K.  If the chunk is uncompressible, then the
        !            31:    "stored" method is used.  In this case, the bytes are simply stored as
        !            32:    is, eight bits per byte, with none of the above coding.  The bytes are
        !            33:    preceded by a count, since there is no longer an EOB code.
        !            34: 
        !            35:    If the data is compressible, then either the fixed or dynamic methods
        !            36:    are used.  In the dynamic method, the compressed data is preceded by
        !            37:    an encoding of the literal/length and distance Huffman codes that are
        !            38:    to be used to decode this block.  The representation is itself Huffman
        !            39:    coded, and so is preceded by a description of that code.  These code
        !            40:    descriptions take up a little space, and so for small blocks, there is
        !            41:    a predefined set of codes, called the fixed codes.  The fixed method is
        !            42:    used if the block codes up smaller that way (usually for quite small
        !            43:    chunks), otherwise the dynamic method is used.  In the latter case, the
        !            44:    codes are customized to the probabilities in the current block, and so
        !            45:    can code it much better than the pre-determined fixed codes.
        !            46:  
        !            47:    The Huffman codes themselves are decoded using a mutli-level table
        !            48:    lookup, in order to maximize the speed of decoding plus the speed of
        !            49:    building the decoding tables.  See the comments below that precede the
        !            50:    lbits and dbits tuning parameters.
        !            51:  */
        !            52: 
        !            53: 
        !            54: /*
        !            55:    Notes beyond the 1.93a appnote.txt:
        !            56: 
        !            57:    1. Distance pointers never point before the beginning of the output
        !            58:       stream.
        !            59:    2. Distance pointers can point back across blocks, up to 32k away.
        !            60:    3. There is an implied maximum of 7 bits for the bit length table and
        !            61:       15 bits for the actual data.
        !            62:    4. If only one code exists, then it is encoded using one bit.  (Zero
        !            63:       would be more efficient, but perhaps a little confusing.)  If two
        !            64:       codes exist, they are coded using one bit each (0 and 1).
        !            65:    5. There is no way of sending zero distance codes--a dummy must be
        !            66:       sent if there are none.  (History: a pre 2.0 version of PKZIP would
        !            67:       store blocks with no distance codes, but this was discovered to be
        !            68:       too harsh a criterion.)  Valid only for 1.93a.  2.04c does allow
        !            69:       zero distance codes, which is sent as one code of zero bits in
        !            70:       length.
        !            71:    6. There are up to 286 literal/length codes.  Code 256 represents the
        !            72:       end-of-block.  Note however that the static length tree defines
        !            73:       288 codes just to fill out the Huffman codes.  Codes 286 and 287
        !            74:       cannot be used though, since there is no length base or extra bits
        !            75:       defined for them.  Similarily, there are up to 30 distance codes.
        !            76:       However, static trees define 32 codes (all 5 bits) to fill out the
        !            77:       Huffman codes, but the last two had better not show up in the data.
        !            78:    7. Unzip can check dynamic Huffman blocks for complete code sets.
        !            79:       The exception is that a single code would not be complete (see #4).
        !            80:    8. The five bits following the block type is really the number of
        !            81:       literal codes sent minus 257.
        !            82:    9. Length codes 8,16,16 are interpreted as 13 length codes of 8 bits
        !            83:       (1+6+6).  Therefore, to output three times the length, you output
        !            84:       three codes (1+1+1), whereas to output four times the same length,
        !            85:       you only need two codes (1+3).  Hmm.
        !            86:   10. In the tree reconstruction algorithm, Code = Code + Increment
        !            87:       only if BitLength(i) is not zero.  (Pretty obvious.)
        !            88:   11. Correction: 4 Bits: # of Bit Length codes - 4     (4 - 19)
        !            89:   12. Note: length code 284 can represent 227-258, but length code 285
        !            90:       really is 258.  The last length deserves its own, short code
        !            91:       since it gets used a lot in very redundant files.  The length
        !            92:       258 is special since 258 - 3 (the min match length) is 255.
        !            93:   13. The literal/length and distance code bit lengths are read as a
        !            94:       single stream of lengths.  It is possible (and advantageous) for
        !            95:       a repeat code (16, 17, or 18) to go across the boundary between
        !            96:       the two sets of lengths.
        !            97:  */
        !            98: 
        !            99: #ifndef lint
        !           100: static char rcsid[] = "$Id: inflate.c,v 0.10 1993/02/04 13:21:06 jloup Exp $";
        !           101: #endif
        !           102: 
        !           103: #include "tailor.h"
        !           104: #include "gzip.h"
        !           105: #define slide window
        !           106: 
        !           107: #include <stdio.h>
        !           108: 
        !           109: #if defined(STDC_HEADERS) || !defined(NO_STDLIB_H)
        !           110: #  include <sys/types.h>
        !           111: #  include <stdlib.h>
        !           112: #endif
        !           113: 
        !           114: /* Huffman code lookup table entry--this entry is four bytes for machines
        !           115:    that have 16-bit pointers (e.g. PC's in the small or medium model).
        !           116:    Valid extra bits are 0..13.  e == 15 is EOB (end of block), e == 16
        !           117:    means that v is a literal, 16 < e < 32 means that v is a pointer to
        !           118:    the next table, which codes e - 16 bits, and lastly e == 99 indicates
        !           119:    an unused code.  If a code with e == 99 is looked up, this implies an
        !           120:    error in the data. */
        !           121: struct huft {
        !           122:   uch e;                /* number of extra bits or operation */
        !           123:   uch b;                /* number of bits in this code or subcode */
        !           124:   union {
        !           125:     ush n;              /* literal, length base, or distance base */
        !           126:     struct huft *t;     /* pointer to next level of table */
        !           127:   } v;
        !           128: };
        !           129: 
        !           130: 
        !           131: /* Function prototypes */
        !           132: int huft_build OF((unsigned *, unsigned, unsigned, ush *, ush *,
        !           133:                    struct huft **, int *));
        !           134: int huft_free OF((struct huft *));
        !           135: int inflate_codes OF((struct huft *, struct huft *, int, int));
        !           136: int inflate_stored OF((void));
        !           137: int inflate_fixed OF((void));
        !           138: int inflate_dynamic OF((void));
        !           139: int inflate_block OF((int *));
        !           140: int inflate OF((void));
        !           141: 
        !           142: 
        !           143: /* The inflate algorithm uses a sliding 32K byte window on the uncompressed
        !           144:    stream to find repeated byte strings.  This is implemented here as a
        !           145:    circular buffer.  The index is updated simply by incrementing and then
        !           146:    and'ing with 0x7fff (32K-1). */
        !           147: /* It is left to other modules to supply the 32K area.  It is assumed
        !           148:    to be usable as if it were declared "uch slide[32768];" or as just
        !           149:    "uch *slide;" and then malloc'ed in the latter case.  The definition
        !           150:    must be in unzip.h, included above. */
        !           151: /* unsigned wp;             current position in slide */
        !           152: #define wp outcnt
        !           153: #define flush_output(w) (wp=(w),flush_window())
        !           154: 
        !           155: /* Tables for deflate from PKZIP's appnote.txt. */
        !           156: static unsigned border[] = {    /* Order of the bit length code lengths */
        !           157:         16, 17, 18, 0, 8, 7, 9, 6, 10, 5, 11, 4, 12, 3, 13, 2, 14, 1, 15};
        !           158: static ush cplens[] = {         /* Copy lengths for literal codes 257..285 */
        !           159:         3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 15, 17, 19, 23, 27, 31,
        !           160:         35, 43, 51, 59, 67, 83, 99, 115, 131, 163, 195, 227, 258, 0, 0};
        !           161:         /* note: see note #13 above about the 258 in this list. */
        !           162: static ush cplext[] = {         /* Extra bits for literal codes 257..285 */
        !           163:         0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 2,
        !           164:         3, 3, 3, 3, 4, 4, 4, 4, 5, 5, 5, 5, 0, 99, 99}; /* 99==invalid */
        !           165: static ush cpdist[] = {         /* Copy offsets for distance codes 0..29 */
        !           166:         1, 2, 3, 4, 5, 7, 9, 13, 17, 25, 33, 49, 65, 97, 129, 193,
        !           167:         257, 385, 513, 769, 1025, 1537, 2049, 3073, 4097, 6145,
        !           168:         8193, 12289, 16385, 24577};
        !           169: static ush cpdext[] = {         /* Extra bits for distance codes */
        !           170:         0, 0, 0, 0, 1, 1, 2, 2, 3, 3, 4, 4, 5, 5, 6, 6,
        !           171:         7, 7, 8, 8, 9, 9, 10, 10, 11, 11,
        !           172:         12, 12, 13, 13};
        !           173: 
        !           174: 
        !           175: 
        !           176: /* Macros for inflate() bit peeking and grabbing.
        !           177:    The usage is:
        !           178:    
        !           179:         NEEDBITS(j)
        !           180:         x = b & mask_bits[j];
        !           181:         DUMPBITS(j)
        !           182: 
        !           183:    where NEEDBITS makes sure that b has at least j bits in it, and
        !           184:    DUMPBITS removes the bits from b.  The macros use the variable k
        !           185:    for the number of bits in b.  Normally, b and k are register
        !           186:    variables for speed, and are initialized at the begining of a
        !           187:    routine that uses these macros from a global bit buffer and count.
        !           188: 
        !           189:    If we assume that EOB will be the longest code, then we will never
        !           190:    ask for bits with NEEDBITS that are beyond the end of the stream.
        !           191:    So, NEEDBITS should not read any more bytes than are needed to
        !           192:    meet the request.  Then no bytes need to be "returned" to the buffer
        !           193:    at the end of the last block.
        !           194: 
        !           195:    However, this assumption is not true for fixed blocks--the EOB code
        !           196:    is 7 bits, but the other literal/length codes can be 8 or 9 bits.
        !           197:    (The EOB code is shorter than other codes becuase fixed blocks are
        !           198:    generally short.  So, while a block always has an EOB, many other
        !           199:    literal/length codes have a significantly lower probability of
        !           200:    showing up at all.)  However, by making the first table have a
        !           201:    lookup of seven bits, the EOB code will be found in that first
        !           202:    lookup, and so will not require that too many bits be pulled from
        !           203:    the stream.
        !           204:  */
        !           205: 
        !           206: ulg bb;                         /* bit buffer */
        !           207: unsigned bk;                    /* bits in bit buffer */
        !           208: 
        !           209: ush mask_bits[] = {
        !           210:     0x0000,
        !           211:     0x0001, 0x0003, 0x0007, 0x000f, 0x001f, 0x003f, 0x007f, 0x00ff,
        !           212:     0x01ff, 0x03ff, 0x07ff, 0x0fff, 0x1fff, 0x3fff, 0x7fff, 0xffff
        !           213: };
        !           214: 
        !           215: #ifdef CRYPT
        !           216:   uch cc;
        !           217: #  define NEXTBYTE() \
        !           218:      (decrypt ? (cc = get_byte(), zdecode(cc), cc) : get_byte())
        !           219: #else
        !           220: #  define NEXTBYTE()  (uch)get_byte()
        !           221: #endif
        !           222: #define NEEDBITS(n) {while(k<(n)){b|=((ulg)NEXTBYTE())<<k;k+=8;}}
        !           223: #define DUMPBITS(n) {b>>=(n);k-=(n);}
        !           224: 
        !           225: 
        !           226: /*
        !           227:    Huffman code decoding is performed using a multi-level table lookup.
        !           228:    The fastest way to decode is to simply build a lookup table whose
        !           229:    size is determined by the longest code.  However, the time it takes
        !           230:    to build this table can also be a factor if the data being decoded
        !           231:    is not very long.  The most common codes are necessarily the
        !           232:    shortest codes, so those codes dominate the decoding time, and hence
        !           233:    the speed.  The idea is you can have a shorter table that decodes the
        !           234:    shorter, more probable codes, and then point to subsidiary tables for
        !           235:    the longer codes.  The time it costs to decode the longer codes is
        !           236:    then traded against the time it takes to make longer tables.
        !           237: 
        !           238:    This results of this trade are in the variables lbits and dbits
        !           239:    below.  lbits is the number of bits the first level table for literal/
        !           240:    length codes can decode in one step, and dbits is the same thing for
        !           241:    the distance codes.  Subsequent tables are also less than or equal to
        !           242:    those sizes.  These values may be adjusted either when all of the
        !           243:    codes are shorter than that, in which case the longest code length in
        !           244:    bits is used, or when the shortest code is *longer* than the requested
        !           245:    table size, in which case the length of the shortest code in bits is
        !           246:    used.
        !           247: 
        !           248:    There are two different values for the two tables, since they code a
        !           249:    different number of possibilities each.  The literal/length table
        !           250:    codes 286 possible values, or in a flat code, a little over eight
        !           251:    bits.  The distance table codes 30 possible values, or a little less
        !           252:    than five bits, flat.  The optimum values for speed end up being
        !           253:    about one bit more than those, so lbits is 8+1 and dbits is 5+1.
        !           254:    The optimum values may differ though from machine to machine, and
        !           255:    possibly even between compilers.  Your mileage may vary.
        !           256:  */
        !           257: 
        !           258: 
        !           259: int lbits = 9;          /* bits in base literal/length lookup table */
        !           260: int dbits = 6;          /* bits in base distance lookup table */
        !           261: 
        !           262: 
        !           263: /* If BMAX needs to be larger than 16, then h and x[] should be ulg. */
        !           264: #define BMAX 16         /* maximum bit length of any code (16 for explode) */
        !           265: #define N_MAX 288       /* maximum number of codes in any set */
        !           266: 
        !           267: 
        !           268: unsigned hufts;         /* track memory usage */
        !           269: 
        !           270: 
        !           271: int huft_build(b, n, s, d, e, t, m)
        !           272: unsigned *b;            /* code lengths in bits (all assumed <= BMAX) */
        !           273: unsigned n;             /* number of codes (assumed <= N_MAX) */
        !           274: unsigned s;             /* number of simple-valued codes (0..s-1) */
        !           275: ush *d;                 /* list of base values for non-simple codes */
        !           276: ush *e;                 /* list of extra bits for non-simple codes */
        !           277: struct huft **t;        /* result: starting table */
        !           278: int *m;                 /* maximum lookup bits, returns actual */
        !           279: /* Given a list of code lengths and a maximum table size, make a set of
        !           280:    tables to decode that set of codes.  Return zero on success, one if
        !           281:    the given code set is incomplete (the tables are still built in this
        !           282:    case), two if the input is invalid (all zero length codes or an
        !           283:    oversubscribed set of lengths), and three if not enough memory. */
        !           284: {
        !           285:   unsigned a;                   /* counter for codes of length k */
        !           286:   unsigned c[BMAX+1];           /* bit length count table */
        !           287:   unsigned f;                   /* i repeats in table every f entries */
        !           288:   int g;                        /* maximum code length */
        !           289:   int h;                        /* table level */
        !           290:   register unsigned i;          /* counter, current code */
        !           291:   register unsigned j;          /* counter */
        !           292:   register int k;               /* number of bits in current code */
        !           293:   int l;                        /* bits per table (returned in m) */
        !           294:   register unsigned *p;         /* pointer into c[], b[], or v[] */
        !           295:   register struct huft *q;      /* points to current table */
        !           296:   struct huft r;                /* table entry for structure assignment */
        !           297:   struct huft *u[BMAX];         /* table stack */
        !           298:   unsigned v[N_MAX];            /* values in order of bit length */
        !           299:   register int w;               /* bits before this table == (l * h) */
        !           300:   unsigned x[BMAX+1];           /* bit offsets, then code stack */
        !           301:   unsigned *xp;                 /* pointer into x */
        !           302:   int y;                        /* number of dummy codes added */
        !           303:   unsigned z;                   /* number of entries in current table */
        !           304: 
        !           305: 
        !           306:   /* Generate counts for each bit length */
        !           307:   memzero(c, sizeof(c));
        !           308:   p = b;  i = n;
        !           309:   do {
        !           310:     Tracecv(*p, (stderr, (n-i >= ' ' && n-i <= '~' ? "%c %d\n" : "0x%x %d\n"), 
        !           311:            n-i, *p));
        !           312:     c[*p++]++;                  /* assume all entries <= BMAX */
        !           313:   } while (--i);
        !           314:   if (c[0] == n)                /* null input--all zero length codes */
        !           315:   {
        !           316:     *t = (struct huft *)NULL;
        !           317:     *m = 0;
        !           318:     return 0;
        !           319:   }
        !           320: 
        !           321: 
        !           322:   /* Find minimum and maximum length, bound *m by those */
        !           323:   l = *m;
        !           324:   for (j = 1; j <= BMAX; j++)
        !           325:     if (c[j])
        !           326:       break;
        !           327:   k = j;                        /* minimum code length */
        !           328:   if ((unsigned)l < j)
        !           329:     l = j;
        !           330:   for (i = BMAX; i; i--)
        !           331:     if (c[i])
        !           332:       break;
        !           333:   g = i;                        /* maximum code length */
        !           334:   if ((unsigned)l > i)
        !           335:     l = i;
        !           336:   *m = l;
        !           337: 
        !           338: 
        !           339:   /* Adjust last length count to fill out codes, if needed */
        !           340:   for (y = 1 << j; j < i; j++, y <<= 1)
        !           341:     if ((y -= c[j]) < 0)
        !           342:       return 2;                 /* bad input: more codes than bits */
        !           343:   if ((y -= c[i]) < 0)
        !           344:     return 2;
        !           345:   c[i] += y;
        !           346: 
        !           347: 
        !           348:   /* Generate starting offsets into the value table for each length */
        !           349:   x[1] = j = 0;
        !           350:   p = c + 1;  xp = x + 2;
        !           351:   while (--i) {                 /* note that i == g from above */
        !           352:     *xp++ = (j += *p++);
        !           353:   }
        !           354: 
        !           355: 
        !           356:   /* Make a table of values in order of bit lengths */
        !           357:   p = b;  i = 0;
        !           358:   do {
        !           359:     if ((j = *p++) != 0)
        !           360:       v[x[j]++] = i;
        !           361:   } while (++i < n);
        !           362: 
        !           363: 
        !           364:   /* Generate the Huffman codes and for each, make the table entries */
        !           365:   x[0] = i = 0;                 /* first Huffman code is zero */
        !           366:   p = v;                        /* grab values in bit order */
        !           367:   h = -1;                       /* no tables yet--level -1 */
        !           368:   w = -l;                       /* bits decoded == (l * h) */
        !           369:   u[0] = (struct huft *)NULL;   /* just to keep compilers happy */
        !           370:   q = (struct huft *)NULL;      /* ditto */
        !           371:   z = 0;                        /* ditto */
        !           372: 
        !           373:   /* go through the bit lengths (k already is bits in shortest code) */
        !           374:   for (; k <= g; k++)
        !           375:   {
        !           376:     a = c[k];
        !           377:     while (a--)
        !           378:     {
        !           379:       /* here i is the Huffman code of length k bits for value *p */
        !           380:       /* make tables up to required level */
        !           381:       while (k > w + l)
        !           382:       {
        !           383:         h++;
        !           384:         w += l;                 /* previous table always l bits */
        !           385: 
        !           386:         /* compute minimum size table less than or equal to l bits */
        !           387:         z = (z = g - w) > (unsigned)l ? l : z;  /* upper limit on table size */
        !           388:         if ((f = 1 << (j = k - w)) > a + 1)     /* try a k-w bit table */
        !           389:         {                       /* too few codes for k-w bit table */
        !           390:           f -= a + 1;           /* deduct codes from patterns left */
        !           391:           xp = c + k;
        !           392:           while (++j < z)       /* try smaller tables up to z bits */
        !           393:           {
        !           394:             if ((f <<= 1) <= *++xp)
        !           395:               break;            /* enough codes to use up j bits */
        !           396:             f -= *xp;           /* else deduct codes from patterns */
        !           397:           }
        !           398:         }
        !           399:         z = 1 << j;             /* table entries for j-bit table */
        !           400: 
        !           401:         /* allocate and link in new table */
        !           402:         if ((q = (struct huft *)malloc((z + 1)*sizeof(struct huft))) ==
        !           403:             (struct huft *)NULL)
        !           404:         {
        !           405:           if (h)
        !           406:             huft_free(u[0]);
        !           407:           return 3;             /* not enough memory */
        !           408:         }
        !           409:         hufts += z + 1;         /* track memory usage */
        !           410:         *t = q + 1;             /* link to list for huft_free() */
        !           411:         *(t = &(q->v.t)) = (struct huft *)NULL;
        !           412:         u[h] = ++q;             /* table starts after link */
        !           413: 
        !           414:         /* connect to last table, if there is one */
        !           415:         if (h)
        !           416:         {
        !           417:           x[h] = i;             /* save pattern for backing up */
        !           418:           r.b = (uch)l;         /* bits to dump before this table */
        !           419:           r.e = (uch)(16 + j);  /* bits in this table */
        !           420:           r.v.t = q;            /* pointer to this table */
        !           421:           j = i >> (w - l);     /* (get around Turbo C bug) */
        !           422:           u[h-1][j] = r;        /* connect to last table */
        !           423:         }
        !           424:       }
        !           425: 
        !           426:       /* set up table entry in r */
        !           427:       r.b = (uch)(k - w);
        !           428:       if (p >= v + n)
        !           429:         r.e = 99;               /* out of values--invalid code */
        !           430:       else if (*p < s)
        !           431:       {
        !           432:         r.e = (uch)(*p < 256 ? 16 : 15);    /* 256 is end-of-block code */
        !           433:         r.v.n = *p++;           /* simple code is just the value */
        !           434:       }
        !           435:       else
        !           436:       {
        !           437:         r.e = (uch)e[*p - s];   /* non-simple--look up in lists */
        !           438:         r.v.n = d[*p++ - s];
        !           439:       }
        !           440: 
        !           441:       /* fill code-like entries with r */
        !           442:       f = 1 << (k - w);
        !           443:       for (j = i >> w; j < z; j += f)
        !           444:         q[j] = r;
        !           445: 
        !           446:       /* backwards increment the k-bit code i */
        !           447:       for (j = 1 << (k - 1); i & j; j >>= 1)
        !           448:         i ^= j;
        !           449:       i ^= j;
        !           450: 
        !           451:       /* backup over finished tables */
        !           452:       while ((i & ((1 << w) - 1)) != x[h])
        !           453:       {
        !           454:         h--;                    /* don't need to update q */
        !           455:         w -= l;
        !           456:       }
        !           457:     }
        !           458:   }
        !           459: 
        !           460: 
        !           461:   /* Return true (1) if we were given an incomplete table */
        !           462:   return y != 0 && g != 1;
        !           463: }
        !           464: 
        !           465: 
        !           466: 
        !           467: int huft_free(t)
        !           468: struct huft *t;         /* table to free */
        !           469: /* Free the malloc'ed tables built by huft_build(), which makes a linked
        !           470:    list of the tables it made, with the links in a dummy first entry of
        !           471:    each table. */
        !           472: {
        !           473:   register struct huft *p, *q;
        !           474: 
        !           475: 
        !           476:   /* Go through linked list, freeing from the malloced (t[-1]) address. */
        !           477:   p = t;
        !           478:   while (p != (struct huft *)NULL)
        !           479:   {
        !           480:     q = (--p)->v.t;
        !           481:     free(p);
        !           482:     p = q;
        !           483:   } 
        !           484:   return 0;
        !           485: }
        !           486: 
        !           487: 
        !           488: int inflate_codes(tl, td, bl, bd)
        !           489: struct huft *tl, *td;   /* literal/length and distance decoder tables */
        !           490: int bl, bd;             /* number of bits decoded by tl[] and td[] */
        !           491: /* inflate (decompress) the codes in a deflated (compressed) block.
        !           492:    Return an error code or zero if it all goes ok. */
        !           493: {
        !           494:   register unsigned e;  /* table entry flag/number of extra bits */
        !           495:   unsigned n, d;        /* length and index for copy */
        !           496:   unsigned w;           /* current window position */
        !           497:   struct huft *t;       /* pointer to table entry */
        !           498:   unsigned ml, md;      /* masks for bl and bd bits */
        !           499:   register ulg b;       /* bit buffer */
        !           500:   register unsigned k;  /* number of bits in bit buffer */
        !           501: 
        !           502: 
        !           503:   /* make local copies of globals */
        !           504:   b = bb;                       /* initialize bit buffer */
        !           505:   k = bk;
        !           506:   w = wp;                       /* initialize window position */
        !           507: 
        !           508:   /* inflate the coded data */
        !           509:   ml = mask_bits[bl];           /* precompute masks for speed */
        !           510:   md = mask_bits[bd];
        !           511:   for (;;)                      /* do until end of block */
        !           512:   {
        !           513:     NEEDBITS((unsigned)bl)
        !           514:     if ((e = (t = tl + ((unsigned)b & ml))->e) > 16)
        !           515:       do {
        !           516:         if (e == 99)
        !           517:           return 1;
        !           518:         DUMPBITS(t->b)
        !           519:         e -= 16;
        !           520:         NEEDBITS(e)
        !           521:       } while ((e = (t = t->v.t + ((unsigned)b & mask_bits[e]))->e) > 16);
        !           522:     DUMPBITS(t->b)
        !           523:     if (e == 16)                /* then it's a literal */
        !           524:     {
        !           525:       slide[w++] = (uch)t->v.n;
        !           526:       Tracevv((stderr, "%c", slide[w-1]));
        !           527:       if (w == WSIZE)
        !           528:       {
        !           529:         flush_output(w);
        !           530:         w = 0;
        !           531:       }
        !           532:     }
        !           533:     else                        /* it's an EOB or a length */
        !           534:     {
        !           535:       /* exit if end of block */
        !           536:       if (e == 15)
        !           537:         break;
        !           538: 
        !           539:       /* get length of block to copy */
        !           540:       NEEDBITS(e)
        !           541:       n = t->v.n + ((unsigned)b & mask_bits[e]);
        !           542:       DUMPBITS(e);
        !           543: 
        !           544:       /* decode distance of block to copy */
        !           545:       NEEDBITS((unsigned)bd)
        !           546:       if ((e = (t = td + ((unsigned)b & md))->e) > 16)
        !           547:         do {
        !           548:           if (e == 99)
        !           549:             return 1;
        !           550:           DUMPBITS(t->b)
        !           551:           e -= 16;
        !           552:           NEEDBITS(e)
        !           553:         } while ((e = (t = t->v.t + ((unsigned)b & mask_bits[e]))->e) > 16);
        !           554:       DUMPBITS(t->b)
        !           555:       NEEDBITS(e)
        !           556:       d = w - t->v.n - ((unsigned)b & mask_bits[e]);
        !           557:       DUMPBITS(e)
        !           558:       Tracevv((stderr,"\\[%d,%d]", w-d, n));
        !           559: 
        !           560:       /* do the copy */
        !           561:       do {
        !           562:         n -= (e = (e = WSIZE - ((d &= WSIZE-1) > w ? d : w)) > n ? n : e);
        !           563: #if !defined(NOMEMCPY) && !defined(DEBUG)
        !           564:         if (w - d >= e)         /* (this test assumes unsigned comparison) */
        !           565:         {
        !           566:           memcpy(slide + w, slide + d, e);
        !           567:           w += e;
        !           568:           d += e;
        !           569:         }
        !           570:         else                      /* do it slow to avoid memcpy() overlap */
        !           571: #endif /* !NOMEMCPY */
        !           572:           do {
        !           573:             slide[w++] = slide[d++];
        !           574:            Tracevv((stderr, "%c", slide[w-1]));
        !           575:           } while (--e);
        !           576:         if (w == WSIZE)
        !           577:         {
        !           578:           flush_output(w);
        !           579:           w = 0;
        !           580:         }
        !           581:       } while (n);
        !           582:     }
        !           583:   }
        !           584: 
        !           585: 
        !           586:   /* restore the globals from the locals */
        !           587:   wp = w;                       /* restore global window pointer */
        !           588:   bb = b;                       /* restore global bit buffer */
        !           589:   bk = k;
        !           590: 
        !           591:   /* done */
        !           592:   return 0;
        !           593: }
        !           594: 
        !           595: 
        !           596: 
        !           597: int inflate_stored()
        !           598: /* "decompress" an inflated type 0 (stored) block. */
        !           599: {
        !           600:   unsigned n;           /* number of bytes in block */
        !           601:   unsigned w;           /* current window position */
        !           602:   register ulg b;       /* bit buffer */
        !           603:   register unsigned k;  /* number of bits in bit buffer */
        !           604: 
        !           605: 
        !           606:   /* make local copies of globals */
        !           607:   b = bb;                       /* initialize bit buffer */
        !           608:   k = bk;
        !           609:   w = wp;                       /* initialize window position */
        !           610: 
        !           611: 
        !           612:   /* go to byte boundary */
        !           613:   n = k & 7;
        !           614:   DUMPBITS(n);
        !           615: 
        !           616: 
        !           617:   /* get the length and its complement */
        !           618:   NEEDBITS(16)
        !           619:   n = ((unsigned)b & 0xffff);
        !           620:   DUMPBITS(16)
        !           621:   NEEDBITS(16)
        !           622:   if (n != (unsigned)((~b) & 0xffff))
        !           623:     return 1;                   /* error in compressed data */
        !           624:   DUMPBITS(16)
        !           625: 
        !           626: 
        !           627:   /* read and output the compressed data */
        !           628:   while (n--)
        !           629:   {
        !           630:     NEEDBITS(8)
        !           631:     slide[w++] = (uch)b;
        !           632:     if (w == WSIZE)
        !           633:     {
        !           634:       flush_output(w);
        !           635:       w = 0;
        !           636:     }
        !           637:     DUMPBITS(8)
        !           638:   }
        !           639: 
        !           640: 
        !           641:   /* restore the globals from the locals */
        !           642:   wp = w;                       /* restore global window pointer */
        !           643:   bb = b;                       /* restore global bit buffer */
        !           644:   bk = k;
        !           645:   return 0;
        !           646: }
        !           647: 
        !           648: 
        !           649: 
        !           650: int inflate_fixed()
        !           651: /* decompress an inflated type 1 (fixed Huffman codes) block.  We should
        !           652:    either replace this with a custom decoder, or at least precompute the
        !           653:    Huffman tables. */
        !           654: {
        !           655:   int i;                /* temporary variable */
        !           656:   struct huft *tl;      /* literal/length code table */
        !           657:   struct huft *td;      /* distance code table */
        !           658:   int bl;               /* lookup bits for tl */
        !           659:   int bd;               /* lookup bits for td */
        !           660:   unsigned l[288];      /* length list for huft_build */
        !           661: 
        !           662: 
        !           663:   /* set up literal table */
        !           664:   for (i = 0; i < 144; i++)
        !           665:     l[i] = 8;
        !           666:   for (; i < 256; i++)
        !           667:     l[i] = 9;
        !           668:   for (; i < 280; i++)
        !           669:     l[i] = 7;
        !           670:   for (; i < 288; i++)          /* make a complete, but wrong code set */
        !           671:     l[i] = 8;
        !           672:   bl = 7;
        !           673:   if ((i = huft_build(l, 288, 257, cplens, cplext, &tl, &bl)) != 0)
        !           674:     return i;
        !           675: 
        !           676: 
        !           677:   /* set up distance table */
        !           678:   for (i = 0; i < 30; i++)      /* make an incomplete code set */
        !           679:     l[i] = 5;
        !           680:   bd = 5;
        !           681:   if ((i = huft_build(l, 30, 0, cpdist, cpdext, &td, &bd)) > 1)
        !           682:   {
        !           683:     huft_free(tl);
        !           684:     return i;
        !           685:   }
        !           686: 
        !           687: 
        !           688:   /* decompress until an end-of-block code */
        !           689:   if (inflate_codes(tl, td, bl, bd))
        !           690:     return 1;
        !           691: 
        !           692: 
        !           693:   /* free the decoding tables, return */
        !           694:   huft_free(tl);
        !           695:   huft_free(td);
        !           696:   return 0;
        !           697: }
        !           698: 
        !           699: 
        !           700: 
        !           701: int inflate_dynamic()
        !           702: /* decompress an inflated type 2 (dynamic Huffman codes) block. */
        !           703: {
        !           704:   int i;                /* temporary variables */
        !           705:   unsigned j;
        !           706:   unsigned l;           /* last length */
        !           707:   unsigned m;           /* mask for bit lengths table */
        !           708:   unsigned n;           /* number of lengths to get */
        !           709:   struct huft *tl;      /* literal/length code table */
        !           710:   struct huft *td;      /* distance code table */
        !           711:   int bl;               /* lookup bits for tl */
        !           712:   int bd;               /* lookup bits for td */
        !           713:   unsigned nb;          /* number of bit length codes */
        !           714:   unsigned nl;          /* number of literal/length codes */
        !           715:   unsigned nd;          /* number of distance codes */
        !           716: #ifdef PKZIP_BUG_WORKAROUND
        !           717:   unsigned ll[288+32];  /* literal/length and distance code lengths */
        !           718: #else
        !           719:   unsigned ll[286+30];  /* literal/length and distance code lengths */
        !           720: #endif
        !           721:   register ulg b;       /* bit buffer */
        !           722:   register unsigned k;  /* number of bits in bit buffer */
        !           723: 
        !           724: 
        !           725:   /* make local bit buffer */
        !           726:   b = bb;
        !           727:   k = bk;
        !           728: 
        !           729: 
        !           730:   /* read in table lengths */
        !           731:   NEEDBITS(5)
        !           732:   nl = 257 + ((unsigned)b & 0x1f);      /* number of literal/length codes */
        !           733:   DUMPBITS(5)
        !           734:   NEEDBITS(5)
        !           735:   nd = 1 + ((unsigned)b & 0x1f);        /* number of distance codes */
        !           736:   DUMPBITS(5)
        !           737:   NEEDBITS(4)
        !           738:   nb = 4 + ((unsigned)b & 0xf);         /* number of bit length codes */
        !           739:   DUMPBITS(4)
        !           740: #ifdef PKZIP_BUG_WORKAROUND
        !           741:   if (nl > 288 || nd > 32)
        !           742: #else
        !           743:   if (nl > 286 || nd > 30)
        !           744: #endif
        !           745:     return 1;                   /* bad lengths */
        !           746: 
        !           747: 
        !           748:   /* read in bit-length-code lengths */
        !           749:   for (j = 0; j < nb; j++)
        !           750:   {
        !           751:     NEEDBITS(3)
        !           752:     ll[border[j]] = (unsigned)b & 7;
        !           753:     DUMPBITS(3)
        !           754:   }
        !           755:   for (; j < 19; j++)
        !           756:     ll[border[j]] = 0;
        !           757: 
        !           758: 
        !           759:   /* build decoding table for trees--single level, 7 bit lookup */
        !           760:   bl = 7;
        !           761:   if ((i = huft_build(ll, 19, 19, NULL, NULL, &tl, &bl)) != 0)
        !           762:   {
        !           763:     if (i == 1)
        !           764:       huft_free(tl);
        !           765:     return i;                   /* incomplete code set */
        !           766:   }
        !           767: 
        !           768: 
        !           769:   /* read in literal and distance code lengths */
        !           770:   n = nl + nd;
        !           771:   m = mask_bits[bl];
        !           772:   i = l = 0;
        !           773:   while ((unsigned)i < n)
        !           774:   {
        !           775:     NEEDBITS((unsigned)bl)
        !           776:     j = (td = tl + ((unsigned)b & m))->b;
        !           777:     DUMPBITS(j)
        !           778:     j = td->v.n;
        !           779:     if (j < 16)                 /* length of code in bits (0..15) */
        !           780:       ll[i++] = l = j;          /* save last length in l */
        !           781:     else if (j == 16)           /* repeat last length 3 to 6 times */
        !           782:     {
        !           783:       NEEDBITS(2)
        !           784:       j = 3 + ((unsigned)b & 3);
        !           785:       DUMPBITS(2)
        !           786:       if ((unsigned)i + j > n)
        !           787:         return 1;
        !           788:       while (j--)
        !           789:         ll[i++] = l;
        !           790:     }
        !           791:     else if (j == 17)           /* 3 to 10 zero length codes */
        !           792:     {
        !           793:       NEEDBITS(3)
        !           794:       j = 3 + ((unsigned)b & 7);
        !           795:       DUMPBITS(3)
        !           796:       if ((unsigned)i + j > n)
        !           797:         return 1;
        !           798:       while (j--)
        !           799:         ll[i++] = 0;
        !           800:       l = 0;
        !           801:     }
        !           802:     else                        /* j == 18: 11 to 138 zero length codes */
        !           803:     {
        !           804:       NEEDBITS(7)
        !           805:       j = 11 + ((unsigned)b & 0x7f);
        !           806:       DUMPBITS(7)
        !           807:       if ((unsigned)i + j > n)
        !           808:         return 1;
        !           809:       while (j--)
        !           810:         ll[i++] = 0;
        !           811:       l = 0;
        !           812:     }
        !           813:   }
        !           814: 
        !           815: 
        !           816:   /* free decoding table for trees */
        !           817:   huft_free(tl);
        !           818: 
        !           819: 
        !           820:   /* restore the global bit buffer */
        !           821:   bb = b;
        !           822:   bk = k;
        !           823: 
        !           824: 
        !           825:   /* build the decoding tables for literal/length and distance codes */
        !           826:   bl = lbits;
        !           827:   if ((i = huft_build(ll, nl, 257, cplens, cplext, &tl, &bl)) != 0)
        !           828:   {
        !           829:     if (i == 1) {
        !           830:       fprintf(stderr, " incomplete literal tree\n");
        !           831:       huft_free(tl);
        !           832:     }
        !           833:     return i;                   /* incomplete code set */
        !           834:   }
        !           835:   bd = dbits;
        !           836:   if ((i = huft_build(ll + nl, nd, 0, cpdist, cpdext, &td, &bd)) != 0)
        !           837:   {
        !           838:     if (i == 1) {
        !           839:       fprintf(stderr, " incomplete distance tree\n");
        !           840: #ifdef PKZIP_BUG_WORKAROUND
        !           841:       i = 0;
        !           842:     }
        !           843: #else
        !           844:       huft_free(td);
        !           845:     }
        !           846:     huft_free(tl);
        !           847:     return i;                   /* incomplete code set */
        !           848: #endif
        !           849:   }
        !           850: 
        !           851: 
        !           852:   /* decompress until an end-of-block code */
        !           853:   if (inflate_codes(tl, td, bl, bd))
        !           854:     return 1;
        !           855: 
        !           856: 
        !           857:   /* free the decoding tables, return */
        !           858:   huft_free(tl);
        !           859:   huft_free(td);
        !           860:   return 0;
        !           861: }
        !           862: 
        !           863: 
        !           864: 
        !           865: int inflate_block(e)
        !           866: int *e;                 /* last block flag */
        !           867: /* decompress an inflated block */
        !           868: {
        !           869:   unsigned t;           /* block type */
        !           870:   register ulg b;       /* bit buffer */
        !           871:   register unsigned k;  /* number of bits in bit buffer */
        !           872: 
        !           873: 
        !           874:   /* make local bit buffer */
        !           875:   b = bb;
        !           876:   k = bk;
        !           877: 
        !           878: 
        !           879:   /* read in last block bit */
        !           880:   NEEDBITS(1)
        !           881:   *e = (int)b & 1;
        !           882:   DUMPBITS(1)
        !           883: 
        !           884: 
        !           885:   /* read in block type */
        !           886:   NEEDBITS(2)
        !           887:   t = (unsigned)b & 3;
        !           888:   DUMPBITS(2)
        !           889: 
        !           890: 
        !           891:   /* restore the global bit buffer */
        !           892:   bb = b;
        !           893:   bk = k;
        !           894: 
        !           895: 
        !           896:   /* inflate that block type */
        !           897:   if (t == 2)
        !           898:     return inflate_dynamic();
        !           899:   if (t == 0)
        !           900:     return inflate_stored();
        !           901:   if (t == 1)
        !           902:     return inflate_fixed();
        !           903: 
        !           904: 
        !           905:   /* bad block type */
        !           906:   return 2;
        !           907: }
        !           908: 
        !           909: 
        !           910: 
        !           911: int inflate()
        !           912: /* decompress an inflated entry */
        !           913: {
        !           914:   int e;                /* last block flag */
        !           915:   int r;                /* result code */
        !           916:   unsigned h;           /* maximum struct huft's malloc'ed */
        !           917: 
        !           918: 
        !           919:   /* initialize window, bit buffer */
        !           920:   wp = 0;
        !           921:   bk = 0;
        !           922:   bb = 0;
        !           923: 
        !           924: 
        !           925:   /* decompress until the last block */
        !           926:   h = 0;
        !           927:   do {
        !           928:     hufts = 0;
        !           929:     if ((r = inflate_block(&e)) != 0)
        !           930:       return r;
        !           931:     if (hufts > h)
        !           932:       h = hufts;
        !           933:   } while (!e);
        !           934: 
        !           935:   /* Undo too much lookahead. The next read will be byte aligned so we
        !           936:    * can discard unused bits in the last meaningful byte.
        !           937:    */
        !           938:   while (bk >= 8) {
        !           939:     bk -= 8;
        !           940:     inptr--;
        !           941:   }
        !           942: 
        !           943:   /* flush out slide */
        !           944:   flush_output(wp);
        !           945: 
        !           946: 
        !           947:   /* return success */
        !           948: #ifdef DEBUG
        !           949:   fprintf(stderr, "<%u> ", h);
        !           950: #endif /* DEBUG */
        !           951:   return 0;
        !           952: }
unix.superglobalmegacorp.com
This archive runs on limited infrastructure. Preserving old code on modern bandwidth. Automated agents are requested to crawl responsibly.