walkBeta2
A powerful recursive tree-walking utility.
| param | type [= default] | description | 
|---|---|---|
| core | string = 'default' | The plugin core identifier to use for plugin injection | 
| fs [deprecated] | FileSystem | The filesystem containing the git repo. Overrides the fs provided by the plugin system. | 
| dir | string | The working tree directory path | 
| gitdir | string = join(dir,'.git') | The git directory path | 
| trees | Array<Walker> | The trees you want to traverse | 
| map | function(string, ?WalkerEntry[]): Promise<any> | Transform WalkerEntrys into a result form | 
| reduce | function(any, any[]): Promise<any> | Control how mapped entries are combined with their parent result | 
| iterate | function(function(WalkerEntry[]): Promise<any[]>, IterableIterator<WalkerEntry[]>): Promise<any[]> | Fine-tune how entries within a tree are iterated over | 
| return | Promise<any> | The finished tree-walking result | 
The walk API (tentatively named walkBeta2) simplifies gathering detailed information about a tree or comparing all the filepaths in two or more trees.
Trees can be git commits, the working directory, or the or git index (staging area).
As long as a file or directory is present in at least one of the trees, it will be traversed.
Entries are traversed in alphabetical order.
The arguments to walk are the trees you want to traverse, and 3 optional transform functions:
map, reduce, and iterate.
TREE, WORKDIR, and STAGE
Tree walkers are represented by three separate functions that can be imported:
import { TREE, WORKDIR, STAGE } from 'isomorphic-git'
These functions return opaque handles called Walkers.
The only thing that Walker objects are good for is passing into walkBeta2.
Here are the three Walkers passed into walkBeta2 by the statusMatrix command for example:
let ref = 'HEAD'
let trees = [TREE({ ref }), WORKDIR(), STAGE()]
For the arguments, see the doc pages for TREE, WORKDIR, and STAGE.
map, reduce, and iterate allow you control the recursive walk by pruning and transforming WalkerEntrys into the desired result.
WalkerEntry
The WalkerEntry is an interface that abstracts computing many common tree / blob stats.
type WalkerEntry = {
  type: function(): Promise<'tree'|'blob'|'special'|'commit'>;
  mode: function(): Promise<number>;
  oid: function(): Promise<string>;
  content: function(): Promise<Buffer>;
  stat: function(): Promise<Stat>;
}
map receives an array of WalkerEntry[] as its main argument, one WalkerEntry for each Walker in the trees argument.
The methods are memoized per WalkerEntry so calling them multiple times in a map function does not adversely impact performance.
By only computing these values if needed, you build can build lean, mean, efficient walking machines.
WalkerEntry#type()
Returns the kind as a string. This is normally either tree or blob.
TREE, STAGE, and WORKDIR walkers all return a string.
Possible values:
- 'tree'directory
- 'blob'file
- 'special'used by- WORKDIRto represent irregular files like sockets and FIFOs
- 'commit'used by- TREEto represent submodules
await entry.type()
WalkerEntry#mode()
Returns the file mode as a number. Use this to distinguish between regular files, symlinks, and executable files.
TREE, STAGE, and WORKDIR walkers all return a number for all types of entries.
It has been normalized to one of the 4 values that are allowed in git commits:
- 0o40000directory
- 0o100644file
- 0o100755file (executable)
- 0o120000symlink
Tip: to make modes more readable, you can print them to octal using .toString(8).
await entry.mode()
WalkerEntry#oid()
Returns the SHA-1 object id for blobs and trees.
TREE walkers return a string for blob and tree entries.
STAGE and WORKDIR walkers return a string for blob entries and undefined for tree entries.
await entry.oid()
WalkerEntry#content()
Returns the file contents as a Buffer.
TREE and WORKDIR walkers return a Buffer for blob entries and undefined for tree entries.
STAGE walkers always return undefined since the file contents are never stored in the stage.
await entry.content()
WalkerEntry#stat()
Returns a normalized subset of filesystem Stat data.
WORKDIR walkers return a Stat for blob and tree entries.
STAGE walkers return a Stat for blob entries and undefined for tree entries.
TREE walkers return undefined for all entry types.
await entry.stat()
Normalized subset of filesystem stat data:
type Stat = {
  ctimeSeconds: number;
  ctimeNanoseconds: number;
  mtimeSeconds: number;
  mtimeNanoseconds: number;
  dev: number;
  ino: number;
  mode: number;
  uid: number;
  gid: number;
  size: number;
}
map(string, Array<WalkerEntry|null>) => Promise
This is the function that is called once per entry BEFORE visiting the children of that node.
If you return null for a tree entry, then none of the children of that tree entry will be walked.
This is a good place for query logic, such as examining the contents of a file. Ultimately, compare all the entries and return any values you are interested in. If you do not return a value (or return undefined) that entry will be filtered from the results.
Example 1: Find all the files containing the word 'foo'.
async function map(filepath, [head, workdir]) {
  let content = (await workdir.content()).toString('utf8')
  if (content.contains('foo')) {
    return {
      filepath,
      content
    }
  }
}
Example 2: Return the difference between the working directory and the HEAD commit
const diff = require('diff-lines')
async function map(filepath, [head, workdir]) {
  return {
    filepath,
    oid: await head.oid(),
    diff: diff((await head.content()).toString('utf8'), (await workdir.content()).toString('utf8'))
  }
}
Example 3:
let path = require('path')
// Only examine files in the directory `cwd`
let cwd = 'src/app'
async function map (filepath, [head, workdir, stage]) {
  if (
    // don't skip the root directory
    head.fullpath !== '.' &&
    // return true for 'src' and 'src/app'
    !cwd.startsWith(filepath) &&
    // return true for 'src/app/*'
    path.dirname(filepath) !== cwd
  ) {
    return null
  } else {
    return filepath
  }
}
reduce(parent, children)
This is the function that is called once per entry AFTER visiting the children of that node.
Default: async (parent, children) => parent === undefined ? children.flat() : [parent, children].flat()
The default implementation of this function returns all directories and children in a giant flat array. You can define a different accumulation method though.
Example: Return a hierarchical structure
async function reduce (parent, children) {
  return Object.assign(parent, { children })
}
iterate(walk, children)
Default: (walk, children) => Promise.all([...children].map(walk))
The default implementation recurses all children concurrently using Promise.all. However you could use a custom function to traverse children serially or use a global queue to throttle recursion.