walkBeta2
A powerful recursive tree-walking utility.
param | type [= default] | description |
---|---|---|
core | string = 'default' | The plugin core identifier to use for plugin injection |
fs [deprecated] | FileSystem | The filesystem containing the git repo. Overrides the fs provided by the plugin system. |
dir | string | The working tree directory path |
gitdir | string = join(dir,'.git') | The git directory path |
trees | Array<Walker> | The trees you want to traverse |
map | function(string, ?WalkerEntry[]): Promise<any> | Transform WalkerEntry s into a result form |
reduce | function(any, any[]): Promise<any> | Control how mapped entries are combined with their parent result |
iterate | function(function(WalkerEntry[]): Promise<any[]>, IterableIterator<WalkerEntry[]>): Promise<any[]> | Fine-tune how entries within a tree are iterated over |
return | Promise<any> | The finished tree-walking result |
The walk
API (tentatively named walkBeta2
) simplifies gathering detailed information about a tree or comparing all the filepaths in two or more trees.
Trees can be git commits, the working directory, or the or git index (staging area).
As long as a file or directory is present in at least one of the trees, it will be traversed.
Entries are traversed in alphabetical order.
The arguments to walk
are the trees
you want to traverse, and 3 optional transform functions:
map
, reduce
, and iterate
.
TREE
, WORKDIR
, and STAGE
Tree walkers are represented by three separate functions that can be imported:
import { TREE, WORKDIR, STAGE } from 'isomorphic-git'
These functions return opaque handles called Walker
s.
The only thing that Walker
objects are good for is passing into walkBeta2
.
Here are the three Walker
s passed into walkBeta2
by the statusMatrix
command for example:
let ref = 'HEAD'
let trees = [TREE({ ref }), WORKDIR(), STAGE()]
For the arguments, see the doc pages for TREE, WORKDIR, and STAGE.
map
, reduce
, and iterate
allow you control the recursive walk by pruning and transforming WalkerEntry
s into the desired result.
WalkerEntry
The WalkerEntry
is an interface that abstracts computing many common tree / blob stats.
type WalkerEntry = {
type: function(): Promise<'tree'|'blob'|'special'|'commit'>;
mode: function(): Promise<number>;
oid: function(): Promise<string>;
content: function(): Promise<Buffer>;
stat: function(): Promise<Stat>;
}
map
receives an array of WalkerEntry[]
as its main argument, one WalkerEntry
for each Walker
in the trees
argument.
The methods are memoized per WalkerEntry
so calling them multiple times in a map
function does not adversely impact performance.
By only computing these values if needed, you build can build lean, mean, efficient walking machines.
WalkerEntry#type()
Returns the kind as a string. This is normally either tree
or blob
.
TREE
, STAGE
, and WORKDIR
walkers all return a string.
Possible values:
'tree'
directory'blob'
file'special'
used byWORKDIR
to represent irregular files like sockets and FIFOs'commit'
used byTREE
to represent submodules
await entry.type()
WalkerEntry#mode()
Returns the file mode as a number. Use this to distinguish between regular files, symlinks, and executable files.
TREE
, STAGE
, and WORKDIR
walkers all return a number for all type
s of entries.
It has been normalized to one of the 4 values that are allowed in git commits:
0o40000
directory0o100644
file0o100755
file (executable)0o120000
symlink
Tip: to make modes more readable, you can print them to octal using .toString(8)
.
await entry.mode()
WalkerEntry#oid()
Returns the SHA-1 object id for blobs and trees.
TREE
walkers return a string for blob
and tree
entries.
STAGE
and WORKDIR
walkers return a string for blob
entries and undefined
for tree
entries.
await entry.oid()
WalkerEntry#content()
Returns the file contents as a Buffer.
TREE
and WORKDIR
walkers return a Buffer for blob
entries and undefined
for tree
entries.
STAGE
walkers always return undefined
since the file contents are never stored in the stage.
await entry.content()
WalkerEntry#stat()
Returns a normalized subset of filesystem Stat data.
WORKDIR
walkers return a Stat
for blob
and tree
entries.
STAGE
walkers return a Stat
for blob
entries and undefined
for tree
entries.
TREE
walkers return undefined
for all entry types.
await entry.stat()
Normalized subset of filesystem stat
data:
type Stat = {
ctimeSeconds: number;
ctimeNanoseconds: number;
mtimeSeconds: number;
mtimeNanoseconds: number;
dev: number;
ino: number;
mode: number;
uid: number;
gid: number;
size: number;
}
This is the function that is called once per entry BEFORE visiting the children of that node.
If you return null
for a tree
entry, then none of the children of that tree
entry will be walked.
This is a good place for query logic, such as examining the contents of a file. Ultimately, compare all the entries and return any values you are interested in. If you do not return a value (or return undefined) that entry will be filtered from the results.
Example 1: Find all the files containing the word 'foo'.
async function map(filepath, [head, workdir]) {
let content = (await workdir.content()).toString('utf8')
if (content.contains('foo')) {
return {
filepath,
content
}
}
}
Example 2: Return the difference between the working directory and the HEAD commit
const diff = require('diff-lines')
async function map(filepath, [head, workdir]) {
return {
filepath,
oid: await head.oid(),
diff: diff((await head.content()).toString('utf8'), (await workdir.content()).toString('utf8'))
}
}
Example 3:
let path = require('path')
// Only examine files in the directory `cwd`
let cwd = 'src/app'
async function map (filepath, [head, workdir, stage]) {
if (
// don't skip the root directory
head.fullpath !== '.' &&
// return true for 'src' and 'src/app'
!cwd.startsWith(filepath) &&
// return true for 'src/app/*'
path.dirname(filepath) !== cwd
) {
return null
} else {
return filepath
}
}
reduce(parent, children)
This is the function that is called once per entry AFTER visiting the children of that node.
Default: async (parent, children) => parent === undefined ? children.flat() : [parent, children].flat()
The default implementation of this function returns all directories and children in a giant flat array. You can define a different accumulation method though.
Example: Return a hierarchical structure
async function reduce (parent, children) {
return Object.assign(parent, { children })
}
iterate(walk, children)
Default: (walk, children) => Promise.all([...children].map(walk))
The default implementation recurses all children concurrently using Promise.all. However you could use a custom function to traverse children serially or use a global queue to throttle recursion.