Class DirectoryScanner
- All Implemented Interfaces:
Scanner
Class for scanning a directory for files/directories which match certain criteria.
These criteria consist of selectors and patterns which have been specified. With the selectors you can select which files you want to have included. Files which are not selected are excluded. With patterns you can include or exclude files based on their filename.
The idea is simple. A given directory is recursively scanned for all files and directories. Each file/directory is matched against a set of selectors, including special support for matching against filenames with include and and exclude patterns. Only files/directories which match at least one pattern of the include pattern list or other file selector, and don't match any pattern of the exclude pattern list or fail to match against a required selector will be placed in the list of files/directories found.
When no list of include patterns is supplied, "**" will be used, which means that everything will be matched. When no list of exclude patterns is supplied, an empty list is used, such that nothing will be excluded. When no selectors are supplied, none are applied.
The filename pattern matching is done as follows: The name to be matched is split up in path segments. A path segment
is the name of a directory or file, which is bounded by File.separator
('/' under UNIX, '\' under
Windows). For example, "abc/def/ghi/xyz.java" is split up in the segments "abc", "def","ghi" and "xyz.java". The same
is done for the pattern against which should be matched.
The segments of the name and the pattern are then matched against each other. When '**' is used for a path segment in the pattern, it matches zero or more path segments of the name.
There is a special case regarding the use of File.separator
s at the beginning of the pattern and the
string to match:
When a pattern starts with a File.separator
, the string to match must also start with a
File.separator
. When a pattern does not start with a File.separator
, the string to match
may not start with a File.separator
. When one of these rules is not obeyed, the string will not match.
When a name path segment is matched against a pattern path segment, the following special characters can be used:
'*' matches zero or more characters
'?' matches one character.
- "**\*.class" matches all .class files/dirs in a directory tree.
- "test\a??.java" matches all files/dirs which start with an 'a', then two more characters and then ".java", in a directory called test.
- "**" matches everything in a directory tree.
- "**\test\**\XYZ*" matches all files/dirs which start with "XYZ" and where there is a parent directory called test (e.g. "abc\test\def\ghi\XYZ123").
Case sensitivity may be turned off if necessary. By default, it is turned on.
Example of usage:String[] includes = { "**\\*.class" }; String[] excludes = { "modules\\*\\**" }; ds.setIncludes( includes ); ds.setExcludes( excludes ); ds.setBasedir( new File( "test" ) ); ds.setCaseSensitive( true ); ds.scan(); System.out.println( "FILES:" ); String[] files = ds.getIncludedFiles(); for ( int i = 0; i < files.length; i++ ) { System.out.println( files[i] ); }
This will scan a directory called test for .class files, but excludes all files in all proper subdirectories of a directory called "modules"
- Author:
- Arnout J. Kuiper ajkuiper@wxs.nl, Magesh Umasankar, Bruce Atherton, Antoine Levy-Lambert
-
Field Summary
Modifier and TypeFieldDescriptionprotected File
The base directory to be scanned.The directories which matched at least one include and no excludes but which a selector discarded.The directories which matched at least one include and at least one exclude.The directories which matched at least one include and no excludes and were selected.The directories which were found and did not match any includes.protected boolean
Whether or not everything tested so far has been included.The files which matched at least one include and no excludes and which a selector discarded.The files which matched at least one include and at least one exclude.The files which matched at least one include and no excludes and were selected.The files which did not match any includes or selectors.protected boolean
Whether or not our results were built by a slow scan.Fields inherited from class org.codehaus.plexus.util.AbstractScanner
DEFAULTEXCLUDES, excludes, filenameComparator, includes, isCaseSensitive
-
Constructor Summary
-
Method Summary
Modifier and TypeMethodDescriptionReturns the base directory to be scanned.String[]
Returns the names of the directories which were selected out and therefore not ultimately included.String[]
Returns the names of the files which were selected out and therefore not ultimately included.String[]
Returns the names of the directories which matched at least one of the include patterns and at least one of the exclude patterns.String[]
Returns the names of the files which matched at least one of the include patterns and at least one of the exclude patterns.String[]
Returns the names of the directories which matched at least one of the include patterns and none of the exclude patterns.String[]
Returns the names of the files which matched at least one of the include patterns and none of the exclude patterns.String[]
Returns the names of the directories which matched none of the include patterns.String[]
Returns the names of the files which matched none of the include patterns.boolean
Returns whether or not the scanner has included all the files or directories it has come across so far.boolean
isParentSymbolicLink
(File parent, String name) Checks whether the parent of this file is a symbolic link.protected boolean
isSelected
(String name, File file) Tests whether a name should be selected.boolean
isSymbolicLink
(File parent, String name) Checks whether a given file is a symbolic link.void
scan()
Scans the base directory for files which match at least one include pattern and don't match any exclude patterns.protected void
Scans the given directory for files and directories.void
setBasedir
(File basedir) Sets the base directory to be scanned.void
setBasedir
(String basedir) Sets the base directory to be scanned.void
setFollowSymlinks
(boolean followSymlinks) Sets whether or not symbolic links should be followed.protected void
slowScan()
Top level invocation for a slow scan.Methods inherited from class org.codehaus.plexus.util.AbstractScanner
addDefaultExcludes, couldHoldIncluded, isExcluded, isExcluded, isExcluded, isIncluded, isIncluded, isIncluded, match, match, matchPath, matchPath, matchPatternStart, matchPatternStart, setCaseSensitive, setExcludes, setFilenameComparator, setIncludes, setupDefaultFilters, setupMatchPatterns
-
Field Details
-
basedir
The base directory to be scanned. -
filesIncluded
The files which matched at least one include and no excludes and were selected. -
filesNotIncluded
The files which did not match any includes or selectors. -
filesExcluded
The files which matched at least one include and at least one exclude. -
dirsIncluded
The directories which matched at least one include and no excludes and were selected. -
dirsNotIncluded
The directories which were found and did not match any includes. -
dirsExcluded
The directories which matched at least one include and at least one exclude. -
filesDeselected
The files which matched at least one include and no excludes and which a selector discarded. -
dirsDeselected
The directories which matched at least one include and no excludes but which a selector discarded. -
haveSlowResults
protected boolean haveSlowResultsWhether or not our results were built by a slow scan. -
everythingIncluded
protected boolean everythingIncludedWhether or not everything tested so far has been included.
-
-
Constructor Details
-
DirectoryScanner
public DirectoryScanner()Sole constructor.
-
-
Method Details
-
setBasedir
Sets the base directory to be scanned. This is the directory which is scanned recursively. All '/' and '\' characters are replaced byFile.separatorChar
, so the separator used need not matchFile.separatorChar
.- Parameters:
basedir
- The base directory to scan. Must not benull
.
-
setBasedir
Sets the base directory to be scanned. This is the directory which is scanned recursively.- Parameters:
basedir
- The base directory for scanning. Should not benull
.
-
getBasedir
Returns the base directory to be scanned. This is the directory which is scanned recursively.- Returns:
- the base directory to be scanned
-
setFollowSymlinks
public void setFollowSymlinks(boolean followSymlinks) Sets whether or not symbolic links should be followed.- Parameters:
followSymlinks
- whether or not symbolic links should be followed
-
isEverythingIncluded
public boolean isEverythingIncluded()Returns whether or not the scanner has included all the files or directories it has come across so far.- Returns:
true
if all files and directories which have been found so far have been included.
-
scan
Scans the base directory for files which match at least one include pattern and don't match any exclude patterns. If there are selectors then the files must pass muster there, as well.- Throws:
IllegalStateException
- if the base directory was set incorrectly (i.e. if it isnull
, doesn't exist, or isn't a directory).
-
slowScan
protected void slowScan()Top level invocation for a slow scan. A slow scan builds up a full list of excluded/included files/directories, whereas a fast scan will only have full results for included files, as it ignores directories which can't possibly hold any included files/directories.
Returns immediately if a slow scan has already been completed.
-
scandir
Scans the given directory for files and directories. Found files and directories are placed in their respective collections, based on the matching of includes, excludes, and the selectors. When a directory is found, it is scanned recursively.- Parameters:
dir
- The directory to scan. Must not benull
.vpath
- The path relative to the base directory (needed to prevent problems with an absolute path when using dir). Must not benull
.fast
- Whether or not this call is part of a fast scan.- See Also:
-
isSelected
Tests whether a name should be selected.- Parameters:
name
- the filename to check for selectingfile
- the java.io.File object for this filename- Returns:
false
when the selectors says that the file should not be selected,true
otherwise.
-
getIncludedFiles
Returns the names of the files which matched at least one of the include patterns and none of the exclude patterns. The names are relative to the base directory.- Returns:
- the names of the files which matched at least one of the include patterns and none of the exclude patterns.
-
getNotIncludedFiles
Returns the names of the files which matched none of the include patterns. The names are relative to the base directory. This involves performing a slow scan if one has not already been completed.- Returns:
- the names of the files which matched none of the include patterns.
- See Also:
-
getExcludedFiles
Returns the names of the files which matched at least one of the include patterns and at least one of the exclude patterns. The names are relative to the base directory. This involves performing a slow scan if one has not already been completed.- Returns:
- the names of the files which matched at least one of the include patterns and at at least one of the exclude patterns.
- See Also:
-
getDeselectedFiles
Returns the names of the files which were selected out and therefore not ultimately included.
The names are relative to the base directory. This involves performing a slow scan if one has not already been completed.
- Returns:
- the names of the files which were deselected.
- See Also:
-
getIncludedDirectories
Returns the names of the directories which matched at least one of the include patterns and none of the exclude patterns. The names are relative to the base directory.- Returns:
- the names of the directories which matched at least one of the include patterns and none of the exclude patterns.
-
getNotIncludedDirectories
Returns the names of the directories which matched none of the include patterns. The names are relative to the base directory. This involves performing a slow scan if one has not already been completed.- Returns:
- the names of the directories which matched none of the include patterns.
- See Also:
-
getExcludedDirectories
Returns the names of the directories which matched at least one of the include patterns and at least one of the exclude patterns. The names are relative to the base directory. This involves performing a slow scan if one has not already been completed.- Returns:
- the names of the directories which matched at least one of the include patterns and at least one of the exclude patterns.
- See Also:
-
getDeselectedDirectories
Returns the names of the directories which were selected out and therefore not ultimately included.
The names are relative to the base directory. This involves performing a slow scan if one has not already been completed.
- Returns:
- the names of the directories which were deselected.
- See Also:
-
isSymbolicLink
Checks whether a given file is a symbolic link.
It doesn't really test for symbolic links but whether the canonical and absolute paths of the file are identical - this may lead to false positives on some platforms.
- Parameters:
parent
- the parent directory of the file to testname
- the name of the file to test.- Returns:
- true if it's a symbolic link
- Throws:
IOException
- .- Since:
- Ant 1.5
-
isParentSymbolicLink
Checks whether the parent of this file is a symbolic link.
For java versions prior to 7 It doesn't really test for symbolic links but whether the canonical and absolute paths of the file are identical - this may lead to false positives on some platforms.
- Parameters:
parent
- the parent directory of the file to testname
- the name of the file to test.- Returns:
- true if it's a symbolic link
- Throws:
IOException
- .- Since:
- Ant 1.5
-