Overview docs for analysis nbsearch

author Teemu Murtola <teemu.murtola@gmail.com>

Mon, 25 May 2015 05:49:16 +0000 (08:49 +0300)

committer Teemu Murtola <teemu.murtola@gmail.com>

Mon, 25 May 2015 18:23:36 +0000 (21:23 +0300)
author Teemu Murtola <teemu.murtola@gmail.com>
Mon, 25 May 2015 05:49:16 +0000 (08:49 +0300)
committer Teemu Murtola <teemu.murtola@gmail.com>
Mon, 25 May 2015 18:23:36 +0000 (21:23 +0300)
diff --git a/docs/doxygen/user/analysisframework.md b/docs/doxygen/user/analysisframework.md

index 21586afdaba22e9d8a36fed90a8b8c0329958589..29358489b9fe24997f1ed49b595f3a7e1a689e22 100644 (file)
--- a/docs/doxygen/user/analysisframework.md
+++ b/docs/doxygen/user/analysisframework.md
@@ -32,6 +32,10 @@ out of the framework.  The main features are:
     such language bindings in the framework (no such integration is implemented
     at this time, though).
  
+There are also some reusable analysis routines that can be used independent of
+the framework:
+ - \subpage page_analysisnbsearch
+
  For a crash course on how to implement an analysis tool using the framework, see
  \subpage page_analysistemplate.
  
diff --git a/docs/doxygen/user/analysisnbsearch.md b/docs/doxygen/user/analysisnbsearch.md

new file mode 100644 (file)

index 0000000..a8b2bf5
--- /dev/null
+++ b/docs/doxygen/user/analysisnbsearch.md
@@ -0,0 +1,113 @@
+Neighborhood search for analysis tools {#page_analysisnbsearch}
+======================================
+
+The header nbsearch.h declares a C++ interface to a relatively flexible and
+efficient neighborhood search.  It is currently implemented within the
+selection module where it originated, but it does not have any dependencies on
+the other selection code and can be easily split out in the future.
+
+The emphasis is on flexibility and ease of use; one main driver is to have
+one common implementation of grid-based searching to avoid replicating this in
+multiple tools (and to make more tools take advantage of the significant
+performance improvement this allows).  The main features that it provides:
+
+ - Grid-based searching with any triclinic box shape that \Gromacs supports
+   (i.e., a triangular box matrix and not too skewed).
+ - Grid-based searching with all PBC options except for screw boundary
+   conditions.
+ - With no PBC, grid-based searching where the grid is constructed based on the
+   bounding box of the gridded atoms.
+ - Efficient, rectangular grid cells whose size is determined by particle
+   density and not limited by the cutoff.
+ - Transparent fallback to a simple all-pairs search if the cutoff is too long
+   for the algorithm or grid searching is not otherwise supported.
+ - Support for computing all distances in the XY plane only (and still
+   grid-based).
+ - Convenience functions for finding the shortest distance or the nearest pair
+   between two sets of positions.
+ - Basic support for exclusions.
+ - Thread-safe handling of multiple concurrent searches with the same cutoff
+   with the same or different reference positions.
+
+Usage
+=====
+
+The neighborhood search works conceptually with two different sets of
+coordinates:
+
+ - _reference positions_: When initiating the search, you provide one set of
+   reference positions that get placed on the search grid and determine the
+   size of the grid.
+ - _test positions_: For each set of reference positions, you provide a set of
+   test positions (or a single position).  The search is performed from each
+   test position, finding the reference positions within the cutoff from this
+   point.  It is possible to perform multiple searches against the same set of
+   reference positions (and the same grid).
+
+To start using the neighborhood search, you need to first create an instance of
+gmx::AnalysisNeighborhood.  This class allows you to set some global properties
+for the search (most notably, the cutoff distance).  Then you provide the
+reference positions as a gmx::AnalysisNeighborhoodPositions and PBC information
+to get a gmx::AnalysisNeighborhoodSearch instance.  You can then either use
+methods directly in this class to find, e.g., the nearest reference point from
+a test position, or you can do a full pair search that returns you all the
+reference-test pairs within a cutoff.  The pair search is performed using an
+instance of gmx::AnalysisNeighborhoodPairSearch that the search object returns.
+Methods that return information about pairs return an instance of
+gmx::AnalysisNeighborhoodPair, which can be used to access the indices of
+the reference and test positions in the pair, as well as the computed distance.
+See the class documentation for these classes for details.
+
+For use together with selections, an instance of gmx::Selection or
+gmx::SelectionPosition can be transparently passed as the positions for the
+neighborhood search.
+
+Implementation
+==============
+
+This section provides a high-level overview of the algorithm used.  It is not
+necessary to understand all the details to use the API, but it can be useful to
+get the best performance out of it.  The main audience is developers who may
+need to extend the API to make it suitable for more cases.
+
+The grid for the search is initialized based on the reference positions and the
+PBC information:
+
+ - The grid cells are always rectangular, even for fully triclinic boxes.
+ - If there is no PBC, the grid edges are defined from the bounding box of the
+   reference positions; with PBC, the grid covers the unit cell.
+ - The grid cell size is determined such that on average, each cell contains
+   ten particles.  Special considerations are in place for cases where the grid
+   will only be one- or two-dimensional because of a flat box.
+ - If the resulting grid has too few cells in some dimensions, the code
+   falls back automatically to an all-pairs search.  For correct operation, the
+   grid algorithm needs three cells in each dimension, but the code can fall
+   back to a non-gridded search for each dimension separately.
+ - If the resulting grid has so few cells that the search would anyways
+   consider all (or nearly all) cell pairs, the search falls back to a
+   simple search.
+ - The initialization also pre-calculates the shifts required across the
+   periodic boundaries for triclinic cells, i.e., the fractional number of
+   cells that the grid origin is shifted when crossing the periodic boundary in
+   Y or Z directions.
+ - Finally, all the reference positions are mapped to the grid cells.
+
+There are a few heuristic numbers in the above logic: the average number of
+particles within a cell, and the cutover point from grid to an all-pairs
+search.  These have not been particularly optimized for best performance.
+
+When doing the search for test positions, each test position is considered
+independently:
+
+ - The coordinates of the test position are mapped to the grid coordinate
+   system.  The coordinates here are fractional and may lay outside the grid
+   for non-periodic dimensions.
+ - The bounding box of the cutoff sphere centered at the mapped coordinates is
+   determined, and each grid cell that intersects with this box is used for
+   searching the reference positions.  So the searched grid cells may vary
+   depending on the coordinates of the test position, even if the test position
+   is within the same cell.
+ - Possible triclinic shifts in the grid are considered when looping over the
+   cells in the cutoff box if the coordinates wrap around a periodic dimension.
+   This is done by shifting the search range in the other dimensions when the Z
+   or Y dimension loop crosses the boundary.
diff --git a/src/gromacs/selection/nbsearch.cpp b/src/gromacs/selection/nbsearch.cpp

index 8d808900fc69c9eb6dee895855ec218fd8f12e89..768be462d87beaa763113edb6043f3e43ebb38dc 100644 (file)
--- a/src/gromacs/selection/nbsearch.cpp
+++ b/src/gromacs/selection/nbsearch.cpp
@@ -36,6 +36,8 @@
   * \brief
   * Implements neighborhood searching for analysis (from nbsearch.h).
   *
+ * High-level overview of the algorithm is at \ref page_analysisnbsearch.
+ *
   * \todo
   * The grid implementation could still be optimized in several different ways:
   *   - Pruning grid cells from the search list if they are completely outside
@@ -567,6 +569,9 @@ bool AnalysisNeighborhoodSearchImpl::initGridCells(
          else
          {
              cellCount = std::max(1, static_cast<int>(box[dd][dd] / targetsize));
+            // TODO: If the cell count is one or two, it would be better to
+            // just fall back to bSingleCell[dd] = true, and leave the rest to
+            // the efficiency check later.
              if (bGridPBC_[dd] && cellCount < 3)
              {
                  return false;
diff --git a/src/gromacs/selection/nbsearch.h b/src/gromacs/selection/nbsearch.h

index d5ae8d41c417f126324520bb99a70ae683d903dc..062d9b025833f94ab30b415013077f2f4ce28edf 100644 (file)
--- a/src/gromacs/selection/nbsearch.h
+++ b/src/gromacs/selection/nbsearch.h
@@ -36,11 +36,10 @@
   * \brief API for neighborhood searching for analysis.
   *
   * The main part of the API is the class gmx::AnalysisNeighborhood.
- * See the class documentation for usage.
+ * See \ref page_analysisnbsearch for an overview.
   *
   * The classes within this file can be used independently of the other parts
- * of the library.
- * The library also uses the classes internally.
+ * of the selection module.
   *
   * \author Teemu Murtola <teemu.murtola@gmail.com>
   * \inpublicapi
@@ -196,11 +195,7 @@ class AnalysisNeighborhoodPositions
  /*! \brief
   * Neighborhood searching for analysis tools.
   *
- * This class implements neighborhood searching routines for analysis tools.
- * The emphasis is in flexibility and ease of use; one main driver is to have
- * a common implementation of grid-based searching to avoid replicating this in
- * multiple tools (and to make more tools take advantage of the significant
- * performance improvement this allows).
+ * See \ref page_analysisnbsearch for an overview.
   *
   * To use the search, create an object of this type, call setCutoff() to
   * initialize it, and then repeatedly call initSearch() to start a search with
author	Teemu Murtola <teemu.murtola@gmail.com>
	Mon, 25 May 2015 05:49:16 +0000 (08:49 +0300)
committer	Teemu Murtola <teemu.murtola@gmail.com>
	Mon, 25 May 2015 18:23:36 +0000 (21:23 +0300)
docs/doxygen/user/analysisframework.md		patch \| blob \| history
docs/doxygen/user/analysisnbsearch.md	[new file with mode: 0644]	patch \| blob
src/gromacs/selection/nbsearch.cpp		patch \| blob \| history
src/gromacs/selection/nbsearch.h		patch \| blob \| history