+.. _install guide:
+
+******************
+Installation guide
+******************
+
+.. highlight:: bash
+
+Introduction to building |Gromacs|
+==================================
+
+These instructions pertain to building |Gromacs|
+|version|. You might also want to check the `up-to-date installation instructions`_.
+
+Quick and dirty installation
+----------------------------
+1. Get the latest version of your C and C++ compilers.
+2. Check that you have CMake version |GMX_CMAKE_MINIMUM_REQUIRED_VERSION| or later.
+3. Get and unpack the latest version of the |Gromacs| tarball.
+4. Make a separate build directory and change to it.
+5. Run ``cmake`` with the path to the source as an argument.
+6. Run ``make``, ``make check``, and ``make install``.
+
+Or, as a sequence of commands to execute:
+
+.. parsed-literal::
+
+ tar xfz gromacs-|version|.tar.gz
+ cd gromacs-|version|
+ mkdir build
+ cd build
+ cmake .. -DGMX_BUILD_OWN_FFTW=ON -DREGRESSIONTEST_DOWNLOAD=ON
+ make
+ make check
+ sudo make install
+ source /usr/local/gromacs/bin/GMXRC
+
+This will first download and build the prerequisite FFT library, and
+then |Gromacs|. If you already have FFTW installed, you can
+remove that argument to ``cmake``. Overall, this build of |Gromacs| will
+be correct and reasonably fast on the machine upon which ``cmake``
+ran. If you want to get the maximum value for your hardware with
+|Gromacs|, you will have to read further. Sadly, the interactions of
+hardware, libraries, and compilers are only going to continue to get
+more complex.
+
+Typical installation
+--------------------
+As above, and with further details below, but you should consider
+using the following `CMake options`_ with the
+appropriate value instead of ``xxx``:
+
+* ``-DCMAKE_C_COMPILER=xxx`` equal to the name of the C99 `Compiler`_ you wish to use (or the environment variable ``CC``)
+* ``-DCMAKE_CXX_COMPILER=xxx`` equal to the name of the C++98 `compiler`_ you wish to use (or the environment variable ``CXX``)
+* ``-DGMX_MPI=on`` to build using `MPI support`_
+* ``-DGMX_GPU=on`` to build using nvcc to run with an NVIDIA `native GPU acceleration`_
+* ``-DGMX_SIMD=xxx`` to specify the level of `SIMD support`_ of the node on which mdrun will run
+* ``-DGMX_BUILD_MDRUN_ONLY=on`` for `building only mdrun`_, e.g. for compute cluster back-end nodes
+* ``-DGMX_DOUBLE=on`` to run |Gromacs| in double precision (slower, and not normally useful)
+* ``-DCMAKE_PREFIX_PATH=xxx`` to add a non-standard location for CMake to `search for libraries, headers or programs`_
+* ``-DCMAKE_INSTALL_PREFIX=xxx`` to install |Gromacs| to a `non-standard location`_ (default ``/usr/local/gromacs``)
+* ``-DBUILD_SHARED_LIBS=off`` to turn off the building of shared libraries to help with `static linking`_
+* ``-DGMX_FFT_LIBRARY=xxx`` to select whether to use ``fftw``, ``mkl`` or ``fftpack`` libraries for `FFT support`_
+* ``-DCMAKE_BUILD_TYPE=Debug`` to build |Gromacs| in debug mode
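+
+For example, a configuration that chooses the compilers explicitly and
+installs to a location in your home directory might look like the
+following sketch (the compiler names and the installation path are
+only illustrative; substitute your own):
+
+::
+
+ cmake .. -DCMAKE_C_COMPILER=gcc -DCMAKE_CXX_COMPILER=g++ -DCMAKE_INSTALL_PREFIX=$HOME/progs/gromacs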
+
+Building older versions
+-----------------------
+For installation instructions for old |Gromacs| versions, see the
+documentation for installing
+`GROMACS 4.5 <http://www.gromacs.org/Documentation/Installation_Instructions_4.5>`_,
+`GROMACS 4.6 <http://www.gromacs.org/Documentation/Installation_Instructions_4.6>`_,
+and
+`GROMACS 5.0 <http://www.gromacs.org/Documentation/Installation_Instructions_5.0>`_.
+
+Prerequisites
+=============
+Platform
+--------
+|Gromacs| can be compiled for many operating systems and architectures.
+These include any distribution of Linux, Mac OS X or Windows, and
+architectures including x86, AMD64/x86-64, PPC, ARM v7 and SPARC VIII.
+
+Compiler
+--------
+Technically, |Gromacs| can be compiled on any platform with an ANSI C99
+and C++98 compiler, and their respective standard C/C++ libraries.
+We use only a few C99 features, but note that the C++ compiler also needs to
+support these C99 features (notably, ``int64_t`` and related types), which are not
+part of the C++98 standard.
+Getting good performance on an OS and architecture requires choosing a
+good compiler. In practice, many compilers struggle to do a good job
+optimizing the |Gromacs| architecture-optimized SIMD kernels.
+
+For best performance, the |Gromacs| team strongly recommends you get the
+most recent version of your preferred compiler for your platform.
+There is a large amount of |Gromacs| code that depends on effective
+compiler optimization to get high performance. This makes |Gromacs|
+performance sensitive to the compiler used, and the binary will often
+only work on the hardware for which it is compiled.
+
+* In particular, |Gromacs| includes a lot of explicit SIMD (single
+ instruction, multiple data) optimization that can use assembly
+ instructions available on most modern processors. This can have a
+ substantial effect on performance, but for recent processors you
+ also need a similarly recent compiler that includes support for the
+ corresponding SIMD instruction set to get this benefit. The
+ configuration does a good job at detecting this, and you will
+ usually get warnings if |Gromacs| and your hardware support a more
+ recent instruction set than your compiler.
+
+* On Intel-based x86 hardware, we recommend you to use the GNU
+ compilers version 4.7 or later or Intel compilers version 12 or
+ later for best performance. The Intel compiler has historically been
+ better at instruction scheduling, but recent gcc versions have
+ proved to be as fast or sometimes faster than Intel.
+
+* The Intel and GNU compilers produce much faster |Gromacs| executables
+ than the PGI and Cray compilers.
+
+* On AMD-based x86 hardware up through the "K10" microarchitecture
+ ("Family 10h", e.g. Thuban/Magny-Cours Opteron 6100-series
+ processors), it is worth using the Intel compiler for
+ better performance, but gcc version 4.7 and later are also
+ reasonable.
+
+* On the AMD Bulldozer architecture (Opteron 6200), AMD introduced
+ fused multiply-add instructions and an "FMA4" instruction format not
+ available on Intel x86 processors. Thus, on the most recent AMD
+ processors you want to use gcc version 4.7 or later for best
+ performance! The Intel compiler will only generate code for the
+ subset also supported by Intel processors, and that is significantly
+ slower.
+
+* If you are running on Mac OS X, the best option is the Intel
+ compiler. Both clang and gcc will work, but they produce lower
+ performance and each have some shortcomings. Current Clang does not
+ support OpenMP. This may change when clang 3.5 becomes available.
+
+* For all non-x86 platforms, your best option is typically to use the
+ vendor's default or recommended compiler, and check for specialized
+ information below.
+
+Compiling with parallelization options
+--------------------------------------
+|Gromacs| can run in parallel on multiple cores of a single
+workstation using its built-in thread-MPI. No user action is required
+in order to enable this.
+
+GPU support
+^^^^^^^^^^^
+If you wish to use the excellent native GPU support in |Gromacs|,
+NVIDIA's CUDA_ software development kit (version |REQUIRED_CUDA_VERSION| or later) is required,
+and the latest version is strongly encouraged. NVIDIA GPUs with at
+least NVIDIA compute capability |REQUIRED_CUDA_COMPUTE_CAPABILITY| are
+required, e.g. Fermi or Kepler cards. You are strongly recommended to
+get the latest CUDA version and driver supported by your hardware, but
+beware of possible performance regressions in newer CUDA versions on
+older hardware. Note that while some CUDA compilers (nvcc) might not
+officially support recent versions of gcc as the back-end compiler, we
+still recommend that you at least use a gcc version recent enough to
+get the best SIMD support for your CPU, since |Gromacs| always runs some
+code on the CPU. It is most reliable to use the same C++ compiler
+version for |Gromacs| code as used as the back-end compiler for nvcc,
+but it could be faster to mix compiler versions to suit particular
+contexts.
+
+.. _mpi-support:
+
+MPI support
+^^^^^^^^^^^
+If you wish to run in parallel on multiple machines across a network,
+you will need to have
+
+* an MPI library installed that supports the MPI 1.3
+ standard, and
+* wrapper compilers that will compile code using that library.
+
+The |Gromacs| team recommends OpenMPI_ version
+1.6 (or higher), MPICH_ version 1.4.1 (or
+higher), or your hardware vendor's MPI installation. The most recent
+version of either of these is likely to be the best. More specialized
+networks might depend on accelerations only available in the vendor's
+library. LAMMPI_ might work, but since it has
+been deprecated for years, it is not supported.
+
+Often OpenMP_ parallelism is an
+advantage for |Gromacs|, but support for this is generally built into
+your compiler and detected automatically.
+
+In summary, for maximum performance you will need to examine how you
+will use |Gromacs|, what hardware you plan to run on, and whether you
+can afford a non-free compiler for slightly better
+performance. Unfortunately, the only way to find out is to test
+different options and parallelization schemes for the actual
+simulations you want to run. You will still get *good*
+performance with the default build and runtime options, but if you
+truly want to push your hardware to the performance limit, the days of
+just blindly starting programs with ``mdrun`` are gone.
+
+CMake
+-----
+|Gromacs| uses the CMake build system, and requires
+version |GMX_CMAKE_MINIMUM_REQUIRED_VERSION| or higher. Lower versions
+will not work. You can check whether CMake is installed, and what
+version it is, with ``cmake --version``. If you need to install CMake,
+then first check whether your platform's package management system
+provides a suitable version, or visit the `CMake installation page`_
+for pre-compiled
+binaries, source code and installation instructions. The |Gromacs| team
+recommends you install the most recent version of CMake you can.
+
+.. _FFT support:
+
+Fast Fourier Transform library
+------------------------------
+Many simulations in |Gromacs| make extensive use of fast Fourier
+transforms, and a software library to perform these is always
+required. We recommend FFTW_ (version 3 or higher only) or
+Intel MKL_. The choice of
+library can be set with ``cmake -DGMX_FFT_LIBRARY=<name>``, where
+``<name>`` is one of ``fftw``, ``mkl``, or ``fftpack``. FFTPACK is bundled
+with |Gromacs| as a fallback, and is acceptable if mdrun performance is
+not a priority.
+
+Using FFTW
+^^^^^^^^^^
+FFTW_ is likely to be available for your platform via its package
+management system, but there can be compatibility and significant
+performance issues associated with these packages. In particular,
+|Gromacs| simulations are normally run in "mixed" floating-point
+precision, which is suited for the use of single precision in
+FFTW. The default FFTW package is normally in double
+precision, and good compiler options to use for FFTW when linked to
+|Gromacs| may not have been used. Accordingly, the |Gromacs| team
+recommends either
+
+* that you permit the |Gromacs| installation to download and
+ build FFTW from source automatically for you (use
+ ``cmake -DGMX_BUILD_OWN_FFTW=ON``), or
+* that you build FFTW from the source code.
+
+If you build FFTW from source yourself, get the most recent version
+and follow the `FFTW installation guide`_. Note that we have recently
+contributed new SIMD optimization for several extra platforms to
+FFTW, which will appear in FFTW-3.3.5 (for now it is available in the
+FFTW repository on github, or you can find a very unofficial prerelease
+version at ftp://ftp.gromacs.org/pub/prerequisite_software ).
+Choose the precision for FFTW (i.e. single/float vs. double) to
+match whether you will later use mixed or double precision for
+|Gromacs|. There is no need to compile FFTW with
+threading or MPI support, but it does no harm. On x86 hardware,
+compile with *both* ``--enable-sse2`` and ``--enable-avx`` for
+FFTW-3.3.4 and earlier. As of FFTW-3.3.5 you should also add
+``--enable-avx2``. FFTW will create a fat library with codelets
+for all different instruction sets, and pick the fastest supported
+one at runtime. On IBM Power8, you definitely want the upcoming
+FFTW-3.3.5 and use ``--enable-vsx`` for SIMD support. If you are
+using a Cray, there is a special modified (commercial) version of
+FFTs using the FFTW interface which might be faster, but we have
+not yet tested this extensively.
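+
+If you do build FFTW yourself, the sequence might look like the
+following sketch for a mixed-precision |Gromacs| build on x86 (the
+FFTW version and installation prefix are only examples):
+
+::
+
+ tar xf fftw-3.3.4.tar.gz
+ cd fftw-3.3.4
+ ./configure --enable-float --enable-sse2 --enable-avx --prefix=$HOME/progs/fftw
+ make && make install
+
+Here ``--enable-float`` selects the single-precision FFTW that a
+mixed-precision |Gromacs| build uses.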
+
+Using MKL
+^^^^^^^^^
+Using MKL_ with the Intel Compilers version 11 or higher is very
+simple. Set up your compiler environment correctly, perhaps with a
+command like ``source /path/to/compilervars.sh intel64`` (or consult
+your local documentation). Then set ``-DGMX_FFT_LIBRARY=mkl`` when you
+run cmake. In this case, |Gromacs| will also use MKL for BLAS and LAPACK
+(see `linear algebra libraries`_). Generally,
+there is no advantage in using MKL with |Gromacs|, and FFTW is often
+faster.
+
+Otherwise, you can get your hands dirty and configure MKL by setting
+
+::
+
+ -DGMX_FFT_LIBRARY=mkl
+ -DMKL_LIBRARIES="/full/path/to/libone.so;/full/path/to/libtwo.so"
+ -DMKL_INCLUDE_DIR="/full/path/to/mkl/include"
+
+where the full list (and order!) of libraries you require are found in
+Intel's MKL documentation for your system.
+
+Optional build components
+-------------------------
+* Compiling to run on NVIDIA GPUs requires CUDA_
+* An external Boost library can be used to provide better
+ implementation support for smart pointers and exception handling,
+ but the |Gromacs| source bundles a subset of Boost 1.55.0 as a fallback
+* Hardware-optimized BLAS and LAPACK libraries are useful
+ for a few of the |Gromacs| utilities focused on normal modes and
+ matrix manipulation, but they do not provide any benefits for normal
+ simulations. Configuring these is discussed at
+ `linear algebra libraries`_.
+* The built-in |Gromacs| trajectory viewer ``gmx view`` requires X11 and
+ Motif/Lesstif libraries and header files. You may prefer to use
+ third-party software for visualization, such as VMD_ or PyMol_.
+* An external TNG library for trajectory-file handling can be used,
+ but TNG 1.7.3 is bundled in the |Gromacs| source already
+* zlib is used by TNG for compressing some kinds of trajectory data
+* Running the |Gromacs| test suite requires libxml2
+* Building the |Gromacs| documentation requires ImageMagick, pdflatex,
+ bibtex, doxygen, python, sphinx and pygments.
+* The |Gromacs| utility programs often write data files in formats
+ suitable for the Grace plotting tool, but it is straightforward to
+ use these files in other plotting programs, too.
+
+Doing a build of |Gromacs|
+==========================
+This section will cover a general build of |Gromacs| with CMake_, but it
+is not an exhaustive discussion of how to use CMake. There are many
+resources available on the web, which we suggest you search for when
+you encounter problems not covered here. The material below applies
+specifically to builds on Unix-like systems, including Linux, and Mac
+OS X. For other platforms, see the specialist instructions below.
+
+Configuring with CMake
+----------------------
+CMake will run many tests on your system and do its best to work out
+how to build |Gromacs| for you. If your build machine is the same as
+your target machine, then you can be sure that the defaults will be
+pretty good. The build configuration will for instance attempt to
+detect the specific hardware instructions available in your
+processor. However, if you want to control aspects of the build, or
+you are compiling on a cluster head node for back-end nodes with a
+different architecture, there are plenty of things you can set
+manually.
+
+The best way to use CMake to configure |Gromacs| is to do an
+"out-of-source" build, by making another directory from which you will
+run CMake. This can be outside the source directory, or a subdirectory
+of it. It also means you can never corrupt your source code by trying
+to build it! So, the only required argument on the CMake command line
+is the name of the directory containing the ``CMakeLists.txt`` file of
+the code you want to build. For example, download the source tarball
+and use
+
+.. parsed-literal::
+
+ tar xfz gromacs-|version|.tar.gz
+ cd gromacs-|version|
+ mkdir build-gromacs
+ cd build-gromacs
+ cmake ..
+
+You will see ``cmake`` report a sequence of results of tests and
+detections done by the |Gromacs| build system. These are written to the
+``cmake`` cache, kept in ``CMakeCache.txt``. You can edit this file by
+hand, but this is not recommended because you could make a mistake.
+You should not attempt to move or copy this file to do another build,
+because file paths are hard-coded within it. If you mess things up,
+just delete this file and start again with ``cmake``.
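+
+For example, to discard a broken configuration and start afresh from
+within the build directory:
+
+::
+
+ rm -f CMakeCache.txt
+ cmake ..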
+
+If there is a serious problem detected at this stage, then you will see
+a fatal error and some suggestions for how to overcome it. If you are
+not sure how to deal with that, please start by searching on the web
+(most computer problems already have known solutions!) and then
+consult the gmx-users mailing list. There are also informational
+warnings that you might like to take on board or not. Piping the
+output of ``cmake`` through ``less`` or ``tee`` can be
+useful, too.
+
+Once ``cmake`` returns, you can see all the settings that were chosen
+and information about them by using e.g. the curses interface
+
+::
+
+ ccmake ..
+
+You can actually use ``ccmake`` (available on most Unix platforms,
+if the curses library is supported) directly in the first step, but then
+most of the status messages will merely blink in the lower part
+of the terminal rather than be written to standard out. Most platforms
+including Linux, Windows, and Mac OS X even have native graphical user interfaces for
+``cmake``, and it can create project files for almost any build environment
+you want (including Visual Studio or Xcode).
+Check out `running CMake`_ for
+general advice on what you are seeing and how to navigate and change
+things. The settings you might normally want to change are already
+presented. You may make changes, then re-configure (using ``c``), so that it
+gets a chance to make changes that depend on yours and perform more
+checking. It may take several configuration passes to reach the desired
+configuration, in particular if you need to resolve errors.
+
+
+When you have reached the desired configuration with ``ccmake``, the
+build system can be generated by pressing ``g``. This requires that the previous
+configuration pass did not reveal any additional settings (if it did, you need
+to configure once more with ``c``). With ``cmake``, the build system is generated
+after each pass that does not produce errors.
+
+You cannot change compilers after the initial run of
+``cmake``. If you need to change them, clean up and start again.
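+
+For example, to start again with different compilers you might remove
+the CMake-generated files from the (disposable) build directory and
+re-run ``cmake``, naming the new compilers explicitly (``icc`` and
+``icpc`` here are just an illustration):
+
+::
+
+ rm -rf CMakeCache.txt CMakeFiles/
+ CC=icc CXX=icpc cmake ..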
+
+.. _non-standard location:
+
+Where to install |Gromacs|
+^^^^^^^^^^^^^^^^^^^^^^^^^^
+
+A key thing to consider is the setting of
+``CMAKE_INSTALL_PREFIX``. You will need to be able to write to this
+directory in order to install |Gromacs| later. This directory cannot
+be the same as the build tree, and if you change your mind later,
+changing it in the cache unfortunately triggers a full re-build. So
+if you do not have super-user privileges on your machine, then you
+will need to choose a sensible location within your home directory
+for your |Gromacs| installation. Even if you do have super-user
+privileges, you should use them only for the installation phase, and
+never for configuring, building, or running |Gromacs|!
+
+.. _cmake options:
+
+Using CMake command-line options
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+Once you become comfortable with setting and changing options, you may
+know in advance how you will configure |Gromacs|. If so, you can speed
+things up by invoking ``cmake`` and passing the various options at once
+on the command line. This can be done by setting cache variables at the
+``cmake`` invocation using the ``-DOPTION=VALUE`` syntax; note that some
+environment variables are also taken into account, in particular
+variables like ``CC``, ``CXX``, ``FCC`` (which may be familiar to autoconf users).
+
+For example, the following command line
+
+::
+
+ cmake .. -DGMX_GPU=ON -DGMX_MPI=ON -DCMAKE_INSTALL_PREFIX=/home/marydoe/programs
+
+can be used to build with GPUs, MPI and install in a custom
+location. You can even save that in a shell script to make it
+easier next time. You can also do this kind of thing with ``ccmake``,
+but you should avoid this, because the options set with ``-D`` will not
+be able to be changed interactively in that run of ``ccmake``.
+
+SIMD support
+^^^^^^^^^^^^
+|Gromacs| has extensive support for detecting and using the SIMD
+capabilities of many modern HPC CPU architectures. If you are building
+|Gromacs| on the same hardware you will run it on, then you don't need
+to read more about this, unless you are getting configuration warnings
+you do not understand. By default, the |Gromacs| build system will
+detect the SIMD instruction set supported by the CPU architecture (on
+which the configuring is done), and thus pick the best
+available SIMD parallelization supported by |Gromacs|. The build system
+will also check that the compiler and linker used also support the
+selected SIMD instruction set and issue a fatal error if they
+do not.
+
+Valid values are listed below, and the applicable value with the
+highest number in the list is generally the one you should choose:
+
+1. ``None`` For use only on an architecture either lacking SIMD,
+ or to which |Gromacs| has not yet been ported and none of the
+ options below are applicable.
+2. ``SSE2`` This SIMD instruction set was introduced in Intel
+ processors in 2001, and AMD in 2003. Essentially all x86
+ machines in existence have this, so it might be a good choice if
+ you need to support dinosaur x86 computers too.
+3. ``SSE4.1`` Present in all Intel core processors since 2007,
+ but notably not in AMD Magny-Cours. Still, almost all recent
+ processors support this, so this can also be considered a good
+ baseline if you are content with portability between reasonably
+ modern processors.
+4. ``AVX_128_FMA`` AMD bulldozer processors (2011) have this.
+ Unfortunately Intel and AMD have diverged in the last few years;
+ if you want good performance on modern AMD processors
+ you have to use this, since it also allows the rest of the
+ code to use AMD 4-way fused multiply-add instructions. The drawback
+ is that your code will not run on Intel processors at all.
+5. ``AVX_256`` This instruction set is present on Intel processors
+ since Sandy Bridge (2011), where it is the best choice unless
+ you have an even more recent CPU that supports AVX2. While this
+ code will work on recent AMD processors, it is significantly
+ less efficient than the ``AVX_128_FMA`` choice above - do not be
+ fooled into assuming that 256 is better than 128 in this case.
+6. ``AVX2_256`` Present on Intel Haswell processors released in 2013,
+ and it will also enable Intel 3-way fused multiply-add instructions.
+ This code will not work on AMD CPUs.
+7. ``IBM_QPX`` BlueGene/Q A2 cores have this.
+8. ``Sparc64_HPC_ACE`` Fujitsu machines like the K computer have this.
+
+The CMake configure system will check that the compiler you have
+chosen can target the architecture you have chosen. mdrun will check
+further at runtime, so if in doubt, choose the lowest setting you
+think might work, and see what mdrun says. The configure system also
+works around many known issues in many versions of common HPC
+compilers. However, since the options also enable general compiler
+flags for the platform in question, you can end up in situations
+where e.g. an ``AVX_128_FMA`` binary will just crash on any
+Intel machine, since the code will try to execute general illegal
+instructions (inserted by the compiler) before mdrun gets to the
+architecture detection routines.
+
+A further ``GMX_SIMD=Reference`` option exists, which is a special
+SIMD-like implementation written in plain C that developers can use
+when developing support in |Gromacs| for new SIMD architectures. It is
+not designed for use in production simulations, but if you are using
+an architecture with SIMD support to which |Gromacs| has not yet been
+ported, you may wish to try this option instead of the default
+``GMX_SIMD=None``, as it can often out-perform this when the
+auto-vectorization in your compiler does a good job. Also, please post
+on the |Gromacs| mailing lists, because |Gromacs| can probably be ported
+to new SIMD architectures in a few days.
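+
+For example, when configuring on an older head node for newer compute
+nodes known to support AVX2, you might select the SIMD level
+explicitly:
+
+::
+
+ cmake .. -DGMX_SIMD=AVX2_256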
+
+CMake advanced options
+^^^^^^^^^^^^^^^^^^^^^^
+The options that are displayed in the default view of ``ccmake`` are
+ones that we think a reasonable number of users might want to consider
+changing. There are a lot more options available, which you can see by
+toggling the advanced mode in ``ccmake`` on and off with ``t``. Even
+there, most of the variables that you might want to change have a
+``CMAKE_`` or ``GMX_`` prefix. There are also some options that will be
+visible or not according to whether their preconditions are satisfied.
+
+.. _search for libraries, headers or programs:
+
+Helping CMake find the right libraries, headers, or programs
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+If libraries are installed in non-default locations, their location can
+be specified using the following environment variables:
+
+* ``CMAKE_INCLUDE_PATH`` for header files
+* ``CMAKE_LIBRARY_PATH`` for libraries
+* ``CMAKE_PREFIX_PATH`` for header, libraries and binaries
+ (e.g. ``/usr/local``).
+
+The respective ``include``, ``lib``, or ``bin`` is
+appended to the path. For each of these variables, a list of paths can
+be specified (on Unix, separated with ":"). Note that these are
+environment variables (and not ``cmake`` command-line arguments) and in
+a ``bash`` shell are used like:
+
+::
+
+ CMAKE_PREFIX_PATH=/opt/fftw:/opt/cuda cmake ..
+
+Alternatively, these variables are also ``cmake`` options, so they can
+be set like ``-DCMAKE_PREFIX_PATH=/opt/fftw:/opt/cuda``.
+
+The ``CC`` and ``CXX`` environment variables are also useful
+for indicating to ``cmake`` which compilers to use, which can be very
+important for maximising |Gromacs| performance. Similarly,
+``CFLAGS``/``CXXFLAGS`` can be used to pass compiler
+options, but note that these will be appended to those set by
+|Gromacs| for your build platform and build type. You can customize
+some of this with advanced options such as ``CMAKE_C_FLAGS``
+and its relatives.
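+
+For example, a sketch that names the compilers via the environment and
+appends an extra optimization flag (the flag is purely illustrative):
+
+::
+
+ CC=gcc CXX=g++ CFLAGS="-funroll-all-loops" cmake ..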
+
+See also the page on `CMake environment variables`_.
+
+Native GPU acceleration
+^^^^^^^^^^^^^^^^^^^^^^^
+If you have the CUDA_ Toolkit installed, you can use ``cmake`` with:
+
+::
+
+ cmake .. -DGMX_GPU=ON -DCUDA_TOOLKIT_ROOT_DIR=/usr/local/cuda
+
+(or whichever path has your installation). In some cases, you might
+need to specify manually which of your C++ compilers should be used,
+e.g. with the advanced option ``CUDA_HOST_COMPILER``.
+
+By default, optimized code will be generated for CUDA architectures
+supported by the nvcc compiler (and the |Gromacs| build system).
+However, it can be beneficial to manually pick the specific CUDA architecture(s)
+to generate code for either to reduce compilation time (and binary size) or to
+target a new architecture not yet supported by the |Gromacs| build system.
+Setting the desired CUDA architecture(s) and virtual architecture(s)
+can be done using the ``GMX_CUDA_TARGET_SM`` and ``GMX_CUDA_TARGET_COMPUTE``
+variables, respectively. These take a semicolon delimited string with
+the two digit suffixes of CUDA (virtual) architectures names
+(for details see the "Options for steering GPU code generation" section of the
+nvcc manual or help output, or Chapter 6 of the nvcc manual).
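+
+For example, to generate code only for devices of compute capability
+3.5 and 5.2 (these particular values are just an illustration):
+
+::
+
+ cmake .. -DGMX_GPU=ON -DGMX_CUDA_TARGET_SM="35;52"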
+
+The GPU acceleration has been tested on AMD64/x86-64 platforms with
+Linux, Mac OS X and Windows operating systems, but Linux is the
+best-tested and supported of these. Linux running on ARM v7 (32 bit)
+CPUs also works.
+
+Static linking
+^^^^^^^^^^^^^^
+Dynamic linking of the |Gromacs| executables will lead to a
+smaller disk footprint when installed, and so is the default on
+platforms where we believe it has been tested repeatedly and found to work.
+In general, this includes Linux, Windows, Mac OS X and BSD systems.
+Static binaries take much more space, but on some hardware and/or under
+some conditions they are necessary, most commonly when you are running a parallel
+simulation using MPI libraries (e.g. BlueGene, Cray).
+
+* To link |Gromacs| binaries statically against the internal |Gromacs|
+ libraries, set ``-DBUILD_SHARED_LIBS=OFF``.
+* To link statically against external (non-system) libraries as well,
+ the ``-DGMX_PREFER_STATIC_LIBS=ON`` option can be used. Note, that in
+ general ``cmake`` picks up whatever is available, so this option only
+ instructs ``cmake`` to prefer static libraries when both static and
+ shared are available. If no static version of an external library is
+ available, even when the aforementioned option is ``ON``, the shared
+ library will be used. Also note, that the resulting binaries will
+ still be dynamically linked against system libraries on platforms
+ where that is the default. To use static system libraries,
+ additional compiler/linker flags are necessary, e.g. ``-static-libgcc
+ -static-libstdc++``.
+* To attempt to link a fully static binary set
+ ``-DGMX_BUILD_SHARED_EXE=OFF``. This will prevent CMake from explicitly
+ setting any dynamic linking flags. This option also sets
+ ``-DBUILD_SHARED_LIBS=OFF`` and ``-DGMX_PREFER_STATIC_LIBS=ON`` by
+ default, but the above caveats apply. For compilers which don't
+ default to static linking, the required flags have to be specified. On
+ Linux, this is usually ``CFLAGS=-static CXXFLAGS=-static``.
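+
+Putting these together, an attempt at a fully static build on Linux
+with gcc might be configured as in the following sketch (whether it
+succeeds depends on static versions of all the required libraries
+being available on your system):
+
+::
+
+ CFLAGS=-static CXXFLAGS=-static cmake .. -DGMX_BUILD_SHARED_EXE=OFF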
+
+Portability aspects
+^^^^^^^^^^^^^^^^^^^
+Here, we consider portability aspects related to CPU instruction sets;
+for details on other topics, such as static vs dynamic linking of
+binaries, please consult the relevant parts of this documentation or
+other non-|Gromacs|-specific resources.
+
+A |Gromacs| build will normally not be portable, not even across
+hardware with the same base instruction set like x86. Non-portable
+hardware-specific optimizations are selected at configure-time, such
+as the SIMD instruction set used in the compute-kernels. This
+selection will be done by the build system based on the capabilities
+of the build host machine or based on cross-compilation information
+provided to ``cmake`` at configuration.
+
+Often it is possible to ensure portability by choosing the least
+common denominator of SIMD support, e.g. SSE2 for x86, and ensuring
+that you use ``cmake -DGMX_USE_RDTSCP=off`` if any of the target CPU
+architectures does not support the ``RDTSCP`` instruction. However, we
+discourage attempts to use a single |Gromacs| installation when the
+execution environment is heterogeneous, such as a mix of AVX and
+earlier hardware, because this will lead to programs (especially
+mdrun) that run slowly on the new hardware. Building two full
+installations and locally managing how to call the correct one
+(e.g. using the module system) is the recommended
+approach. Alternatively, as at the moment the |Gromacs| tools do not
+make strong use of SIMD acceleration, it can be convenient to create
+an installation with tools portable across different x86 machines, but
+with separate mdrun binaries for each architecture. To achieve this,
+one can first build a full installation with the
+least-common-denominator SIMD instruction set, e.g. ``-DGMX_SIMD=SSE2``,
+then build separate mdrun binaries for each architecture present in
+the heterogeneous environment. By using custom binary and library
+suffixes for the mdrun-only builds, these can be installed to the
+same location as the "generic" tools installation.
+`Building just the mdrun binary`_ is possible by setting the
+``-DGMX_BUILD_MDRUN_ONLY=ON`` option.
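+
+As an illustration of this scheme (the SIMD levels, suffixes and
+installation prefix below are hypothetical examples)::
+
+ # full installation with lowest-common-denominator SIMD
+ cmake .. -DGMX_SIMD=SSE2 -DCMAKE_INSTALL_PREFIX=/opt/gromacs
+ make install
+ # additional mdrun-only build for the AVX nodes, with custom suffixes
+ cmake .. -DGMX_SIMD=AVX_256 -DGMX_BUILD_MDRUN_ONLY=ON \
+       -DGMX_DEFAULT_SUFFIX=OFF -DGMX_BINARY_SUFFIX=_avx \
+       -DGMX_LIBS_SUFFIX=_avx -DCMAKE_INSTALL_PREFIX=/opt/gromacs
+ make install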
+
+Linear algebra libraries
+^^^^^^^^^^^^^^^^^^^^^^^^
+As mentioned above, sometimes vendor BLAS and LAPACK libraries
+can provide performance enhancements for |Gromacs| when doing
+normal-mode analysis or covariance analysis. For simplicity, the text
+below will refer only to BLAS, but the same options are available
+for LAPACK. By default, CMake will search for BLAS, use it if it
+is found, and otherwise fall back on a version of BLAS internal to
+|Gromacs|. The ``cmake`` option ``-DGMX_EXTERNAL_BLAS=on`` will be set
+accordingly. The internal versions are fine for normal use. If you
+need to specify a non-standard path to search, use
+``-DCMAKE_PREFIX_PATH=/path/to/search``. If you need to specify a
+library with a non-standard name (e.g. ESSL on AIX or BlueGene), then
+set ``-DGMX_BLAS_USER=/path/to/reach/lib/libwhatever.a``.
+
+If you are using Intel MKL_ for FFT, then the BLAS and
+LAPACK it provides are used automatically. This could be
+over-ridden with ``GMX_BLAS_USER``, etc.
+
+On Apple platforms where the Accelerate Framework is available, these
+will be automatically used for BLAS and LAPACK. This could be
+over-ridden with ``GMX_BLAS_USER``, etc.
+
+Changing the names of |Gromacs| binaries and libraries
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+It is sometimes convenient to have different versions of the same
+|Gromacs| programs installed. The most common use cases have been single
+and double precision, and with and without MPI. This mechanism can
+also be used to install side-by-side multiple versions of mdrun
+optimized for different CPU architectures, as mentioned previously.
+
+By default, |Gromacs| will suffix programs and libraries for such builds
+with ``_d`` for double precision and/or ``_mpi`` for MPI (and nothing
+otherwise). This can be controlled manually with
+``GMX_DEFAULT_SUFFIX`` (``ON``/``OFF``), ``GMX_BINARY_SUFFIX`` (takes a
+string) and ``GMX_LIBS_SUFFIX`` (also takes a string). For instance, to
+set a custom suffix for
+programs and libraries, one might specify:
+
+::
+
+ cmake .. -DGMX_DEFAULT_SUFFIX=OFF -DGMX_BINARY_SUFFIX=_mod -DGMX_LIBS_SUFFIX=_mod
+
+Thus the names of all programs and libraries will be appended with
+``_mod``.
+
+Changing installation tree structure
+^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
+By default, a few different directories under ``CMAKE_INSTALL_PREFIX`` are used
+when |Gromacs| is installed. Some of these can be changed, which is mainly
+useful for packaging |Gromacs| for various distributions. The directories are
+listed below, with additional notes about some of them. Unless otherwise noted,
+the directories can be renamed by editing the installation paths in the main
+CMakeLists.txt.
+
+``bin/``
+ The standard location for executables and some scripts.
+ Some of the scripts hardcode the absolute installation prefix, which needs
+ to be changed if the scripts are relocated.
+``include/gromacs/``
+ The standard location for installed headers.
+``lib/``
+ The standard location for libraries. The default depends on the system, and
+ is determined by CMake.
+ The name of the directory can be changed using ``GMX_LIB_INSTALL_DIR`` CMake
+ variable.
+``lib/pkgconfig/``
+ Information about the installed ``libgromacs`` library for ``pkg-config`` is
+ installed here. The ``lib/`` part adapts to the installation location of the
+ libraries. The installed files contain the installation prefix as absolute
+ paths.
+``share/cmake/``
+ CMake package configuration files are installed here.
+``share/gromacs/``
+ Various data files and some documentation go here.
+ The ``gromacs`` part can be changed using ``GMX_DATA_INSTALL_DIR``. Using this
+ CMake variable is the preferred way of changing the installation path for
+ ``share/gromacs/top/``, since the path to this directory is built into
+ ``libgromacs`` as well as some scripts, both as a relative and as an absolute
+ path (the latter as a fallback if everything else fails).
+``share/man/``
+ Installed man pages go here.
+
+Compiling and linking
+---------------------
+Once you have configured with ``cmake``, you can build |Gromacs| with ``make``.
+It is expected that this will always complete successfully, and
+give few or no warnings. The CMake-time tests |Gromacs| makes on the settings
+you choose are pretty extensive, but there are probably a few cases we
+have not thought of yet. Search the web first for solutions to
+problems, but if you need help, ask on gmx-users, being sure to
+provide as much information as possible about what you did, the system
+you are building on, and what went wrong. This may mean scrolling back
+a long way through the output of ``make`` to find the first error
+message!
+
+If you have a multi-core or multi-CPU machine with ``N``
+processors, then using
+
+::
+
+ make -j N
+
+will generally speed things up by quite a bit. Other build generator systems
+supported by ``cmake`` (e.g. ``ninja``) also work well.
+
+.. _building just the mdrun binary:
+
+Building only mdrun
+^^^^^^^^^^^^^^^^^^^
+Past versions of the build system offered "mdrun" and "install-mdrun"
+targets (similarly for other programs too) to build and install only
+the mdrun program, respectively. Such a build is useful when the
+configuration is only relevant for mdrun (such as with
+parallelization options for MPI, SIMD, GPUs, or on BlueGene or Cray),
+or the length of time for the compile-link-install cycle is relevant
+when developing.
+
+This is now supported with the ``cmake`` option
+``-DGMX_BUILD_MDRUN_ONLY=ON``, which will build a cut-down version of
+``libgromacs`` and/or the mdrun program (according to whether shared
+or static). Naturally, now ``make install`` installs only those
+products. By default, mdrun-only builds will default to static linking
+against |Gromacs| libraries, because this is generally a good idea for
+the targets for which an mdrun-only build is desirable. If you re-use
+a build tree and change to the mdrun-only build, then you will inherit
+the setting for ``BUILD_SHARED_LIBS`` from the old build, and will be
+warned that you may wish to manage ``BUILD_SHARED_LIBS`` yourself.
+
+Installing |Gromacs|
+--------------------
+Finally, ``make install`` will install |Gromacs| in the
+directory given in ``CMAKE_INSTALL_PREFIX``. If this is a system
+directory, then you will need permission to write there, and you
+should use super-user privileges only for ``make install`` and
+not the whole procedure.
+
+.. _getting access to GROMACS:
+
+Getting access to |Gromacs| after installation
+----------------------------------------------
+|Gromacs| installs the script ``GMXRC`` in the ``bin``
+subdirectory of the installation directory
+(e.g. ``/usr/local/gromacs/bin/GMXRC``), which you should source
+from your shell:
+
+::
+
+ source /your/installation/prefix/here/bin/GMXRC
+
+It will detect what kind of shell you are running and set up your
+environment for using |Gromacs|. You may wish to arrange for your
+login scripts to do this automatically; please search the web for
+instructions on how to do this for your shell.
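+
+For example, users of ``bash``-like shells could append a line such as
+the following to their ``~/.bashrc`` (adjusting the path to match
+their own installation prefix)::
+
+ source /your/installation/prefix/here/bin/GMXRC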
+
+Many of the |Gromacs| programs rely on data installed in the
+``share/gromacs`` subdirectory of the installation directory. By
+default, the programs will use the environment variables set in the
+``GMXRC`` script, and if this is not available they will try to guess the
+path based on their own location. This usually works well unless you
+change the names of directories inside the install tree. If you still
+need to do that, you might want to recompile with the new install
+location properly set, or edit the ``GMXRC`` script.
+
+Testing |Gromacs| for correctness
+---------------------------------
+Since 2011, the |Gromacs| development uses an automated system where
+every new code change is subject to regression testing on a number of
+platforms and software combinations. While this improves
+reliability quite a lot, not everything is tested, and since we
+increasingly rely on cutting edge compiler features there is
+non-negligible risk that the default compiler on your system could
+have bugs. We have tried our best to test and refuse to use known bad
+versions in ``cmake``, but we strongly recommend that you run through
+the tests yourself. It only takes a few minutes, after which you can
+trust your build.
+
+The simplest way to run the checks is to build |Gromacs| with
+``-DREGRESSIONTEST_DOWNLOAD=ON``, and run ``make check``.
+|Gromacs| will automatically download and run the tests for you.
+Alternatively, you can download and unpack the |Gromacs|
+regression test suite |gmx-regressiontests-package| tarball yourself
+and use the advanced ``cmake`` option ``REGRESSIONTEST_PATH`` to
+specify the path to the unpacked tarball, which will then be used for
+testing. If the above does not work, then please read on.
+
+The regression tests are also available from the download_ section.
+Once you have downloaded them, unpack the tarball, source
+``GMXRC`` as described above, and run ``./gmxtest.pl all``
+inside the regression tests folder. You can find more options
+(e.g. adding ``double`` when using double precision, or
+``-only expanded`` to run just the tests whose names match
+"expanded") if you just execute the script without options.
+
+Hopefully, you will get a report that all tests have passed. If there
+are individual failed tests it could be a sign of a compiler bug, or
+that a tolerance is just a tiny bit too tight. Check the output files
+the script directs you to, and try a different or newer compiler if
+the errors appear to be real. If you cannot get it to pass the
+regression tests, you might try dropping a line to the gmx-users
+mailing list, but then you should include a detailed description of
+your hardware, and the output of ``mdrun -version`` (which contains
+valuable diagnostic information in the header).
+
+A build with ``-DGMX_BUILD_MDRUN_ONLY`` cannot be tested with
+``make check`` from the build tree, because most of the tests
+require a full build to run things like ``grompp``. To test such an
+mdrun fully requires installing it to the same location as a normal
+build of |Gromacs|, downloading the regression tests tarball manually
+as described above, sourcing the correct ``GMXRC`` and running the
+perl script manually. For example, from your |Gromacs| source
+directory:
+
+::
+
+ mkdir build-normal
+ cd build-normal
+ cmake .. -DCMAKE_INSTALL_PREFIX=/your/installation/prefix/here
+ make -j 4
+ make install
+ cd ..
+ mkdir build-mdrun-only
+ cd build-mdrun-only
+ cmake .. -DGMX_MPI=ON -DGMX_GPU=ON -DGMX_BUILD_MDRUN_ONLY=ON -DCMAKE_INSTALL_PREFIX=/your/installation/prefix/here
+ make -j 4
+ make install
+ cd /to/your/unpacked/regressiontests
+ source /your/installation/prefix/here/bin/GMXRC
+ ./gmxtest.pl all -np 2
+
+If your mdrun program has been suffixed in a non-standard way, then
+the ``./gmxtest.pl -mdrun`` option will let you specify that name to the
+test machinery. You can use ``./gmxtest.pl -double`` to test the
+double-precision version. You can use ``./gmxtest.pl -crosscompiling``
+to stop the test harness attempting to check that the programs can
+be run. You can use ``./gmxtest.pl -mpirun srun`` if your command to
+run an MPI program is called ``srun``.
+
+The ``make check`` target also runs integration-style tests that may run
+with MPI if ``GMX_MPI=ON`` was set. To make these work, you may need to
+set the CMake variables ``MPIEXEC``, ``MPIEXEC_NUMPROC_FLAG``, ``NUMPROC``,
+``MPIEXEC_PREFLAGS`` and ``MPIEXEC_POSTFLAGS`` so that
+``mdrun-mpi-test_mpi`` would run on multiple ranks via the shell command::
+
+ $ ${MPIEXEC} ${MPIEXEC_NUMPROC_FLAG} ${NUMPROC} ${MPIEXEC_PREFLAGS} \
+ mdrun-mpi-test_mpi ${MPIEXEC_POSTFLAGS} -otherflags
+
+Typically, one might use variable values ``mpirun``, ``-np``, ``2``, ``''``,
+``''`` respectively, in order to run on two ranks.
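+
+For example, one hypothetical configuration matching the values above
+would be::
+
+ cmake .. -DGMX_MPI=ON -DMPIEXEC=mpirun \
+       -DMPIEXEC_NUMPROC_FLAG=-np -DNUMPROC=2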
+
+
+Testing |Gromacs| for performance
+---------------------------------
+We are still working on a set of benchmark systems for testing
+the performance of |Gromacs|. Until that is ready, we recommend that
+you try a few different parallelization options, and experiment with
+tools such as ``gmx tune_pme``.
+
+Having difficulty?
+------------------
+You are not alone - this can be a complex task! If you encounter a
+problem with installing |Gromacs|, then there are a number of
+locations where you can find assistance. It is recommended that you
+follow these steps to find the solution:
+
+1. Read the installation instructions again, taking note that you
+ have followed each and every step correctly.
+
+2. Search the |Gromacs| webpage_ and users emailing list for information
+ on the error. Adding
+ ``site:https://mailman-1.sys.kth.se/pipermail/gromacs.org_gmx-users``
+ to a Google search may help filter better results.
+
+3. Search the internet using a search engine such as Google.
+
+4. Post to the |Gromacs| users emailing list gmx-users for
+ assistance. Be sure to give a full description of what you have
+ done and why you think it did not work. Give details about the
+ system on which you are installing. Copy and paste your command
+ line and as much of the output as you think might be relevant -
+ certainly from the first indication of a problem. In particular,
+ please try to include at least the header from the mdrun logfile,
+ and preferably the entire file. People who might volunteer to help
+ you do not have time to ask you interactive detailed follow-up
+ questions, so you will get an answer faster if you provide as much
+ information as you think could possibly help. High quality bug
+ reports tend to receive rapid high quality answers.
+
+Special instructions for some platforms
+=======================================
+
+Building on Windows
+-------------------
+Building on Windows using native compilers is rather similar to
+building on Unix, so please start by reading the above. Then, download
+and unpack the |Gromacs| source archive. Make a folder in which to do
+the out-of-source build of |Gromacs|. For example, make it within the
+folder unpacked from the source archive, and call it ``build-gromacs``.
+
+For CMake, you can either use the graphical user interface provided on
+Windows, or you can use a command line shell with instructions similar
+to the UNIX ones above. If you open a shell from within your IDE
+(e.g. Microsoft Visual Studio), it will configure the environment for
+you, but you might need to tweak this in order to get either a 32-bit
+or 64-bit build environment. The latter provides the fastest
+executable. If you use a normal Windows command shell, then you will
+need to either set up the environment to find your compilers and
+libraries yourself, or run the ``vcvarsall.bat`` batch script provided
+by MSVC (just like sourcing a bash script under Unix).
+
+With the graphical user interface, you will be asked about what
+compilers to use at the initial configuration stage, and if you use
+the command line they can be set in a similar way as under UNIX. You
+will probably make your life easier and faster by using the new
+facility to download and install FFTW automatically.
+
+For the build, you can either load the generated solutions file into
+e.g. Visual Studio, or use the command line with ``cmake --build`` so
+the right tools get used.
+
+Building on Cray
+----------------
+|Gromacs| builds mostly out of the box on modern Cray machines, but
+
+* you may need to specify the use of static binaries
+ with ``-DGMX_BUILD_SHARED_EXE=off``,
+* you may need to set the ``F77`` environment variable to ``ftn`` when
+ compiling FFTW.
+
+Building on BlueGene
+--------------------
+
+BlueGene/Q
+^^^^^^^^^^
+There is currently native acceleration on this platform for the Verlet
+cut-off scheme. There are no plans to provide accelerated kernels for
+the group cut-off scheme, but the default plain C kernels will work
+(slowly).
+
+Only static linking with XL compilers is supported by |Gromacs|. Dynamic
+linking would be supported by the architecture and |Gromacs|, but has no
+advantages other than disk space, and is generally discouraged on
+BlueGene for performance reasons.
+
+Computation on BlueGene floating-point units is always done in
+double-precision. However, mixed-precision builds of |Gromacs| are still
+normal and encouraged since they use cache more efficiently. The
+BlueGene hardware automatically converts values stored in single
+precision in memory to double precision in registers for computation,
+converts the results back to single precision correctly, and does so
+for no additional cost. As with other platforms, doing the whole
+computation in double precision normally shows no improvement in
+accuracy and costs twice as much time moving memory around.
+
+You need to arrange for FFTW to be installed correctly, following the
+above instructions.
+
+MPI wrapper compilers should be used for compiling and linking. Both
+xlc and bgclang are supported back ends - either might prove to be
+faster in practice. The MPI wrapper compilers can make it awkward to
+attempt to use IBM's optimized BLAS/LAPACK called ESSL (see the
+section on `linear algebra libraries`_). Since mdrun is the only part
+of |Gromacs| that should normally run on the compute nodes, and there is
+nearly no need for linear algebra support for mdrun, it is recommended
+to use the |Gromacs| built-in linear algebra routines - this is never
+a problem for normal simulations.
+
+The recommended configuration is to use
+
+::
+
+ cmake .. -DCMAKE_C_COMPILER=mpicc \
+ -DCMAKE_CXX_COMPILER=mpicxx \
+ -DCMAKE_TOOLCHAIN_FILE=Platform/BlueGeneQ-static-XL-CXX.cmake \
+ -DCMAKE_PREFIX_PATH=/your/fftw/installation/prefix \
+ -DGMX_MPI=ON \
+ -DGMX_BUILD_MDRUN_ONLY=ON
+ make
+ make install
+
+which will build a statically-linked MPI-enabled mdrun for the compute
+nodes. Or use the ``Platform/BlueGeneQ-static-bgclang-cxx``
+toolchain file if compiling with bgclang. Otherwise, the default
+|Gromacs| configuration behaviour applies.
+
+It is possible to configure and make the remaining |Gromacs| tools with
+the compute-node toolchain, but as none of those tools are MPI-aware
+and could then only run on the compute nodes, this would not normally
+be useful. Instead, these should be planned to run on the login node,
+and a separate |Gromacs| installation performed for that using the login
+node's toolchain - not the above platform file, or any other
+compute-node toolchain.
+
+Note that only the MPI build is available for the compute-node
+toolchains. The |Gromacs| thread-MPI or no-MPI builds are not useful at
+all on BlueGene/Q.
+
+BlueGene/P
+^^^^^^^^^^
+There is currently no SIMD support on this platform and no plans to
+add it. The default plain C kernels will work.
+
+Fujitsu PRIMEHPC
+^^^^^^^^^^^^^^^^
+This is the architecture of the K computer, which uses Fujitsu
+Sparc64VIIIfx chips. On this platform, |Gromacs| has
+accelerated group kernels using the HPC-ACE instructions, no
+accelerated Verlet kernels, and a custom build toolchain. Since this
+particular chip only does double precision SIMD, the default setup
+is to build |Gromacs| in double. Since most users only need single, we have added
+an option ``GMX_RELAXED_DOUBLE_PRECISION`` to accept single precision square root
+accuracy in the group kernels; unless you know that you really need 15 digits
+of accuracy in each individual force, we strongly recommend you use this. Note
+that all summation and other operations are still done in double.
+
+The recommended configuration is to use
+
+::
+
+ cmake .. -DCMAKE_TOOLCHAIN_FILE=Toolchain-Fujitsu-Sparc64-mpi.cmake \
+ -DCMAKE_PREFIX_PATH=/your/fftw/installation/prefix \
+ -DCMAKE_INSTALL_PREFIX=/where/gromacs/should/be/installed \
+ -DGMX_MPI=ON \
+ -DGMX_BUILD_MDRUN_ONLY=ON \
+ -DGMX_RELAXED_DOUBLE_PRECISION=ON
+ make
+ make install
+
+Intel Xeon Phi
+^^^^^^^^^^^^^^
+|Gromacs| has preliminary support for Intel Xeon Phi. Only symmetric
+(aka native) mode is supported. |Gromacs| is functional on Xeon Phi, but
+it has so far not been optimized to the same level as other
+architectures have. The performance depends among other factors on the
+system size, and for
+now the performance might not be faster than on CPUs. Building for Xeon
+Phi works almost like on any other Unix. See the instructions above for
+details. The recommended configuration is
+
+::
+
+ cmake .. -DCMAKE_TOOLCHAIN_FILE=Platform/XeonPhi
+ make
+ make install
+
+Tested platforms
+================
+While it is our best belief that |Gromacs| will build and run pretty
+much everywhere, it is important that we tell you where we really know
+it works because we have tested it. We do test on Linux, Windows, and
+Mac with a range of compilers and libraries for a range of our
+configuration options. Every commit in our git source code repository
+is currently tested on x86 with gcc versions ranging from 4.4 through
+4.7, versions 12 and 13 of the Intel compiler, and Clang
+versions 3.1 through 3.4. For this, we use a variety of GNU/Linux
+flavors and versions as well as recent versions of Mac OS X. Under
+Windows we test both MSVC and the Intel compiler. For details, you can
+have a look at the `continuous integration server used by GROMACS`_,
+which runs Jenkins_.
+
+We test irregularly on ARM v7, BlueGene/Q, Cray, Fujitsu PRIMEHPC, Google
+Native Client and other environments, and with other compilers and
+compiler versions, too.
--- /dev/null
- ``GMX_MAX_MPI_THREADS``
- sets the maximum number of MPI-threads that :ref:`gmx mdrun`
- can use.
-
+Environment Variables
+=====================
+
+|Gromacs| programs may be influenced by the use of
+environment variables. First of all, the variables set in
+the ``GMXRC`` file are essential for running and
+compiling |Gromacs|. Some other useful environment variables are
+listed in the following sections. Most environment variables function
+by being set in your shell to any non-NULL value. Specific
+requirements are described below if other values need to be set. You
+should consult the documentation for your shell for instructions on
+how to set environment variables in the current shell, or in config
+files for future shells. Note that requirements for exporting
+environment variables to jobs run under batch control systems vary and
+you should consult your local documentation for details.
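+
+For example, in a ``bash``-like shell a variable documented below can
+be enabled with ``export``, while ``csh``-like shells use ``setenv``
+(``GMX_MAXBACKUP`` is used here only as an illustration)::
+
+ export GMX_MAXBACKUP=-1 # bash, zsh
+ setenv GMX_MAXBACKUP -1 # csh, tcsh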
+
+Output Control
+--------------
+``GMX_CONSTRAINTVIR``
+ Print constraint virial and force virial energy terms.
+
+``GMX_MAXBACKUP``
+ |Gromacs| automatically backs up old
+ copies of files when trying to write a new file of the same
+ name, and this variable controls the maximum number of
+ backups that will be made (default 99). If set to 0, |Gromacs|
+ refuses to run if any output file already exists, and if set to
+ -1, it overwrites any output file without making a backup.
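+
+ For example, to overwrite outputs without creating backups (a
+ hypothetical invocation)::
+
+   GMX_MAXBACKUP=-1 gmx mdrun -deffnm sim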
+
+``GMX_NO_QUOTES``
+ if this is explicitly set, no cool quotes
+ will be printed at the end of a program.
+
+``GMX_SUPPRESS_DUMP``
+ prevent dumping of step files when the system blows
+ up, for example during failure of constraint
+ algorithms.
+
+``GMX_TPI_DUMP``
+ dump all configurations that have an interaction energy
+ less than the value set in this environment variable
+ to a :ref:`pdb` file.
+
+``GMX_VIEW_XPM``
+ ``GMX_VIEW_XVG``, ``GMX_VIEW_EPS`` and ``GMX_VIEW_PDB``, commands used to
+ automatically view :ref:`xpm`, :ref:`xvg`, :ref:`eps`
+ and :ref:`pdb` file types, respectively; they default to ``xv``, ``xmgrace``,
+ ``ghostview`` and ``rasmol``. Set to empty to disable
+ automatic viewing of a particular file type. The command will
+ be forked off and run in the background at the same priority
+ as the |Gromacs| tool (which might not be what you want).
+ Be careful not to use a command which blocks the terminal
+ (e.g. ``vi``), since multiple instances might be run.
+
+``GMX_VIRIAL_TEMPERATURE``
+ print the virial temperature energy term.
+
+``GMX_LOG_BUFFER``
+ the size of the buffer for file I/O. When set
+ to 0, all file I/O will be unbuffered and therefore very slow.
+ This can be handy for debugging purposes, because it ensures
+ that all files are always totally up-to-date.
+
+``GMX_LOGO_COLOR``
+ set display color for logo in :ref:`gmx view`.
+
+``GMX_PRINT_LONGFORMAT``
+ use long float format when printing
+ decimal values.
+
+``GMX_COMPELDUMP``
+ Applies for computational electrophysiology setups
+ only (see reference manual). The initial structure gets dumped to a
+ :ref:`pdb` file, which allows one to check whether multimeric channels have
+ the correct PBC representation.
+
+Debugging
+---------
+``GMX_PRINT_DEBUG_LINES``
+ when set, print debugging info on line numbers.
+
+``GMX_DD_NST_DUMP``
+ number of steps that elapse between dumping
+ the current DD to a PDB file (default 0). This only takes effect
+ during domain decomposition, so it should typically be
+ 0 (never), 1 (every DD phase) or a multiple of :mdp:`nstlist`.
+
+``GMX_DD_NST_DUMP_GRID``
+ number of steps that elapse between dumping
+ the current DD grid to a PDB file (default 0). This only takes effect
+ during domain decomposition, so it should typically be
+ 0 (never), 1 (every DD phase) or a multiple of :mdp:`nstlist`.
+
+``GMX_DD_DEBUG``
+ general debugging trigger for every domain
+ decomposition (default 0, meaning off). Currently only checks
+ global-local atom index mapping for consistency.
+
+``GMX_DD_NPULSE``
+ over-ride the number of DD pulses used
+ (default 0, meaning no over-ride). Normally 1 or 2.
+
+There are a number of extra environment variables like these
+that are used in debugging - check the code!
+
+Performance and Run Control
+---------------------------
+``GMX_DO_GALACTIC_DYNAMICS``
+ planetary simulations are made possible (just for fun) by setting
+ this environment variable, which allows setting :mdp:`epsilon-r` to -1 in the :ref:`mdp`
+ file. Normally, :mdp:`epsilon-r` must be greater than zero to prevent a fatal error.
+ See webpage_ for example input files for a planetary simulation.
+
+``GMX_ALLOW_CPT_MISMATCH``
+ when set, runs will not exit if the
+ ensemble set in the :ref:`tpr` file does not match that of the
+ :ref:`cpt` file.
+
+``GMX_CUDA_NB_EWALD_TWINCUT``
+ force the use of twin-range cutoff kernel even if :mdp:`rvdw` equals
+ :mdp:`rcoulomb` after PP-PME load balancing. The switch to twin-range kernels is automated,
+ so this variable should be used only for benchmarking.
+
+``GMX_CUDA_NB_ANA_EWALD``
+ force the use of analytical Ewald kernels. Should be used only for benchmarking.
+
+``GMX_CUDA_NB_TAB_EWALD``
+ force the use of tabulated Ewald kernels. Should be used only for benchmarking.
+
+``GMX_CUDA_STREAMSYNC``
+ force the use of cudaStreamSynchronize on ECC-enabled GPUs, which leads
+ to performance loss due to a known CUDA driver bug present in API v5.0 NVIDIA drivers (pre-30x.xx).
+ Cannot be set simultaneously with ``GMX_NO_CUDA_STREAMSYNC``.
+
+``GMX_CYCLE_ALL``
+ times all code during runs. Incompatible with threads.
+
+``GMX_CYCLE_BARRIER``
+ calls MPI_Barrier before each cycle start/stop call.
+
+``GMX_DD_ORDER_ZYX``
+ build domain decomposition cells in the order
+ (z, y, x) rather than the default (x, y, z).
+
+``GMX_DD_USE_SENDRECV2``
+ during constraint and vsite communication, use a pair
+ of ``MPI_Sendrecv`` calls instead of two simultaneous non-blocking calls
+ (default 0, meaning off). Might be faster on some MPI implementations.
+
+``GMX_DLB_BASED_ON_FLOPS``
+ do domain-decomposition dynamic load balancing based on flop count rather than
+ measured time elapsed (default 0, meaning off).
+ This makes the load balancing reproducible, which can be useful for debugging purposes.
+ A value of 1 uses the flops; a value > 1 adds (value - 1)*5% of noise to the flops to increase the imbalance and the scaling.
+
+``GMX_DLB_MAX_BOX_SCALING``
+ maximum percentage box scaling permitted per domain-decomposition
+ load-balancing step (default 10).
+
+``GMX_DD_RECORD_LOAD``
+ record DD load statistics for reporting at end of the run (default 1, meaning on)
+
+``GMX_DD_NST_SORT_CHARGE_GROUPS``
+ number of steps that elapse between re-sorting of the charge
+ groups (default 1). This only takes effect during domain decomposition, so should typically
+ be 0 (never), 1 (to mean at every domain decomposition), or a multiple of :mdp:`nstlist`.
+
+``GMX_DETAILED_PERF_STATS``
+ when set, print slightly more detailed performance information
+ to the :ref:`log` file. The resulting output is the way performance summary is reported in versions
+ 4.5.x and thus may be useful for anyone using scripts to parse :ref:`log` files or standard output.
+
+``GMX_DISABLE_SIMD_KERNELS``
+ disables architecture-specific SIMD-optimized (SSE2, SSE4.1, AVX, etc.)
+ non-bonded kernels thus forcing the use of plain C kernels.
+
+``GMX_DISABLE_CUDA_TIMING``
+ timing of asynchronously executed GPU operations can have a
+ non-negligible overhead with short step times. Disabling timing can improve performance in these cases.
+
+``GMX_DISABLE_GPU_DETECTION``
+ when set, disables GPU detection even if :ref:`gmx mdrun` was compiled
+ with GPU support.
+
+``GMX_DISRE_ENSEMBLE_SIZE``
+ the number of systems for distance restraint ensemble
+ averaging. Takes an integer value.
+
+``GMX_EMULATE_GPU``
+ emulate GPU runs by using algorithmically equivalent CPU reference code instead of
+ GPU-accelerated functions. As the CPU code is slow, it is intended to be used only for debugging purposes.
+ The behavior is automatically triggered if non-bonded calculations are turned off using ``GMX_NO_NONBONDED``,
+ in which case the non-bonded calculations will not be called, but the CPU-GPU transfer will also be skipped.
+
+``GMX_ENX_NO_FATAL``
+ disable exiting upon encountering a corrupted frame in an :ref:`edr`
+ file, allowing the use of all frames up until the corruption.
+
+``GMX_FORCE_UPDATE``
+ update forces when invoking ``mdrun -rerun``.
+
+``GMX_GPU_ID``
+ set in the same way as ``mdrun -gpu_id``, ``GMX_GPU_ID``
+ allows the user to specify different GPU IDs, which can be useful for selecting different
+ devices on different compute nodes in a cluster. Cannot be used in conjunction with ``mdrun -gpu_id``.
+
+``GMX_IGNORE_FSYNC_FAILURE_ENV``
+ allows :ref:`gmx mdrun` to continue even if
+ a file is missing.
+
+``GMX_LJCOMB_TOL``
+ when set to a floating-point value, overrides the default tolerance of
+ 1e-5 for force-field floating-point parameters.
+
+``GMX_MAXCONSTRWARN``
+ if set to -1, :ref:`gmx mdrun` will
+ not exit if it produces too many LINCS warnings.
+
+``GMX_NB_GENERIC``
+ use the generic C kernel. Should be set if using
+ the group-based cutoff scheme and also sets ``GMX_NO_SOLV_OPT`` to be true,
+ thus disabling solvent optimizations as well.
+
+``GMX_NB_MIN_CI``
+ neighbor list balancing parameter used when running on GPU. Sets the
+ target minimum number of pair-lists in order to improve multi-processor load-balance for better
+ performance with small simulation systems. Must be set to a positive integer, the default value
+ is optimized for NVIDIA Fermi and Kepler GPUs, therefore changing it is not necessary for
+ normal usage, but it can be useful on future architectures.
+
+``GMX_NBLISTCG``
+ use neighbor list and kernels based on charge groups.
+
+``GMX_NBNXN_CYCLE``
+ when set, print detailed neighbor search cycle counting.
+
+``GMX_NBNXN_EWALD_ANALYTICAL``
+ force the use of analytical Ewald non-bonded kernels,
+ mutually exclusive with ``GMX_NBNXN_EWALD_TABLE``.
+
+``GMX_NBNXN_EWALD_TABLE``
+ force the use of tabulated Ewald non-bonded kernels,
+ mutually exclusive with ``GMX_NBNXN_EWALD_ANALYTICAL``.
+
+``GMX_NBNXN_SIMD_2XNN``
+ force the use of 2x(N+N) SIMD CPU non-bonded kernels,
+ mutually exclusive with ``GMX_NBNXN_SIMD_4XN``.
+
+``GMX_NBNXN_SIMD_4XN``
+ force the use of 4xN SIMD CPU non-bonded kernels,
+ mutually exclusive with ``GMX_NBNXN_SIMD_2XNN``.
+
+``GMX_NO_ALLVSALL``
+ disables optimized all-vs-all kernels.
+
+``GMX_NO_CART_REORDER``
+ used in initializing domain decomposition communicators. Rank reordering
+ is default, but can be switched off with this environment variable.
+
+``GMX_NO_CUDA_STREAMSYNC``
+ the opposite of ``GMX_CUDA_STREAMSYNC``. Disables the use of the
+ standard cudaStreamSynchronize-based GPU waiting to improve performance when using CUDA driver API
+ earlier than v5.0 with ECC-enabled GPUs.
+
+``GMX_NO_INT``, ``GMX_NO_TERM``, ``GMX_NO_USR1``
+ disable signal handlers for SIGINT,
+ SIGTERM, and SIGUSR1, respectively.
+
+``GMX_NO_NODECOMM``
+ do not use separate inter- and intra-node communicators.
+
+``GMX_NO_NONBONDED``
+ skip non-bonded calculations; can be used to estimate the possible
+ performance gain from adding a GPU accelerator to the current hardware setup -- assuming that this is
+ fast enough to complete the non-bonded calculations while the CPU does bonded force and PME computation.
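+
+ For example, one could compare a short baseline run against the same
+ run with non-bonded calculations skipped (the ``-deffnm`` and
+ ``-nsteps`` values here are illustrative)::
+
+  # baseline timing
+  gmx mdrun -deffnm topol -nsteps 1000
+  # identical run with non-bonded calculations skipped
+  GMX_NO_NONBONDED=1 gmx mdrun -deffnm topol -nsteps 1000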
+
+``GMX_NO_PULLVIR``
+ when set, do not add virial contribution to COM pull forces.
+
+``GMX_NOCHARGEGROUPS``
+ disables multi-atom charge groups, i.e. each atom
+ in all non-solvent molecules is assigned its own charge group.
+
+``GMX_NOPREDICT``
+ shell positions are not predicted.
+
+``GMX_NO_SOLV_OPT``
+ turns off solvent optimizations; automatic if ``GMX_NB_GENERIC``
+ is enabled.
+
+``GMX_NSCELL_NCG``
+ the ideal number of charge groups per neighbor searching grid cell is hard-coded
+ to a value of 10. Setting this environment variable to any other integer value overrides this hard-coded
+ value.
+
+``GMX_PME_NTHREADS``
+ set the number of OpenMP or PME threads (overrides the number guessed by
+ :ref:`gmx mdrun`).
+
+``GMX_PME_P3M``
+ use P3M-optimized influence function instead of smooth PME B-spline interpolation.
+
+``GMX_PME_THREAD_DIVISION``
+ PME thread division in the format "x y z" for all three dimensions. The
+ sum of the threads in each dimension must equal the total number of PME threads (set in
+ ``GMX_PME_NTHREADS``).
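+
+ For example, a "3 2 1" division of 6 PME threads (illustrative
+ values; the run options are hypothetical)::
+
+  export GMX_PME_NTHREADS=6
+  export GMX_PME_THREAD_DIVISION="3 2 1"
+  gmx mdrun -deffnm topol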
+
+``GMX_PMEONEDD``
+ if the number of domain decomposition cells is set to 1 for both x and y,
+ decompose PME in one dimension.
+
+``GMX_REQUIRE_SHELL_INIT``
+ require that shell positions are initiated.
+
+``GMX_REQUIRE_TABLES``
+ require the use of tabulated Coulombic
+ and van der Waals interactions.
+
+``GMX_SCSIGMA_MIN``
+ the minimum value for soft-core sigma. **Note** that this value is set
+ using the :mdp:`sc-sigma` keyword in the :ref:`mdp` file, but this environment variable can be used
+ to reproduce pre-4.5 behavior with respect to this parameter.
+
+``GMX_TPIC_MASSES``
+ should contain multiple masses used for test particle insertion into a cavity.
+ The center of mass of the last atoms is used for insertion into the cavity.
+
+``GMX_USE_GRAPH``
+ use graph for bonded interactions.
+
+``GMX_VERLET_BUFFER_RES``
+ resolution of buffer size in Verlet cutoff scheme. The default value is
+ 0.001, but can be overridden with this environment variable.
+
+``MPIRUN``
+ the ``mpirun`` command used by :ref:`gmx tune_pme`.
+
+``MDRUN``
+ the :ref:`gmx mdrun` command used by :ref:`gmx tune_pme`.
+
+``GMX_NSTLIST``
+ sets the default value for :mdp:`nstlist`, preventing it from being tuned during
+ :ref:`gmx mdrun` startup when using the Verlet cutoff scheme.
+
+``GMX_USE_TREEREDUCE``
+ use tree reduction for nbnxn force reduction. Potentially faster for large number of
+ OpenMP threads (if memory locality is important).
+
+Analysis and Core Functions
+---------------------------
+``GMX_QM_ACCURACY``
+ accuracy in Gaussian L510 (MC-SCF) component program.
+
+``GMX_QM_ORCA_BASENAME``
+ prefix of :ref:`tpr` files, used in Orca calculations
+ for input and output file names.
+
+``GMX_QM_CPMCSCF``
+ when set to a nonzero value, Gaussian QM calculations will
+ iteratively solve the CP-MCSCF equations.
+
+``GMX_QM_MODIFIED_LINKS_DIR``
+ location of modified links in Gaussian.
+
+``DSSP``
+ used by :ref:`gmx do_dssp` to point to the ``dssp``
+ executable (not just its path).
+
+``GMX_QM_GAUSS_DIR``
+ directory where Gaussian is installed.
+
+``GMX_QM_GAUSS_EXE``
+ name of the Gaussian executable.
+
+``GMX_DIPOLE_SPACING``
+ spacing used by :ref:`gmx dipoles`.
+
+``GMX_MAXRESRENUM``
+ sets the maximum number of residues to be renumbered by
+ :ref:`gmx grompp`. A value of -1 indicates all residues should be renumbered.
+
+``GMX_FFRTP_TER_RENAME``
+ Some force fields (like AMBER) use specific names for N- and C-
+ terminal residues (NXXX and CXXX) as :ref:`rtp` entries that are normally renamed. Setting
+ this environment variable disables this renaming.
+
+``GMX_PATH_GZIP``
+ ``gunzip`` executable, used by :ref:`gmx wham`.
+
+``GMX_FONT``
+ name of X11 font used by :ref:`gmx view`.
+
+``GMXTIMEUNIT``
+ the time unit used in output files; can be any of
+ fs, ps, ns, us, ms, s, m, or h.
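+
+ For example, to have tools that honor this variable report times in
+ nanoseconds (file names are illustrative)::
+
+  export GMXTIMEUNIT=ns
+  gmx energy -f ener.edr -o energy.xvg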
+
+``GMX_QM_GAUSSIAN_MEMORY``
+ memory used for Gaussian QM calculation.
+
+``MULTIPROT``
+ name of the ``multiprot`` executable, used by the
+ contributed program ``do_multiprot``.
+
+``NCPUS``
+ number of CPUs to be used for Gaussian QM calculation.
+
+``GMX_ORCA_PATH``
+ directory where Orca is installed.
+
+``GMX_QM_SA_STEP``
+ simulated annealing step size for Gaussian QM calculation.
+
+``GMX_QM_GROUND_STATE``
+ defines state for Gaussian surface hopping calculation.
+
+``GMX_TOTAL``
+ name of the ``total`` executable used by the contributed
+ ``do_shift`` program.
+
+``GMX_ENER_VERBOSE``
+ make :ref:`gmx energy` and :ref:`gmx eneconv`
+ loud and noisy.
+
+``VMD_PLUGIN_PATH``
+ where to find VMD plug-ins. Needed to be
+ able to read file formats recognized only by a VMD plug-in.
+
+``VMDDIR``
+ base path of VMD installation.
+
+``GMX_USE_XMGR``
+ sets viewer to ``xmgr`` (deprecated) instead of ``xmgrace``.