Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 20 Dec 2005 01:21:30 +0000 (UTC)
From:      Bruce Evans <bde@FreeBSD.org>
To:        src-committers@FreeBSD.org, cvs-src@FreeBSD.org, cvs-all@FreeBSD.org
Subject:   cvs commit: src/lib/msun/src s_cbrt.c
Message-ID:  <200512200121.jBK1LUTd050099@repoman.freebsd.org>

next in thread | raw e-mail | index | archive | help
bde         2005-12-20 01:21:30 UTC

  FreeBSD src repository

  Modified files:
    lib/msun/src         s_cbrt.c 
  Log:
  Extract the high and low words together.  With gcc-3.4 on uniformly
  distributed non-large args, this saves about 14 of 134 cycles for
  Athlon64s and about 5 of 199 cycles for AthlonXPs.
  
  Moved the check for x == 0 inside the check for subnormals.  With
  gcc-3.4 on uniformly distributed non-large args, this saves another
  5 cycles on Athlon64s and loses 1 cycle on AthlonXPs.
  
  Use INSERT_WORDS() and not SET_HIGH_WORD() when converting the first
  approximation from bits to double.  With gcc-3.4 on uniformly distributed
  non-large args, this saves another 4 cycles on both Athlon64s and and
  AthlonXPs.
  
  Accessing doubles as 2 words may be an optimization on old CPUs, but on
  current CPUs it tends to cause extra operations and pipeline stalls,
  especially for writes, even when only 1 of the words needs to be accessed.
  
  Removed an unused variable.
  
  Revision  Changes    Path
  1.14      +6 -8      src/lib/msun/src/s_cbrt.c



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200512200121.jBK1LUTd050099>