Patent | Date |
---|
Optimizing branch re-wiring in a software instruction cache Grant 10,782,973 - Bertolli , et al. Sept | 2020-09-22 |
Multi-petascale highly efficient parallel supercomputer Grant 9,971,713 - Asaad , et al. May 15, 2 | 2018-05-15 |
Program structure-based blocking Grant 9,772,824 - Bertolli , et al. September 26, 2 | 2017-09-26 |
Program structure-based blocking Grant 9,772,825 - Bertolli , et al. September 26, 2 | 2017-09-26 |
Parallel execution unit that extracts data parallelism at runtime Grant 9,696,995 - Eichenberger , et al. July 4, 2 | 2017-07-04 |
Parallel execution unit that extracts data parallelism at runtime Grant 9,696,996 - Eichenberger , et al. July 4, 2 | 2017-07-04 |
All-to-all permutation of vector elements based on a permutation pattern encoded in mantissa and exponent bits in a floating-point SIMD architecture Grant 9,652,231 - Eichenberger , et al. May 16, 2 | 2017-05-16 |
Matrix multiplication operations using pair-wise load and splat operations Grant 9,600,281 - Eichenberger , et al. March 21, 2 | 2017-03-21 |
SIMD compare instruction using permute logic for distributed register files Grant 9,575,753 - Eichenberger , et al. February 21, 2 | 2017-02-21 |
Schedulers with load-store queue awareness Grant 9,563,428 - Chen , et al. February 7, 2 | 2017-02-07 |
Schedulers with load-store queue awareness Grant 9,552,196 - Chen , et al. January 24, 2 | 2017-01-24 |
Dedicated memory structure holding data for detecting available worker thread(s) and informing available worker thread(s) of task(s) to execute Grant 9,519,583 - Chiu , et al. December 13, 2 | 2016-12-13 |
Optimizing Branch Re-wiring In A Software Instruction Cache App 20160335087 - Bertolli; Carlo ;   et al. | 2016-11-17 |
Adaptive runtime for a multiprocessing API Grant 9,495,274 - Antao , et al. November 15, 2 | 2016-11-15 |
Adaptive runtime for a multiprocessing API Grant 9,465,714 - Antao , et al. October 11, 2 | 2016-10-11 |
Program Structure-based Blocking App 20160283208 - Bertolli; Carlo ;   et al. | 2016-09-29 |
Schedulers With Load-store Queue Awareness App 20160283248 - Chen; Tong ;   et al. | 2016-09-29 |
Program Structure-based Blocking App 20160283210 - Bertolli; Carlo ;   et al. | 2016-09-29 |
Schedulers With Load-store Queue Awareness App 20160283212 - Chen; Tong ;   et al. | 2016-09-29 |
Multi-petascale Highly Efficient Parallel Supercomputer App 20160011996 - Asaad; Sameh ;   et al. | 2016-01-14 |
Systems, methods and computer products for cross-thread scheduling Grant 9,223,580 - Eichenberger , et al. December 29, 2 | 2015-12-29 |
Multi-petascale highly efficient parallel supercomputer Grant 9,081,501 - Asaad , et al. July 14, 2 | 2015-07-14 |
Analyze and reduce number of data reordering operations in SIMD code Grant 8,954,943 - Eichenberger , et al. February 10, 2 | 2015-02-10 |
Vector loads with multiple vector elements from a same cache line in a scattered load operation Grant 8,904,153 - Eichenberger , et al. December 2, 2 | 2014-12-02 |
Constant time worker thread allocation via configuration caching Grant 8,881,159 - Eichenberger , et al. November 4, 2 | 2014-11-04 |
Efficient software cache accessing with handle reuse Grant 8,819,651 - Eichenberger , et al. August 26, 2 | 2014-08-26 |
Management of conditional branches within a data parallel system Grant 8,726,252 - Eichenberger , et al. May 13, 2 | 2014-05-13 |
Building approximate data dependences with a moving window Grant 8,667,260 - Eichenberger , et al. March 4, 2 | 2014-03-04 |
Complex matrix multiplication operations with data pre-conditioning in a high performance computing architecture Grant 8,650,240 - Eichenberger , et al. February 11, 2 | 2014-02-11 |
Write-through cache optimized for dependence-free parallel regions Grant 8,627,010 - Eichenberger , et al. January 7, 2 | 2014-01-07 |
Data parallel function call for determining if called routine is data parallel Grant 8,627,043 - Eichenberger , et al. January 7, 2 | 2014-01-07 |
Data parallel function call for determining if called routine is data parallel Grant 8,627,042 - Eichenberger , et al. January 7, 2 | 2014-01-07 |
Reducing parallelism of computer source code Grant 8,584,103 - Bondhugula , et al. November 12, 2 | 2013-11-12 |
Runtime extraction of data parallelism Grant 8,583,905 - Eichenberger , et al. November 12, 2 | 2013-11-12 |
Matrix multiplication operations with data pre-conditioning in a high performance computing architecture Grant 8,577,950 - Eichenberger , et al. November 5, 2 | 2013-11-05 |
Optimized scalar promotion with load and splat SIMD instructions Grant 8,572,586 - Eichenberger , et al. October 29, 2 | 2013-10-29 |
Runtime extraction of data parallelism Grant 8,572,359 - Eichenberger , et al. October 29, 2 | 2013-10-29 |
Thread Specific Compiler Generated Customization of Runtime Support for Application Programming Interfaces App 20130283250 - Eichenberger; Alexandre E. ;   et al. | 2013-10-24 |
Optimized code generation targeting a high locality software cache Grant 8,561,044 - Chen , et al. October 15, 2 | 2013-10-15 |
Data transfer optimized software cache for irregular memory references Grant 8,561,043 - Ayguade , et al. October 15, 2 | 2013-10-15 |
Framework for generating mixed-mode operations in loop-level simdization Grant 8,549,501 - Eichenberger , et al. October 1, 2 | 2013-10-01 |
SIMD Compare Instruction Using Permute Logic for Distributed Register Files App 20130246737 - Eichenberger; Alexandre E. ;   et al. | 2013-09-19 |
Data transfer optimized software cache for regular memory references Grant 8,527,974 - Ayguade , et al. September 3, 2 | 2013-09-03 |
Checkpointing in speculative versioning caches Grant 8,521,961 - Eichenberger , et al. August 27, 2 | 2013-08-27 |
Write-through cache optimized for dependence-free parallel regions Grant 8,516,197 - Eichenberger , et al. August 20, 2 | 2013-08-20 |
Shared prefetching to reduce execution skew in multi-threaded systems Grant 8,490,071 - Eichenberger , et al. July 16, 2 | 2013-07-16 |
Tracking and detecting thread dependencies using speculative versioning cache Grant 8,468,539 - Eichenberger , et al. June 18, 2 | 2013-06-18 |
Parallelization of irregular reductions via parallel building and exploitation of conflict-free units of work at runtime Grant 8,468,508 - Eichenberger , et al. June 18, 2 | 2013-06-18 |
Efficient Enqueuing of Values in SIMD Engines with Permute Unit App 20130151822 - Eichenberger; Alexandre E. ;   et al. | 2013-06-13 |
Runtime dependence-aware scheduling using assist thread Grant 8,464,271 - Eichenberger , et al. June 11, 2 | 2013-06-11 |
Method and structure of using SIMD vector architectures to implement matrix multiplication Grant 8,458,442 - Eichenberger , et al. June 4, 2 | 2013-06-04 |
Insertion of operation-and-indicate instructions for optimized SIMD code Grant 8,458,684 - Eichenberger , et al. June 4, 2 | 2013-06-04 |
Code generation for complex arithmetic reduction for architectures lacking cross data-path support Grant 8,423,979 - Archambault , et al. April 16, 2 | 2013-04-16 |
Version pressure feedback mechanisms for speculative versioning caches Grant 8,397,052 - Eichenberger , et al. March 12, 2 | 2013-03-12 |
Optimizing scalar code executed on a SIMD engine by alignment of SIMD slots Grant 8,370,817 - Eichenberger , et al. February 5, 2 | 2013-02-05 |
Optimized software cache lookup for SIMD architectures Grant 8,370,575 - Eichenberger , et al. February 5, 2 | 2013-02-05 |
Write-through Cache Optimized For Dependence-free Parallel Regions App 20120331232 - Eichenberger; Alexandre E. ;   et al. | 2012-12-27 |
Single instruction multiple data (SIMD) code generation for parallel loops using versioning and scheduling Grant 8,341,615 - Eichenberger , et al. December 25, 2 | 2012-12-25 |
Optimized Scalar Promotion with Load and Splat SIMD Instructions App 20120290816 - Eichenberger; Alexandre E. ;   et al. | 2012-11-15 |
Constant Time Worker Thread Allocation Via Configuration Caching App 20120246654 - Eichenberger; Alexandre E. ;   et al. | 2012-09-27 |
Method using SLP packing with statements having both isomorphic and non-isomorphic expressions Grant 8,266,587 - Eichenberger , et al. September 11, 2 | 2012-09-11 |
Optimized scalar promotion with load and splat SIMD instructions Grant 8,255,884 - Eichenberger , et al. August 28, 2 | 2012-08-28 |
Write-through Cache Optimized For Dependence-free Parallel Regions App 20120210073 - Eichenberger; Alexandre E. ;   et al. | 2012-08-16 |
SIMD code generation for loops with mixed data lengths Grant 8,245,208 - Eichenberger , et al. August 14, 2 | 2012-08-14 |
Runtime Dependence-Aware Scheduling Using Assist Thread App 20120204189 - Eichenberger; Alexandre E. ;   et al. | 2012-08-09 |
Management Of Conditional Branches Within A Data Parallel System App 20120198425 - Eichenberger; Alexandre E. ;   et al. | 2012-08-02 |
Runtime Extraction of Data Parallelism App 20120192167 - Eichenberger; Alexandre E. ;   et al. | 2012-07-26 |
Parallel Execution Unit that Extracts Data Parallelism at Runtime App 20120191953 - Eichenberger; Alexandre E. ;   et al. | 2012-07-26 |
Data Parallel Function Call for Determining if Called Routine is Data Parallel App 20120180031 - Eichenberger; Alexandre E. ;   et al. | 2012-07-12 |
Runtime dependence-aware scheduling using assist thread Grant 8,214,831 - Eichenberger , et al. July 3, 2 | 2012-07-03 |
SIMD code generation in the presence of optimized misaligned data reorganization Grant 8,196,124 - Eichenberger , et al. June 5, 2 | 2012-06-05 |
Efficient code generation using loop peeling for SIMD loop code with multile misaligned statements Grant 8,171,464 - Eichenberger , et al. May 1, 2 | 2012-05-01 |
Method and apparatus for data stream alignment support Grant 8,156,310 - Eichenberger , et al. April 10, 2 | 2012-04-10 |
Efficient data reorganization to satisfy data alignment constraints Grant 8,146,067 - Eichenberger , et al. March 27, 2 | 2012-03-27 |
Method to exploit superword-level parallelism using semi-isomorphic packing Grant 8,136,105 - Eichenberger , et al. March 13, 2 | 2012-03-13 |
Vector Loads from Scattered Memory Locations App 20120060016 - Eichenberger; Alexandre E. ;   et al. | 2012-03-08 |
Vector Loads with Multiple Vector Elements from a Same Cache Line in a Scattered Load Operation App 20120060015 - Eichenberger; Alexandre E. ;   et al. | 2012-03-08 |
Matrix Multiplication Operations Using Pair-Wise Load and Splat Operations App 20120011348 - Eichenberger; Alexandre E. ;   et al. | 2012-01-12 |
Domain stretching for an advanced dual-representation polyhedral loop transformation framework Grant 8,087,011 - Eichenberger , et al. December 27, 2 | 2011-12-27 |
Selective code generation optimization for an advanced dual-representation polyhedral loop transformation framework Grant 8,087,010 - Eichenberger , et al. December 27, 2 | 2011-12-27 |
Reducing Parallelism Of Computer Source Code App 20110314442 - Bondhugula; Uday Kumar ;   et al. | 2011-12-22 |
System and method for advanced polyhedral loop transformations of source code in a compiler Grant 8,060,870 - Eichenberger , et al. November 15, 2 | 2011-11-15 |
Shared Prefetching to Reduce Execution Skew in Multi-Threaded Systems App 20110276786 - Eichenberger; Alexandre E. ;   et al. | 2011-11-10 |
Stable transitions in the presence of conditionals for an advanced dual-representation polyhedral loop transformation framework Grant 8,056,065 - Eichenberger , et al. November 8, 2 | 2011-11-08 |
Framework for integrated intra- and inter-loop aggregation of contiguous memory accesses for SIMD vectorization Grant 8,056,069 - Eichenberger , et al. November 8, 2 | 2011-11-08 |
Generating optimized SIMD code in the presence of data dependences Grant 8,037,464 - Eichenberger , et al. October 11, 2 | 2011-10-11 |
Multi-petascale Highly Efficient Parallel Supercomputer App 20110219208 - Asaad; Sameh ;   et al. | 2011-09-08 |
Building Approximate Data Dependences with a Moving Window App 20110219222 - Eichenberger; Alexandre E. ;   et al. | 2011-09-08 |
Workload partitioning in a parallel system with hetergeneous alignment constraints Grant 8,006,238 - Eichenberger , et al. August 23, 2 | 2011-08-23 |
Data Parallel Function Call for Determining if Called Routine is Data Parallel App 20110161623 - Eichenberger; Alexandre E. ;   et al. | 2011-06-30 |
Parallel Execution Unit that Extracts Data Parallelism at Runtime App 20110161642 - Eichenberger; Alexandre E. ;   et al. | 2011-06-30 |
Runtime Extraction of Data Parallelism App 20110161643 - Eichenberger; Alexandre E. ;   et al. | 2011-06-30 |
Parallelization Of Irregular Reductions Via Parallel Building And Exploitation Of Conflict-free Units Of Work At Runtime App 20110088020 - Eichenberger; Alexandre E. ;   et al. | 2011-04-14 |
Method And Structure Of Using Simd Vector Architectures To Implement Matrix Multiplication App 20110055517 - Eichenberger; Alexandre E. ;   et al. | 2011-03-03 |
Detecting Task Complete Dependencies Using Underlying Speculative Multi-Threading Hardware App 20110055484 - Eichenberger; Alexandre E. ;   et al. | 2011-03-03 |
In-Data Path Tracking of Floating Point Exceptions and Store-Based Exception Indication App 20110047358 - Eichenberger; Alexandre E. ;   et al. | 2011-02-24 |
Version Pressure Feedback Mechanisms for Speculative Versioning Caches App 20110047362 - Eichenberger; Alexandre E. ;   et al. | 2011-02-24 |
Checkpointing in Speculative Versioning Caches App 20110047334 - Eichenberger; Alexandre E. ;   et al. | 2011-02-24 |
Insertion of Operation-and-Indicate Instructions for Optimized SIMD Code App 20110047359 - Eichenberger; Alexandre E. ;   et al. | 2011-02-24 |
Matrix Multiplication Operations with Data Pre-Conditioning in a High Performance Computing Architecture App 20110040821 - Eichenberger; Alexandre E. ;   et al. | 2011-02-17 |
Complex Matrix Multiplication Operations with Data Pre-Conditioning in a High Performance Computing Architecture App 20110040822 - Eichenberger; Alexandre E. ;   et al. | 2011-02-17 |
Aligning precision converted vector data using mask indicating offset relative to element boundary corresponding to precision type Grant 7,865,693 - Eichenberger , et al. January 4, 2 | 2011-01-04 |
Runtime Dependence-Aware Scheduling Using Assist Thread App 20100287550 - Eichenberger; Alexandre E. ;   et al. | 2010-11-11 |
Efficient generation of SIMD code in presence of multi-threading and other false sharing conditions and in machines having memory protection support Grant 7,730,463 - Eichenberger , et al. June 1, 2 | 2010-06-01 |
Dynamic Data Driven Alignment and Data Formatting in a Floating-Point SIMD Architecture App 20100095087 - Eichenberger; Alexandre E. ;   et al. | 2010-04-15 |
Dynamically Aligning Enhanced Precision Vectors Based on Addresses Corresponding to Reduced Precision Vectors App 20100095086 - Eichenberger; Alexandre E. ;   et al. | 2010-04-15 |
Optimized Code Generation Targeting a High Locality Software Cache App 20100088673 - Chen; Tong ;   et al. | 2010-04-08 |
Method and system for versioning codes based on relative alignment for single instruction multiple data units Grant 7,673,284 - Eichenberger , et al. March 2, 2 | 2010-03-02 |
Efficient Software Cache Accessing With Handle Reuse App 20100023932 - Eichenberger; Alexandre E. ;   et al. | 2010-01-28 |
Single Instruction Multiple Data (simd) Code Generation For Parallel Loops Using Versioning And Scheduling App 20100011339 - Eichenberger; Alexandre E. ;   et al. | 2010-01-14 |
Optimized Scalar Promotion with Load and Splat SIMD Instructions App 20090307656 - Eichenberger; Alexandre E. ;   et al. | 2009-12-10 |
System and Method for Domain Stretching for an Advanced Dual-Representation Polyhedral Loop Transformation Framework App 20090307673 - Eichenberger; Alexandre E. ;   et al. | 2009-12-10 |
Data Transfer Optimized Software Cache for Regular Memory References App 20090248985 - Ayguade; Eduard ;   et al. | 2009-10-01 |
Data Transfer Optimized Software Cache for Irregular Memory References App 20090249318 - Ayguade; Eduard ;   et al. | 2009-10-01 |
Method Using Slp Packing With Statements Having Both Isomorphic And Non-isomorphic Expressions App 20090171919 - Eichenberger; Alexandre E. ;   et al. | 2009-07-02 |
SIMD Code Generation For Loops With Mixed Data Lengths App 20090144529 - Eichenberger; Alexandre E. ;   et al. | 2009-06-04 |
Method And Apparatus For Allocating Architectural Register Resources Among Threads In A Multi-threaded Microprocessor Core App 20090100249 - EICHENBERGER; ALEXANDRE E. ;   et al. | 2009-04-16 |
System and Method for Stable Transitions in the Presence of Conditionals for an Advanced Dual-Representation Polyhedral Loop Transformation Framework App 20090083722 - Eichenberger; Alexandre E. ;   et al. | 2009-03-26 |
System and Method for Advanced Polyhedral Loop Transformations of Source Code in a Compiler App 20090083724 - Eichenberger; Alexandre E. ;   et al. | 2009-03-26 |
System and Method for Selective Code Generation Optimization for an Advanced Dual-Representation Polyhedral Loop Transformation Framework App 20090083702 - Eichenberger; Alexandre E. ;   et al. | 2009-03-26 |
Increase The Coverage Of Profiling Feedback With Data Flow Analysis App 20090070753 - Chen; Tong ;   et al. | 2009-03-12 |
Systems, Methods And Computer Products For Cross-thread Scheduling App 20090064152 - Eichenberger; Alexandre E. ;   et al. | 2009-03-05 |
Method to efficiently prefetch and batch compiler-assisted software cache accesses Grant 7,493,452 - Eichenberger , et al. February 17, 2 | 2009-02-17 |
SIMD code generation in the presence of optimized misaligned data reorganization Grant 7,478,377 - Eichenberger , et al. January 13, 2 | 2009-01-13 |
SIMD code generation for loops with mixed data lengths Grant 7,475,392 - Eichenberger , et al. January 6, 2 | 2009-01-06 |
SIMD Code Generation in the Presence of Optimized Misaligned Data Reorganization App 20080307402 - Eichenberger; Alexandre E. ;   et al. | 2008-12-11 |
Efficient Code Generation Using Loop Peeling for SIMD Loop Code with Multiple Misaligned Statements App 20080222623 - Eichenberger; Alexandre E. ;   et al. | 2008-09-11 |
Apparatus and Method for Optimizing Scalar Code Executed on a SIMD Engine by Alignment of SIMD Slots App 20080222391 - Eichenberger; Alexandre E. ;   et al. | 2008-09-11 |
Efficient Data Reorganization to Satisfy Data Alignment Constraints App 20080201699 - Eichenberger; Alexandre E. ;   et al. | 2008-08-21 |
Framework for efficient code generation using loop peeling for SIMD loop code with multiple misaligned statements Grant 7,395,531 - Eichenberger , et al. July 1, 2 | 2008-07-01 |
Efficient data reorganization to satisfy data alignment constraints Grant 7,386,842 - Eichenberger , et al. June 10, 2 | 2008-06-10 |
Method To Exploit Superword-level Parallelism Using Semi-isomorphic Packing App 20080127144 - Eichenberger; Alexandre E. ;   et al. | 2008-05-29 |
Generating Optimized Simd Code In The Presence Of Data Dependences App 20080127059 - Eichenberger; Alexandre E. ;   et al. | 2008-05-29 |
Framework for integrated intra- and inter-loop aggregation of contiguous memory accesses for SIMD vectorization Grant 7,367,026 - Eichenberger , et al. April 29, 2 | 2008-04-29 |
Code generation for complex arithmetic reduction for architectures lacking cross data-path support App 20080092124 - Archambault; Roch Georges ;   et al. | 2008-04-17 |
Workload Partitioning in a Parallel System with Hetergeneous Alignment Constraints App 20080077930 - Eichenberger; Alexandre E. ;   et al. | 2008-03-27 |
Method and apparatus for data stream alignment support App 20080065863 - Eichenberger; Alexandre E. ;   et al. | 2008-03-13 |
Optimized Software Cache Lookup For Simd Architectures App 20080065809 - Eichenberger; Alexandre E. | 2008-03-13 |
System and Method to Efficiently Prefetch and Batch Compiler-Assisted Software Cache Accesses App 20080046657 - Eichenberger; Alexandre E. ;   et al. | 2008-02-21 |
Framework for Integrated Intra- and Inter-Loop Aggregation of Contiguous Memory Accesses for SIMD Vectorization App 20080010634 - Eichenberger; Alexandre E. ;   et al. | 2008-01-10 |
Method for improving processing of relatively aligned memory references for increased reuse opportunities App 20070226453 - Eichenberger; Alexandre E. ;   et al. | 2007-09-27 |
Efficient generation of SIMD code in presence of multi-threading and other false sharing conditions and in machines having memory protection support App 20070226723 - Eichenberger; Alexandre E. ;   et al. | 2007-09-27 |
Method to analyze and reduce number of data reordering operations in SIMD code App 20070192762 - Eichenberger; Alexandre E. ;   et al. | 2007-08-16 |
Apparatus and method for optimizing scalar code executed on a SIMD engine by alignment of SIMD slots App 20070174825 - Eichenberger; Alexandre E. ;   et al. | 2007-07-26 |
Method and system for versioning codes based on relative alignment for single instruction multiple data units App 20070169058 - Eichenberger; Alexandre E. ;   et al. | 2007-07-19 |
Method and apparatus for eliminating the need for register assignment, allocation, spilling and re-filling Grant 7,206,923 - Eichenberger , et al. April 17, 2 | 2007-04-17 |
Method and system for data-driven runtime alignment operation App 20070011441 - Eichenberger; Alexandre E. ;   et al. | 2007-01-11 |
System and method for efficient data reorganization to satisfy data alignment constraints App 20050283769 - Eichenberger, Alexandre E. ;   et al. | 2005-12-22 |
System and method for SIMD code generation in the presence of optimized misaligned data reorganization App 20050283774 - Eichenberger, Alexandre E. ;   et al. | 2005-12-22 |
Framework for integrated intra- and inter-loop aggregation of contiguous memory accesses for SIMD vectorization App 20050283775 - Eichenberger, Alexandre E. ;   et al. | 2005-12-22 |
Framework for efficient code generation using loop peeling for SIMD loop code with multiple misaligned statements App 20050283773 - Eichenberger, Alexandre E. ;   et al. | 2005-12-22 |
Framework for generating mixed-mode operations in loop-level simdization App 20050273769 - Eichenberger, Alexandre E. ;   et al. | 2005-12-08 |
System and method for SIMD code generation for loops with mixed data lengths App 20050273770 - Eichenberger, Alexandre E. ;   et al. | 2005-12-08 |
Method and apparatus for eliminating the need for register assignment, allocation, spilling and re-filling App 20050132172 - Eichenberger, Alexandre E. ;   et al. | 2005-06-16 |