You are viewing archived content (2011-2018). For current research, visit

Articles by Colfax

Second Edition of “Parallel Programming and Optimization with Intel Xeon Phi Coprocessors”

January 15, 2019

Our first book, “Parallel Programming and Optimization with Intel Xeon Phi Coprocessors” (second edition) is now available for free. Use the link below to download.  Parallel_Programming_and_Optimization_with_Intel_Xeon_Phi_Coprocessors_2nd_Edition.pdf (14 MB) This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License. For the code of the supplementary practical exercises (“labs”), including updates for Intel Xeon Phi processor x200 family, go to [...]

MC² 004: Signal Processing in a Physics Experiment

May 30, 2017

Interested in this webinar? See more webinars like this. Speaker Prof. Jeffrey S. Dunham, Professor of Physics, Middlebury College Prof. Jeffrey S. Dunham has taught physics for 34 years at Middlebury College in Middlebury, Vermont, where he is now William R. Kenan Jr. Professor of Natural Sciences. He currently conducts experimental research in nonlinear dynamics. He is using HPC techniques at the workstation level to analyze large data sets from experiments that can be performed in a small-college laboratory. He received the Bachelor of Science degree in physics from the University of Washington in 1975 and the Ph.D. in physics from Stanford University in 1981. Presentation Savitzky-Golay Filter Algorithm for Large One-Dimensional Data Sets A chaotic pendulum experiment in our laboratory performs about 275 million digitized angle measurements in a 24-hour day. A Poincaré plot of the raw data shows significant and unacceptable discretization effects from the optical rotary encoder used to measure angle. The raw data is therefore passed through a Savitzky-Golay (SG) filter [...]

MC² 005: Biological Development Modeling

May 30, 2017

Interested in this webinar? See more webinars like this. Speaker Pablo González de Aledo Marugán, Postdoctoral Research Assistant, Imperial College of London Pablo González de Aledo studied telecommunications engineering at thec University of Cantabria, Santander, Spain, and finished his studies at the Network-on-Chip team in ST-Microelectronics Grenoble, France. After working in the modeling and simulation of high-performance, multi-core and heterogeneous platforms for some European projects he became interested in the theory behind formal methods and worst-case-execution-time and moved to the Department of Computing at Macquarie University, Sydney, Australia, to complement his Ph. D. studies. He is now a Postdoctoral Research Assistant at Imperial College of London. Presentation An optimization approach for the computational modeling of biological development Current research in the field of computational biology often involves simulations on high-performance computer clusters. It is crucial that the code of such simulations is very efficient and correctly reflects the model [...]

MC² 003: Plasma Simulation with Particle-in-Cell Code

April 17, 2017

Interested in this webinar? See more webinars like this. Speaker Dr. Anastasia Perepelkina, Researcher, Keldysh Institute of Applied Mathematics Dr. Anastasia Perepelkina is a researcher at the Keldysh Institute of Applied Mathematics in Moscow, Russia. Plasma particle-in-cell simulation has been her main scientific interest since the master’s program at the NRNU MEPhI and it is the topic of her PhD thesis. She is a member of the laboratory with a long history of the development of plasma simulation methods, efficient algorithms, and applications. Beyond that, she has worked on the implementation of high-order schemes in high-performance codes for stencil simulation in optics and seismology, and on the application of the resulting code. Presentation Particle-in-cell Code with LRnLA Algorithms, Performance Tests on KNL CFHall is a plasma simulation code based on a particle-in-cell method. It uses the finite difference method on a rectangular mesh for Maxwell equations, and couples it to superparticle traversal the mesh in a self-consistent approach. The uniqueness of CFHall [...]

MC² 002: CoMD, Molecular Dynamics Proxy Application

March 3, 2017

Interested in this webinar? See more webinars like this. Speaker Dr. Adedoyin Adetokunbo “Toks”, Staff Scientist Los Alamos National Laboratory Dr. Adedoyin Adetokunbo (“Toks”) is currently a Staff Scientist at Los Alamos National Laboratory (LANL). He currently serves as a member of the Future Application and Architectures (FAA) and Application Performance Team (APT). Previously worked as a postdoctoral fellow at University of Notre Dame’s Mechanical Engineering department. He served as a lead in constitutive modeling of complex heterogeneous materials. A native of Nigeria, Toks arrived in Mississippi in the year 2000, and acquired his Bachelor’s in Applied Mathematics and Mechanical Engineering. In addition, he received his Master’s degrees in field of Computational Fluid Dynamics with an emphasis in turbulence modeling in real and spectral space. He then completed his Ph.D. in Computational Engineering with a focus on modeling and simulating phase transformation in polycrystalline materials. Presentation A case study on software [...]

MC² 001: Smooth Particle Hydrodynamics Optimization

February 11, 2017

Interested in this webinar? See more webinars like this. Speaker Dr. Fabio Baruffa, Sr. HPC Application Specialist Leibniz Supercomputing Centre Since 2016, Dr. Fabio Baruffa is Senior HPC Application Specialist at Leibniz Supercomputing Centre (LRZ) in Munich. He is a member of the Intel Parallel Computing Center (IPCC) at LRZ, focusing on code modernization and porting of scientific applications. Prior to LRZ, he has worked as HPC researcher at Max-Planck Computing and Data Facility (MPCDF), Jülich Research Center and Cineca Supercomputing involved in HPC software development and analysis of scientific codes. His main research interests are in the area of computational methods and optimizations for HPC systems. He holds a PhD in Physics from University of Regensburg for his research in the area of spintronics device and quantum computing. Presentation Performance Optimization of SPH Algorithms for Multi/Many-Core Architectures In the framework of the Intel Parallel Computing Centre at the Research Campus Garching in Munich, our group at LRZ presents recent results on performance [...]

Regional Trainings Begin for 2017

February 8, 2017

We are resuming our regional hands-on training on parallel programming and optimization at major universities across the United States. This year, in addition to teaching performance tuning on Intel Architecture, our program will include Hands-on exercises on our new Colfax Cluster with Intel Xeon Phi processors x200 and Information about machine learning on Intel Architecture Our first stop is Yale University on February 22-23. For future events, watch our Regional Training [...]

Meet us at SC16

November 8, 2016

If you are going to the SC16 conference, visit Colfax: At the Intel HPC Developer Conference (free pre-SC16 event) When: on Sunday, November 13, 2016, at 9:45 am – 10:35 am Where: Sheraton Salt Lake City Hotel Event: Technical session “Optimizing Machine Learning Workloads on Intel Platforms” On the SC’16 exhibition floor When: Monday, November 14 through Thursday, November 16 Where: Colfax’s booth #2407 Event: See the Ninja Developer Platform based on Intel Xeon Phi processors and all other components of Intel Scalable System Framework. Stop by for a live demo of code modernization and play with a machine learning application that describes you and the scene around you with words. At the Intel Community Hub When: Wednesday, November 16, at 10:15 am Where: Intel’s booth #1819 Event: Code Modernization Sharing by the Ecosystem Community [...]

Performance Optimization for Intel® Xeon Phi™ x200 Product Family: Video

September 29, 2016

Optimization for Intel Xeon Phi Processors x200 Colfax now offers a 2-hour Hands-On Workshop (HOW) video on the best practices for performance optimization for Intel® Xeon Phi™ processor (formerly Knights Landing). Use links below the video to navigate the 10 episodes.     Slides:   HOW-Knights-Landing.pdf (4 MB) Part 1. Meet Intel Xeon Phi processors Purpose of Intel Xeon Phi processors and their organization from the programmer’s point of view. Episode 01. ► Intel architecture: today and tomorrow (14 min) Episode 02. ► Cores in Intel Xeon Phi processors (7 min) Episode 03. ► Vector Instruction Support (14 min) Episode 04. ► High-bandwidth memory (8 min) Episode 05. ► Clustering modes (9 min) Part 2. Hands-on Demonstrations Exercises in performance optimization for Intel Xeon Phi processors. Episode 06. ► Memory bandwidth optimization (19 min) (bonus: ► with memkind) (9 min) Episode 07. ► Vectorization with AVX-512 (13 min) (bonus: ► threading) (9 min) Episode 08. ► Tuning with Intel Math Kernel Library (MKL) (20 min) Episode 09. [...]

HOW Series “Tools”

May 19, 2016

The free Hands-On Webinar (HOW) series explores 3 essential software development tools for Intel architecture: Intel VTune Amplifier, Intel Math Kernel Library (MKL) and Intel Advisor. Each 2-hour webinar comprises a theoretical part and a hands-on component, which is live demonstration of programming and optimization techniques on modern Intel Xeon processors and Intel Xeon Phi coprocessors. The Colfax HOW Tools training series is an integral part of the Intel Modern Code Developer program which supports developers in leveraging application performance in code through a systematic optimization methodology. For similar upcoming trainings, see our Training Calendar. Webinar 1 – Hotspot-guided optimization with Intel VTune Amplifier XE   This session features select topics of performance diagnostics using Intel VTune Amplifier XE. Discussion of general performance optimization challenges. Demonstration of the usage of VTune to identify top hotspots on traditional CPUs and Intel Xeon Phi coprocessors. Illustration of the optimization report functionality of Intel compilers. [...]
1 2