Entries tagged as: Developers

Key ISV presentations at APU 13 Professional Graphics Track

Posted by Tony DeYoung on November 11, 2013

The AMD Developer Summit (APU 13) begins today in San Jose, CA. In addition to the keynotes and general sessions, there is a series of AMD FirePro professional graphics sessions given by key ISV partners. See full agenda here.

Tues     10:00 – 10:45     PG-4118
Steve Hoeg     Adobe Systems
Advanced GPU Accelerated Video Editing

Tues     11:00 – 11:45     PG-4037
Leonard Hoffnung     Siemens PLM Software
Fast modal analysis with NX Nastran and GPUs

Wed     4:00 – 4:45     PG-4120
Gideon Shmuel     eyeSight
Bringing next generation gesture technology to digital devices

Tues     4:00 – 4:45     PG-4039
Dmitry Kozlov     AMD
RapidFire API

Tues     5:00 – 5:45     PG-4035
Wociech Tarnawski     Vratis
Virtual Microscopy in the cloud

Wed     10:00 – 10:45     PG-4036
Jakub Pola, Lukasz Miroslaw     Vratis
Computational Fluid Dynamics of the blood flow in the cloud

Wed     11:00 – 11:45     PG-4119
Jacques Lefaucheux     Cintoo3D
3D Geometry Compression on GPU

Wed     2:45 – 3:30     PG-4034
Karl Hillesland     AMD
Using OpenGL and DirectX for Heterogeneous Compute

Wed     4:00 – 4:45     PG-4084
Luis Crivelli     Dassault Systemes Simulia Corp
Accelerating Abaqus Using AMD GPUs

FirePro Pro Graphics track at upcoming AMD Developer Summit Nov 11-14

Posted by Tony DeYoung on September 30, 2013

The Pro Graphics track at the AMD Developer Summit (Nov 11-14, 2013), presented by the AMD FirePro team, will feature sessions of interest to developers of professional graphics and computing applications, along with techniques for using CPUs and multiple GPUs to get the most out of available graphics and computing power.

Session ID     Speaker(s)     Organization     Title

PG-4032     Duoduo Liao     Computing for Geospatial Research Institute     GPU-Based Fast Generating Volumetric Models for 3D Printing
This paper proposes an approach for quickly converting 3D solid objects into multivalued voxel-based models using Graphics Processing Unit (GPU) acceleration, for 3D printing or any other area that uses 3D voxel-based models. In particular, by dynamically applying different transfer functions, any surface-closed geometric object can be voxelized into a solid volumetric representation with any kind of interior material, such as rainbow, marble, wood, translucent jade, etc. This approach significantly improves the voxelization speed for solid geometric objects in real time. Moreover, this method also avoids other limitations, such as missing voxels, binary-only voxels, and non-transparency. Because each slice is voxelized independently, parallel computing can be applied to improve overall performance.

PG-4035     Wociech Tarnawski, Jakub Pola, Lukasz Miroslaw     Vratis Sp. z o.o.     Virtual Microscopy in the cloud
Image data generated in microscopy is hard to maintain because of the large number of data formats, varied image-processing workflows, data storage demands, and complicated, time-consuming post-processing. High-content screening and automated image processing are a current trend, but the complexity of both technologies prevents non-specialists from using them in their work. The large number of parameters to set and the need to use different software for different tasks raise both the work effort and the cost of the overall system. VIRTUM is an image-processing platform that addresses these issues by integrating various technologies in one place and giving the user web-based access to the system. The images are stored in the cloud and analyzed on specialized hardware equipped with CPU and GPU nodes.

PG-4037     Leonard Hoffnung     Siemens PLM Software     Fast modal analysis with NX Nastran and GPUs
Industrial finite element analysis of large structures can consume a large amount of hardware resources and run time, particularly in the computation of eigenvalues and eigenvectors (also known as modal analysis). The RDMODES method in NX Nastran speeds up modal computation by a factor of ten or more, while preserving sufficient accuracy for most engineering purposes. We present performance improvements in RDMODES achieved by solving relatively small and dense subproblems on the GPU using the clMAGMA library.

PG-4084     Luis Crivelli     Dassault Systemes Simulia Corp     Accelerating Abaqus Using AMD GPUs
We present recent work done at SIMULIA to increase the performance of Abaqus by using GPU accelerators. In particular, we discuss the implementation of the Abaqus Direct Sparse Solver and analyze the main technical issues relevant to the implementation of Explicit. Our main objective is to reduce the solver time in the context of multiple CPU cores combined with multiple GPUs. We demonstrate the performance gains of Abaqus/Standard for a number of realistic, customer-based, large problems run on last generation CPUs and the benefits obtained when GPUs and CPUs collaborate and complement each other.


Autodesk, Adobe, OPTIS, Microsoft, Mixamo, Mitsubishi, OpenCL in AMD FirePro booth at SIGGRAPH 2013

Posted by Tony DeYoung on July 16, 2013

The AMD FirePro booth #521 at this year's SIGGRAPH 2013 will be packed with demos showing the power of the AMD FirePro W series. Check out the GPU-accelerated demos from software industry leaders as well as new software firms. There is also cutting-edge display and projection hardware worth checking out.

What to look for at a glance:

  • 6 Sharp 32" 4K Ultra-HD displays driven by a single AMD FirePro W9000 card: 9252 x 4320 resolution = 40 million pixels at 60fps
  • OpenCL real-time 12-bit color processing at 4K running on the first high frame rate 4K DLP projector (Christie D4K2560) driven by a FirePro W9000. Gary Demos will be presenting.
  • Windows 8.1 (Blue) tiled resources using partially resident textures in DirectX 11.2 running on a Dell T5600 and AMD FirePro W7000
  • AMD Radeon Sky Series for Cloud Gaming
  • OpenCL-accelerated OPTIS THEIA-RT for real-time physics-based ray-tracing running across six 30" displays using AMD FirePro W9000
  • Mitsubishi display wall spanning six 46" LCDs, using Playout Composer, running on a Dell T5600 and AMD FirePro W9000
  • OpenCL-accelerated facial tracking with Mixamo Face+ running on a Dell T5600 and AMD FirePro W7000
  • WebGL-based virtual globe rendering by Cesium running on a Dell T5600 and AMD FirePro W7000
  • Autodesk 3ds Max 2013 running on a Dell T5600 and AMD FirePro W8000
  • OpenCL-accelerated MayaBullet physics simulation plug-in for Autodesk Maya 2013 running on a Dell T5600 and AMD FirePro W8000
  • OpenCL-accelerated Adobe Premiere Pro CC for real-time video production in HD and 4K running on a Dell T5600 and AMD FirePro W7000
  • Enhanced OpenCL and OpenGL support in Adobe Photoshop CC for 10-bit high dynamic range imagery and accelerated special effects running on a Dell M6700 PremierColor w/ FirePro M6000 Mobility Pro Graphics
  • AMD OpenCL Developer Tools demos running on a Dell T5600 and AMD FirePro W8000

Also be sure to check out the two AMD FirePro tech talks at SIGGRAPH 2013.

  1. Real-Time ACES Decoding and AMD FirePro GPU-Enabled RRT/ODT Processing at 4K
    Tuesday, 23 July | 11:15 am-12:15 pm
  2. FirePro Technologies for Visual Computing
    Wednesday, 24 July | 11:15 am-12:15 pm

AMD FirePro Tech Talks at SIGGRAPH 2013

Posted by Tony DeYoung on July 10, 2013

The AMD FirePro team will host two tech talks at SIGGRAPH 2013.

Real-Time ACES Decoding and AMD FirePro GPU-Enabled RRT/ODT Processing at 4K
Tuesday, 23 July | 11:15 am-12:15 pm
Interactive decoding enables real-time adjustment in scene-referred linear light of color balance, contrast, and effective exposure of high-dynamic-range ACES.  Efficient coding/decoding using floating-point provides extra precision to support these features.

FirePro Technologies for Visual Computing
Wednesday, 24 July | 11:15 am-12:15 pm
Visual Computing is the combination of two key assets from AMD: compute and graphics. Visual Computing needs numerous specific features in graphics and compute in order to become a new experience in digital content creation, multimedia, or visual simulation. AMD FirePro Technology provides these key features, delivering the best experience in Visual Computing.

RGK geometric 3D kernel for CAD supports CPU multi-threading and GPU acceleration using OpenCL

Posted by Tony DeYoung on June 27, 2013

Geometric 3D kernels are software components designed to represent and precisely model physical objects in 3D on computers. From these kernels, software companies develop CAD/CAM/CAE end-user applications. Examples include Parasolid, ACIS, CATIA CGM and ASM.

This paper describes RGK (Russian geometric kernel), a parallel computation kernel that supports multi-threading (CPU) and GPU acceleration using OpenCL. The RGK kernel has the full set of tools to create models using solid, surface, and wireframe modeling. Because it uses parallel computations, the application can work simultaneously with these different types of data within the same model.

On the GPU, RGK uses OpenCL for functions that compute mass-inertia characteristics of bodies, projections, searches for model elements at a given point, NURBS computations, etc. New parallel algorithms can easily be introduced to keep up with any newly introduced hardware capabilities. The kernel also supports visualization techniques for models, including model tessellation and mesh generator functions using the GPU.

RGK supports 32- and 64-bit architectures, and Windows and Linux platforms. It can be compiled with any C++ compiler that implements the features of the C++11 standard.

Below is an example video demonstrating geometric constraints in assembly and kinematics simulation

Tags: CAD, Developers

AMD FirePro Technology SDK offers repository of source code samples

Posted by Tony DeYoung on August 23, 2012

AMD has released the FirePro Technology SDK, a repository of samples with complete source code intended for educational purposes. The base code is up to date with the latest specification and can be seen as a guideline for good coding practice, to achieve the best performance from your graphics card.

The first round of code samples includes: OpenGL textured cube, OpenGL lighting, OpenGL advanced rendering techniques, OpenGL tessellation, OpenCL transfer compute overlap, OpenGL QuadBuffer stereo rendering, OpenGL sparse texture, OpenGL frame lock and OpenGL 10-bit per component surfaces.

This code is released free of charge for use in derivative works, whether academic, commercial, or personal.

Tags: Developers

Fabric Engine new DCC app ‘Creation Platform’ to debut at SIGGRAPH 2012 in AMD FirePro booth #709

Posted by Tony DeYoung on July 31, 2012

At this year's SIGGRAPH, Fabric Engine will publicly debut the new Creation Platform framework for building custom, high-performance graphics / DCC applications. Creation is built on top of Fabric’s multi-threading heterogeneous compute engine.

Creation Platform will be shown in the AMD FirePro booth (#709) at SIGGRAPH 2012 next week.

Key benefits of Creation include:

  • High Performance: Creation is built on top of the Fabric Core Execution Engine, a multi-threading engine designed to get exceptional performance out of both CPUs and GPUs.
  • Modularity: Creation provides a set of building blocks that can be connected together to build a wide range of different applications.
  • Extensibility: Creation’s extension system enables developers to incorporate existing code libraries into their applications. This enables Creation to incorporate things like Bullet Physics and streaming of motion capture data, as well as allowing for support of custom file types like Alembic and LIDAR.
  • Accessibility: The majority of Creation applications are built using Python and Qt. For the performance-critical parts of an application, Creation uses a custom Kernel Language (KL), which is a simple, high-level language. If a TD or developer is comfortable with Python, they will find KL easy to pick up.

Small scale clustered computing ecosystem broadens: AMD FirePro, LS-DYNA, CAPS HMPP compiler

Posted by Tony DeYoung on July 12, 2012

AMD is very active in advancing performance computing (see recent award for DOE Extreme-Scale Computing Research and Development Program). Part of this effort entails broadening the small scale clustered computing ecosystem, including recent optimizations for OpenCL acceleration on AMD FirePro GPUs and APUs for LS-DYNA simulation software and the HMPP directive-based compiler from CAPS.

LS-DYNA is a general-purpose finite element program capable of simulating complex real world problems. It is used by the automobile, aerospace, construction, military, manufacturing, and bioengineering industries to run complex, compute-intensive simulations such as auto crashes and explosions. The beta version supports OpenCL acceleration on AMD FirePro graphics (general availability in 3rd quarter).

The CAPS Enterprise HMPP compiler now fully integrates a data-parallel backend for OpenCL and adds support for AMD FirePro GPU and APU devices. CAPS technology helps users take advantage of AMD Accelerated Parallel Processing Math Libraries (APPML), the software libraries containing FFT and BLAS functions designed to run on AMD GPUs. Initial performance benchmarks in functions such as image processing using Sobel filters show a 40X increase in performance. With this new release, compiler directives can be used to leverage AMD FirePro capabilities while preserving C or FORTRAN codes.

Article: How does Order Independent Transparency Accelerate Creo 2.0

Posted by Tony DeYoung on April 17, 2012

Order independent transparency, or “OIT”, in computer graphics programming terminology denotes any technique that can correctly render overlapping semi-transparent objects without having to sort them before they are rendered. Rendering semi-transparent objects has always been a problem because the blending operation is order dependent: when a semi-transparent fragment is rendered, the underlying color (i.e. the background) is crucial for the final color to be correct.

This article and the accompanying white paper explain how AMD implemented the OIT technique in PTC Creo Parametric 2.0. The technique uses standard ARB OpenGL 4.2 extensions such as Atomic Counter and Shader Image load/store, which enable access to texture buffers directly from the shader. The implementation allows pixel-accurate rendering of overlapping semi-transparent objects without sorting them first, providing up to 9 or 10 times the performance when rendering transparency in Creo Parametric 2.0 compared with blended rendering in Pro/ENGINEER Wildfire 5.0.
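
To make the order dependence concrete, here is a minimal CPU-side sketch in Python. It is not the Creo or AMD implementation: the function names, the fragment layout (color, alpha, depth) and the per-pixel sort are assumptions chosen for illustration. It blends the same two fragments in both submission orders, then shows that collecting the fragments for a pixel and sorting them by depth at resolve time gives the same result regardless of the order in which they were submitted.

```python
def over(src_rgb, src_alpha, dst_rgb):
    """Standard 'over' blend: draw src on top of the color already in dst."""
    return tuple(src_alpha * s + (1.0 - src_alpha) * d
                 for s, d in zip(src_rgb, dst_rgb))


def composite(fragments, background):
    """Blend fragments in the order given (first element is drawn first)."""
    color = background
    for rgb, alpha, _depth in fragments:
        color = over(rgb, alpha, color)
    return color


def composite_sorted(fragments, background):
    """Hypothetical resolve step: sort one pixel's fragments back to front
    (largest depth first), then blend. The caller never sorts objects."""
    ordered = sorted(fragments, key=lambda f: f[2], reverse=True)
    return composite(ordered, background)


background = (0.0, 0.0, 0.0)
red = ((1.0, 0.0, 0.0), 0.5, 2.0)   # (rgb, alpha, depth): farther fragment
blue = ((0.0, 0.0, 1.0), 0.5, 1.0)  # nearer fragment

far_then_near = composite([red, blue], background)  # correct order
near_then_far = composite([blue, red], background)  # wrong order, wrong color

print(far_then_near, near_then_far)
print(composite_sorted([blue, red], background))
```

The two plain `composite` calls disagree, which is the order dependence described above. The sorted variant returns the back-to-front result for either submission order, at the cost of storing every fragment that lands on the pixel.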

AMD FirePro Development User Community: Share tips and best practices

Posted by Tony DeYoung on February 09, 2012

AMD launched a FirePro Development user community for end users, developers and AMD experts to share tips and best practices about the FirePro Vx800 and Vx900 graphics cards. The newly updated community page greatly improves users’ ability to ask questions and get answers on any FirePro-related topic:

For example, here’s an interesting post just answered:

I’m trying to solve a very specific problem. For this, I need to run 4 displays at 120Hz, and have the vsync for each trigger a quarter of a frame after the previous:
Display 0, start frames at 0s, 4/480s, 8/480s, ...
Display 1, start frames at 1/480s, 5/480s, 9/480s, ...
Display 2, start frames at 2/480s, 6/480s, 10/480s, ...
Display 3, start frames at 3/480s, 7/480s, 11/480s, ...

Now the question is, can I do this with one S400 and one or more graphic cards?

1: Yes, you can connect up to 4 GPUs to a single S400 board.
2: You can genlock multiple displays that are connected to a single GPU as long as the connected monitors are identical.
3: The accepted input range for the house sync is between 15 and 120 H
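
The staggered timing requested in the question is simple arithmetic: at 120Hz each display has a frame period of 1/120s, and each display is delayed by a quarter of that period relative to the previous one. The short Python sketch below only reproduces that schedule with exact fractions. The function name and parameters are made up for illustration, and it says nothing about how the S400 or the driver would be configured.

```python
from fractions import Fraction


def vsync_start_times(display, num_displays=4, refresh_hz=120, frames=3):
    """Return the first `frames` vsync start times (in seconds) for one display.

    Each display is offset by display/num_displays of a frame period.
    """
    period = Fraction(1, refresh_hz)            # 1/120 s per frame
    offset = period * display / num_displays    # quarter-frame steps for 4 displays
    return [offset + n * period for n in range(frames)]


for d in range(4):
    # Express each time as a numerator over 480 to match the question.
    numerators = [int(t * 480) for t in vsync_start_times(d)]
    print(f"Display {d}: " + ", ".join(f"{n}/480s" for n in numerators))
```

With four displays at 120Hz the combined schedule produces one vsync every 1/480s, which is where the 480 in the denominators comes from.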
