Why HP Workstation for CAE · lan 또는 커넥션wan 사용 사례: 원격 워크스테이션,...
Transcript of Why HP Workstation for CAE · lan 또는 커넥션wan 사용 사례: 원격 워크스테이션,...
Copyright 2014 Hewlett-Packard
Why HP Workstation for CAE
정운영 상무
HP 아시아 태평양 워크스테이션 사업개발관리자
CAE Conference, 2014.10.8 Copyright 2014 Hewlett-Packard
Agenda
• Ansys 15.0 on HP Z workstation
• Simulia Abaqus 6.13-1 on HP Z Workstation
• Tuning Tips
• Z Turbo Drive
• Z Virtual Workstation
CAE Conference, 2014.10.8 Copyright 2014 Hewlett-Packard
ANSYS 15.0 Benchmark on HP Z Workstations
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 4
HP Z820 E5-2687W v2 3.4/4.0 128GB 1866MHz 3x SSD RAID0 Windows 7 64 SP1
2 CPU Cores
2 CPU Cores + 1 GPU
8 CPU Cores
16 CPU Cores
16 CPU Cores + 1 GPU
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 5
HP Workstation Recommended Configurations ANSYS Fluent 15.0 and CFX 15.0
HP Z420 Intel® Xeon® E5-1680 v2 8-Core 3.0/4.0, 64GB memory, SSD, NVIDIA Quadro K600
HP Z620 Intel® Xeon® E5-2667 v2 16-Core 3.3/4.0, 64GB memory, SSD, NVIDIA Quadro K2000
HP Z820 Intel ® Xeon® E5-2697 v2 24-Core 2.7/3.5, 64GB memory, SSD, NVIDIA Quadro K2000
NVIDIA GPU Compute Options
Z420 (600W) Z620 (800W) Z820 (1125W)
Tesla K40 Max 1 Max 2
Tesla K20c Max 1 Max 1 Max 2
Quadro K6000 Max 1 Max 1 Max 2
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 6
HP Z820 E5-2687W v2 3.4/4.0GHz 128GB 1866MHz 3x SSD RAID0 Windows 7 64 SP1
1 CPU Cores + 1 GPU
2 CPU Cores
7 CPU Cores + 1 GPU
8 CPU Cores
16 CPU Cores
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 7
HP Workstation Recommended Configurations ANSYS Mechanical 15.0
HP Z420 Intel® Xeon® E5-1660 v2 6-Core 3.7/4.0, 64GB memory, SSD RAID0, Quadro K600
HP Z620 Intel® Xeon® E5-2667 v2 16-Core 3.3/4.0, 96GB memory, SSD RAID0, Quadro K2000
HP Z820 Intel ® Xeon® E5-2687W v2 16-Core 3.4/4.0, 128GB memory, SSD RAID0, Quadro K2000
NVIDIA GPU Compute Options
Z420 (600W) Z620 (800W) Z820 (1125W)
Tesla K40 Max 1 Max 2
Tesla K20c Max 1 Max 1 Max 2
Quadro K6000 Max 1 Max 1 Max 2
CAE Conference, 2014.10.8 Copyright 2014 Hewlett-Packard
SIMULIA Abaqus Benchmark on HP Z Workstations
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 9
SIMULIA Abaqus/Standard
S4: Cylinder head bolt-up • b) direct solver, Nonlinear • [5,237K DOF]
1.00
1.82
3.04
4.79
6.10
6.74 7.20
7.71
0.00
1.00
2.00
3.00
4.00
5.00
6.00
7.00
8.00
9.00
1 Core 2 Cores 4 Cores 8 Core 12 Cores 16 Cores 20 Cores 24 Cores
SIMULIA Abaqus/Standard 6.13-4 Direct Sparse Solver benchmark s4b improves significantly with more cores
Z820 E5-2697v2 12 Core 2.7 dual processor Maximus K4000 + K20c Tesla(s)
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 10
SIMULIA Abaqus/Explicit
E6: Concentric spheres • [244K elements]
1.00 1.31
2.37
3.57
4.54
5.09
6.31
0.00
1.00
2.00
3.00
4.00
5.00
6.00
7.00
2 Cores 4 Cores 8 Cores 12 Cores 16 Cores 20 Cores 24 Cores
Re
lati
ve
Pe
rfo
rma
nce
(h
igh
er
is b
ett
er)
SIMULIA Abaqus/Explicit 6.13-4 e6 Concentric Spheres benchmark improves significantly with more cores
Z820 E5-2697v2 12 Core 2.7 dual processor Maximus K4000 + K20c Tesla(s)
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 11
SIMULIA Abaqus/Standard with Maximus Config
1.00
2.55
1.82
4.07
3.04
5.63
4.79
7.24
6.10
8.11
6.74
8.37
7.20
8.63
7.71
9.15
0.00
1.00
2.00
3.00
4.00
5.00
6.00
7.00
8.00
9.00
10.00
1 Core 1 Core+K20c
2 Cores 2 Cores+K20c
4 Cores 4 Cores+K20c
8 Core 8 Cores+K20c
12 Cores 12 Cores+K20c
16 Cores 16 Cores+ 2 K20c
20 Cores 20 Cores+ 2 K20c
24 Cores 24 Cores+ 2 K20c
Re
lati
ve
Pe
rfo
rma
nce
(h
igh
er
is b
ett
er)
SIMULIA Abaqus/Standard 6.13-4 Direct Sparse Solver is faster
when run with Telsa K20c GPU compute
1st CPU Socket 2nd CPU Socket
(1)
K2
0c
Te
sla
(1)
K2
0c
Tes
la
(1)
K2
0c
Tes
la
(2)
K2
0c
Tes
las
(2)
K2
0c
Tes
las
(1)
K2
0c
Tes
la
Benchmark: s4b Model - Cylinder head bolt-up 5,000,000 DOF, 5 iterations, Static, Nonlinear
- Z820 E5-2697v2 12 Core 2.7 dual processor Maximus K4000 + K20c Tesla(s)
(1)
K2
0c
Tes
la
(2)
K2
0c
Tes
las
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 12
SIMULIA Abaqus/Standard Licensing
1.00
2.55
1.82
4.07
3.04
5.63
4.79
7.24
6.10
8.11
6.74
8.37
7.20
8.63
7.71
9.15
0
2
4
6
8
10
12
14
16
18
20
0.00
1.00
2.00
3.00
4.00
5.00
6.00
7.00
8.00
9.00
10.00
1 Core 1 Core+K20c
2 Cores 2 Cores+K20c
4 Cores 4 Cores+K20c
8 Core 8 Cores+K20c
12 Cores 12Cores+K20c
16 Cores 16 Cores+ 2K20c
20 Cores 20 Cores+ 2K20c
24 Cores 24 Cores+ 2K20c
Re
lati
ve
Pe
rfo
rma
nce
(h
igh
er
is b
ett
er)
SIMULIA Abaqus/Standard 6.13-4 Direct Sparse Solver is >6 times faster with 12 cores vs 1 core. - Add a K20c Tesla GPU for >8 times faster. - The 12 core+K20c GPU is 1.33x faster than without K20c GPU
Rel Perf
TokensBenchmark: s4b Model - Cylinder head bolt-up 5,000,000 DOF, 5 iterations, Static, Nonlinear
- Z820 E5-2697v2 12 Core 2.7 dual processor Maximus K4000 + K20c Tesla(s)
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 13
SIMULIA Abaqus 6.13-4
HP Workstation Recommended Configurations
Occasional use up to 48-64 GB, 8-Core, Tesla HP Z420 Intel® Xeon® E5-2687Wv2 8-Core 3.4Ghz/4GHz Turbo Boost, 48 GB-64 GB memory, 512 GB7 SATA SSD for OS, NVIDIA Tesla K20c, NVIDIA Quadro K600
Moderate use up to 48 GB-96 GB, 20-Core, Tesla HP Z820 2x Intel® Xeon® E5-2690v2 10-Core 3.0Ghz/3.6 Turbo Boost, 48 GB-96 GB memory, 512 GB7 SATA SSD for OS, NVIDIA Tesla® K20c, NVIDIA Quadro K2000 or K4000
Intensive use up to 96 GB-192 GB, 24-Core, Tesla HP Z820 2x Intel® Xeon® E5-2697v2 12-Core 2.7GHz/3.5Ghz Turbo Boost, 96 GB-192 GB memory, 512 GB7 SATA SSD for OS, two NVIDIA Tesla®, K20c (1 GPU per CPU socket), NVIDIA Quadro K2000 or K4000
CAE Conference, 2014.10.8 Copyright 2014 Hewlett-Packard
Tips and Resource Monitoring
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 15
SIMULIA Abaqus 6.13-4
HP Workstation Tuning Tips
• Memory
• Use as much memory as possible to minimize the I/O required
• Storage
• Abaqus/Explicit. Need fast file system if Abaqus/Standard job does not fit in memory
• Simulations that do not fit in physical memory will benefit from a RAID 0 storage volume configuration for performance.
LSI 9260-8i card for hardware RAID.
– 2-3x 6 Gb/s SSD RAID 0
– 3-4x 6 Gb/s SAS HDD RAID 0
• BIOS Settings All set to Default except set HyperThreading to Disable
• NVIDIA Tesla CPU Acceleration setting – change default ECC Mode off
• C:\Program Files\NVIDIA Corporation\NVSMI>nvidia-smi -e 0 or from the NVIDIA control panel Manage Maximus Settings.
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 16
HP Performance Advisor Workstation Monitor
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 17
HP Performance Advisor Workstation Monitor
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 18
COMPUTATIONAL STRUCTURAL MECHANICS
NVIDIA CUDA Accelerated Applications
APPLICATION DESCRIPTION SUPPORTED FEATURES MULTI-GPU
SUPPORT
Abaqus/Standard Simulation and analysis tool for structural mechanics Direct sparse solver Yes
ANSYS Mechanical Simulation and analysis tool for structural mechanics Direct and iterative solvers Yes
DS Exsight Uses Abaqus Standard for GPU computing Direct sparse solver Single only
DS DesignSight Uses Abaqus Standard for GPU computing Direct sparse solver Single only
Impetus Afea Predicts large deformations of structures and components exposed to extreme loading conditions Linear equation solver Yes
LS-DYNA Implicit Multiphysics simulation package used Linear equation solver Yes
MSC Nastran Simulation and analysis tool for structural mechanics Direct sparse solver Yes
MSC Marc Simulation and analysis tool for structural mechanics Direct sparse solver Yes
NX Nastran Simulation and analysis tool for structural mechanics Linear equation solver Single only
OptiStruct Simulation and analysis tool for structural mechanics Direct and iterative solvers Yes
PAM-CRASH Implicit Multiphysics simulation package used Linear equation solver Single only
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 19
COMPUTATIONAL FLUID DYNAMICS
NVIDIA CUDA Accelerated Applications
APPLICATION DESCRIPTION SUPPORTED FEATURES MULTI-GPU
SUPPORT
Altair AcuSolve General purpose CFD software Linear equation solver Yes
ANSYS Fluent General purpose CFD software Radiation heat transfer model,
linear equation solver
Yes
Autodesk Moldflow Plastic mold injection software Linear equation solver Single only
CPFD Barracuda-VR and
Barracuda
Fluidized bed modeling software Linear equation solver,
particle calculations
Single only
FluiDyna Culises for
OpenFOAM
Solver library for general purpose CFD
software
Linear equation solvers Yes
FluiDyna LBultra General purpose CFD software Lattice-Boltzmann solver Yes
Prometech Particleworks Particle-based CFD software Implicit and explicit solvers Single only
Turbostream Ltd. CFD software for turbomachinery flows Explicit solver Yes
Vratis ARAEL General purpose CFD software based on
FVM with OpenFOAM compatibility
Linear equation solver Yes
Vratis SpeedIT extreme for
OpenFOAM
Solver library for general purpose CFD
software
Linear equation solvers Yes
CAE Conference, 2014.10.8 Copyright 2014 Hewlett-Packard
HP Z Turbo Drive for HP Workstations
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. 21
0
500
1000
1500
2000
2500
2010 2011 2012 2013 2014 2015 2016
MB
/s
Get up to 2x the storage performance at a cost similar to commercial SATA SSDs
HP Z Turbo Drive
Reduce bottlenecks by connecting directly to the PCIe bus
• Excellent for large files / big data workflows
• Reliable ecosystem with storage driver
• Supports boot and data drive
SSDs
SATA/SAS 6G
PCIe G2 x2
PCIe G2 x4
HDDs
PCIe SSD - HP Z Turbo Drive:
revolutionary performance increases
SATA SSD: At performance ceiling
SATA HDD: At performance ceiling
© Copyright 2014 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. 22
Benefits
HP Z Turbo Drive
Features
PCIe connected SSD
Price comparably to SATA SSDs
Workstation support today
Performance >1GB/s (read)
Highest price/performance device
Desktop, boot & data drives
Application & cost specific 256 GB and 512 GB capacities
Workstation innovation in storage performance
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 23
Performance comparison • 100% improvement for sequential read/write
vs SATA SSDs
• Read performance exceeds one gigabyte per second - roughly twice as fast as commercial SATA SSDs
• More than six times faster in sequential read performance than a commercial-grade HDD
*Performance measured using IOMeter 2006 with queue depth 32 **Performance measured using SPECwpc 1.0 Workstation IOMeter script
0
200
400
600
800
1000
1200
Sequential read Sequential write
7200 SATA HDD Samsung Enterprise SATA SSD
Micron M550 SATA SSD HP Z Turbo Drive (PCIe SSD)
Sequential R/W performance* (MB/s)
0
50000
100000
150000
Random read Random write
7200 SATA HDD Samsung Enterprise SATA SSD
Micron M550 SATA SSD HP Z Turbo Drive (PCIe SSD)
Random R/W performance* (IOPS)
0
500
1000
1500
2000
2500
3000
3500
Random read Random write
Samsung Enterprise SATA SSD Micron M550 SATA SSD
HP Z Turbo Drive (PCIe SSD)
SolidWorks specific benchmark SPECwpc** (IOPS)
CAE Conference, 2014.10.8 Copyright 2014 Hewlett-Packard
HP Z Virtual Workstation
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 25
HP DL380z Gen8 가상 워크스테이션 사양 정보
가상화 워크스테이션의 핵심 요소
• 최대 8명의 워크스테이션 사용자를 지원하는 2U 랙 시스템
• 업계 표준 가상화 에코시스템
• Citrix, VMware
• HP RGS 및 HP Velocity
• ISV 인증
직관성, 호환성을 갖춘 관리 시스템
• HP iLo 관리 옵션
• HP 인사이트 컨트롤 및 HPP 인사이트 온라인
• HP 스마트 업데이트
고성능 그래픽
• NVIDIA™ GRID™ K2
• NVIDIA™ Quadro™ K4000/K5000/K6000
다양한 장치 지원
HP 워크스테이션, HP 씬클라이언트 또는 태블릿
• 수신 장치와 OS에 따라 다양한 수신장치 소프트웨어 지원
업계가 인정한 리더십
• 30년 이상 워크스테이션 고객의 니즈에 부응해 온 전문성
• NVIDIA, Citrix, Vmware, ISVs와의 탄탄한 협력 관계
• HP 워크스테이션 성능 튜닝 기술
• 독보적인 HP 포트폴리오
LAN 또는 WAN
사용 사례: 원격 워크스테이션 , pGPU, vGPU
클라이언트 OS
원격 기반 SW
소프트웨어 수신
P M G
P – 프로세서 또는 코어
M – 메모리
G - GPU
클라이언트
사례 A: 원격 워크스테이션
SW 수신
P M G1
사례 B: Pass-thru GPU
SW 수신
G2
하이퍼바이저
원격 기반 SW
가상화 클라이언트 OS A
가상화 클라이언트
OS B
원격 기반 SW
A B
SW 수신
P M G1*
사례 C: 가상화 GPU
SW 수신
G2*
하이퍼바이저
원격 기반 SW
가상화 클라이언트 OS A
가상화 클라이언트 OS B
A B
vGPU HV 지원
원격 기반 SW 원격 기반 SW
가상화 클라이언트 OS
C
SW 수신
C …
…
픽셀
USB / HID
픽셀
USB / HID
픽셀
USB / HID
픽셀
USB / HID
픽셀
USB / HID
픽셀
USB / HID
…
* 가상화 가능한 GPU. E.g. NVIDIA GRID K1/K2
… …
커넥션 매니저 커넥션 매니저
CAE Conference, 2014.10.8 Copyright 2014 Hewlett-Packard
감사합니다
© Copyright 2014 Hewlett-Packard Development Company, L.P. 이 문서의 모든 내용은 통보 없이 변경될 수 있습니다.
CAE Conference, 2014.10 Copyright 2014 Hewlett-Packard 28
HP Workstation family ANSYS 15.0 Pre-Processing and Post-Processing
HP Workstation
Intel® Processor(s)
# CPU Cores Max Memory GB
Graphics
HP Z1 G2 Xeon® E3-1200v3
Core™ i3, i5
4
2, 4
32 NVIDIA Quadro
HP Z230 Xeon® E3-1200v3
Core™ i5, i7
4
4, 4
32 AMD FirePro
NVIDIA Quadro
HP Z420 Xeon® E5-1600v2
Xeon® E5-2600v2
4, 6, 8
8
64 AMD FirePro
NVIDIA Quadro
HP Z620 Xeon® E5-1600v2
Xeon® E5-2600v2
4, 6, 8
4, 6, 8, 10, 12
96 AMD FirePro
NVIDIA Quadro
HP Z820 Xeon® E5-2600v2 4, 6, 8, 10, 12 512 AMD FirePro
NVIDIA Quadro
HP ZBook 14 Core™ ULT i5, i7 2, 2 16 AMD FirePro
HP ZBook 15, 17 Core™ M i5, i7 2, 4 32 NVIDIA Quadro