Welcome to Scribd!

Control 1

Uploaded by

0% found this document useful (0 votes)

6 views4 pages

This document discusses several concepts related to high performance computing (HPC). It defines FLOPS as floating point operations per second, which is used to measure computer performance. It provides examples of peak (Rpeak) and maximum (Rmax) FLOPS values for the top supercomputers. It also discusses topics like cache misses due to replacement policies, the role of the translation lookaside buffer (TLB) in virtual to physical memory mapping, false sharing between cores updating nearby memory, and how stride size affects cache efficiency and misses.

Original Description:

hpc control 1

Original Title

Control1

Copyright

Available Formats

PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

0% found this document useful (0 votes)

6 views4 pages

Control 1

Uploaded by

alonc wolonc

Copyright:

Available Formats

Download as PDF, TXT or read online from Scribd

Flag for inappropriate content

Jump to Page

You are on page 1of 4

Search inside document

Control HPC

Pregunta 1:

FLOP significa Floating point operations. Para el contexto supondré que se refiere a flops que
serían operaciones de punto flotante por segundo. Se utiliza este concepto porque existen varios
factores que afectan el rendimiento de un computador y lo que nos interesa en la práctica es que
tantas instrucciones se pudieron ejecutar en cierto periodo de tiempo. Aunque los procesadores
sean muy rápidos también afecta la transmisión de datos, cache y de como sea el proceso
ejecutado.

Entro los criterios encontramos Rmax y Rpeak, siendo Rpeak el máximo teórico que alcanza la
máquina y Rmax el máximo de flops alcanzado al ejecutar el código de LINPACK (1). Un ejemplo de
esto sería el super computador Frontier en USA que alcanza un Rmax de 1,194.00 Pflops y tiene un
Rpeak de 1,679.82 Pflops. Otro ejemplo sería el super computador Fugaku en Japón que tiene
Rmax de 442.01 Pflops y un Rpeak de 537.21 Pflops. Esto de acuerdo con la página oficial de
Top500.

Pregunta 2:

Un ejemplo podría ser.

X = a[0] + 5

X = b[0] + 5

X = a[0] + 5

X = c[0] + 5 (4)

X = a[0] + 5 (5)

Suponemos que hay 2 páginas de cache y que a, b y c están en páginas diferentes. En este caso
primero se ponen las paginas A y B. llegamos al paso 4, Con FIFO reemplazamos A por C y en 5
volvemos a querer usar A pero hay un miss, particular un Conflict miss, pues se sacó información
de la cache lo que causó el miss, sin embargo había una mejor forma de sacar cosas de cache que
habría evitado el conflicto, por eso es conflicto miss.

Con LRU en el paso 4 habríamos quitado la página B (pues A lo habíamos usado de forma más
reciente) y en 5 no tendríamos una cache miss.

Pregunta 3:

La TLB se encarga de mapear la memoria virtual con la memoria real. Como el computador tiene
muchos lugarse donde guardar memoria, como las caches, registros, ram, disco duro, etc. Se
utiliza una memoria virtual, que simula la memoría de forma simplificada, sin embargo en algún
lado debemos saber hacia donde se mapea esa memoria virtual a real y eso es lo que indica la TLB.
A diferencia de las caches, la TLB no guarda información del programa si no que guarda
información de donde está almacenada la información.

Afecta a la velocidad del programa, pues cada vez que queremos ir a buscar algo a memoria
debemos revisar en la TLB cual es el lugar real de memoria y por ende tiene un costo utilizarla que
afecta la velocidad del programa.

Pregunta 4:

El false sharing se da cuando tenemos que núcleos distintos a datos que están cercanos en
memoria (no son el mismo dato). El problema es que hay que traspasar esta información y por lo
mismo se puede perder tiempo en tener actualizadas las caches, aunque no haya una race
condition como tal. Eso hace que se pierda bastante eficiencia.

Se puede reducir bastante como se muestra en el libro cuando por ejemplo solo se hacen estas
actualizaciones al terminar un loop, así se evita tener que compartir memoria innecesesaria a
mitad de un proceso.

Pregunta 5:

Afecta en cuantos misses podemos tener y por lo mismo que tanto vamos a tener que refrescar
memoria. Un stride es un intervalo de acceso a datos, por ejemplo un stride pequeño hará que
tengamos acceso de elementos cercanos en memoria. Como la cache se actualiza por cachelines,
tendremo que cuando vamos a buscar un dato, también se traerá cierta cantidad de datos vecinos,
por eso la localidad espacial ayuda mucho a la hora de tener eficiencia a nivel de cache. Un stride
grande hará que excedamos los datos que se traen en una cache line y por lo mismo tendremos
más misses.

Pregunta 6:
Bibliografía:

1. https://www.top500.org/project/top500_description/

Async Rust
Document102 pages
Async Rust
Daniele Marsiglia
No ratings yet
Chapter 05 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
Document105 pages
Chapter 05 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
Priyanka Meena
75% (4)
Computer Design Supervision 3
Document3 pages
Computer Design Supervision 3
Tom Patterson
No ratings yet
CSC 314 Tutorial Sheet
Document2 pages
CSC 314 Tutorial Sheet
SC20A942 Jeme Beseka Foncham
No ratings yet
Welcome To Part 3: Memory Systems and I/O
Document31 pages
Welcome To Part 3: Memory Systems and I/O
Pir Saib
No ratings yet
Cache: Why Level It: Departamento de Informática, Universidade Do Minho 4710 - 057 Braga, Portugal Nunods@ipb - PT
Document8 pages
Cache: Why Level It: Departamento de Informática, Universidade Do Minho 4710 - 057 Braga, Portugal Nunods@ipb - PT
sothymohan1293
No ratings yet
Cache Memory Homework
Document7 pages
Cache Memory Homework
wrhkwohjf
100% (1)
Operating System
Document11 pages
Operating System
faiyaz pardiwala
No ratings yet
Question
Document3 pages
Question
Akaitom
No ratings yet
Memory Management - What and Where Are The Stack and Heap - Stack Overflow
Document12 pages
Memory Management - What and Where Are The Stack and Heap - Stack Overflow
tousifahmedkhan
No ratings yet
CPU Cache: How Caching Works
Document6 pages
CPU Cache: How Caching Works
Rav Thind
No ratings yet
Lecture 7 8405 Computer Architecture
Document7 pages
Lecture 7 8405 Computer Architecture
bokadash
No ratings yet
Processor Multithreading Vs Multi-Core
Document3 pages
Processor Multithreading Vs Multi-Core
Narendran Murugayah
No ratings yet
Computer Design Supervision 2
Document6 pages
Computer Design Supervision 2
Tom Patterson
No ratings yet
Computer Design Supervision 5
Document3 pages
Computer Design Supervision 5
Tom Patterson
No ratings yet
The Heap Explained
Document9 pages
The Heap Explained
Monika Yadav
No ratings yet
Limitation of Memory Sys Per
Document38 pages
Limitation of Memory Sys Per
Jyotiprakash Nanda
No ratings yet
Lec 6
Document9 pages
Lec 6
Elisée Ndjabu
No ratings yet
Python Downloadcopy Code: Import Import
Document5 pages
Python Downloadcopy Code: Import Import
kokobberihu14
No ratings yet
Previous Home Next
Document12 pages
Previous Home Next
Hassaan Rana
No ratings yet
Windows Assignment
Document8 pages
Windows Assignment
Sagar M S
No ratings yet
Smash The Stack
Document29 pages
Smash The Stack
wp1baraba
100% (1)
Virtual Memory Term Paper
Document7 pages
Virtual Memory Term Paper
afmaamehdbosuo
100% (1)
Os Module 4 Notes
Document51 pages
Os Module 4 Notes
19. sai roopesh
No ratings yet
You Probably Dont Need RAC
Document10 pages
You Probably Dont Need RAC
myron
No ratings yet
L15 Cache Introduction
Document35 pages
L15 Cache Introduction
Rakshan Kumar
No ratings yet
Unit 5 Dpco
Document20 pages
Unit 5 Dpco
shinyshiny966
No ratings yet
Cache Memory Term Paper
Document6 pages
Cache Memory Term Paper
afdttricd
100% (1)
Research Paper On Cache Memory
Document8 pages
Research Paper On Cache Memory
pib0b1nisyj2
100% (1)
Literature Review of Cache Memory
Document7 pages
Literature Review of Cache Memory
afmzhuwwumwjgf
100% (1)
Scimakelatex 28282 Mary+jones
Document6 pages
Scimakelatex 28282 Mary+jones
LK
No ratings yet
FEROLIN, Mary Bernadette J. November29, 2020 BSCS-2 CS 3104 (4:30 - 6:00, MW) Chapter 10: Virtual Memory
Document43 pages
FEROLIN, Mary Bernadette J. November29, 2020 BSCS-2 CS 3104 (4:30 - 6:00, MW) Chapter 10: Virtual Memory
Mary Bernadette
No ratings yet
Mit 101 Activity2
Document3 pages
Mit 101 Activity2
Carlo Moon Corpuz
No ratings yet
Oracle 12c DBA Handson - 1st Half
Document51 pages
Oracle 12c DBA Handson - 1st Half
pavonnvarma
100% (1)
Simplified Way of Processing Large Data Using Chunk in Laravel
Document4 pages
Simplified Way of Processing Large Data Using Chunk in Laravel
Dinesh Suthar
No ratings yet
Computer Organization Answer
Document6 pages
Computer Organization Answer
samir pramanik
No ratings yet
Process Management: Processes (Redux)
Document7 pages
Process Management: Processes (Redux)
Ian Lopez
No ratings yet
Cs604 - Final Term Subjective With Reference Solved by Umair Saulat
Document29 pages
Cs604 - Final Term Subjective With Reference Solved by Umair Saulat
chi
No ratings yet
Ram As Swap Space
Document9 pages
Ram As Swap Space
Surya Prakash Singh
No ratings yet
On The Deployment of The Memory Bus
Document7 pages
On The Deployment of The Memory Bus
Larch
No ratings yet
L04 Parallel Systems Synchronization Communication Scheduling
Document117 pages
L04 Parallel Systems Synchronization Communication Scheduling
Jiaxu Chen
No ratings yet
L - 3-AssociativeMapping - Virtual Memory
Document52 pages
L - 3-AssociativeMapping - Virtual Memory
Lekshmi
No ratings yet
Matthew Williams CISB305 Fall 2022 Assignment 4
Document4 pages
Matthew Williams CISB305 Fall 2022 Assignment 4
Matthew Williams
No ratings yet
Final Sample Sol
Document12 pages
Final Sample Sol
özlem Erdem
No ratings yet
Scimakelatex 9977 Le Jaimit
Document5 pages
Scimakelatex 9977 Le Jaimit
LK
No ratings yet
Apache Flink: Flink's Core Is A
Document20 pages
Apache Flink: Flink's Core Is A
Anonymous jN6bVk1f
No ratings yet
Memcached: Local Database Query Cache: Your Database May Have It's Own Native Query Caching, Which
Document4 pages
Memcached: Local Database Query Cache: Your Database May Have It's Own Native Query Caching, Which
kumar2me
No ratings yet
How To Optimize PHP Script To Increase Speed
Document4 pages
How To Optimize PHP Script To Increase Speed
Tran Manh
No ratings yet
Computer Architecture
Document5 pages
Computer Architecture
rudemaverick
No ratings yet
The RAM Model of Computation
Document1 page
The RAM Model of Computation
Gaurav Agrawal
No ratings yet
Understanding CPU Caching
Document7 pages
Understanding CPU Caching
danhorii
No ratings yet
Anatomy of A Program in Memory
Document19 pages
Anatomy of A Program in Memory
Victor J. Pernia
No ratings yet
Ruby Concurrency Explained
Document7 pages
Ruby Concurrency Explained
achhu
No ratings yet
Cache Memory Thesis
Document5 pages
Cache Memory Thesis
jenniferwrightclarksville
100% (2)
Ac 2005 Scalable We Barch
Document74 pages
Ac 2005 Scalable We Barch
rinkesh88
No ratings yet
Spark Optimization PDF
Document14 pages
Spark Optimization PDF
Naveen Naik
No ratings yet
OMC 303 - Section A
Document5 pages
OMC 303 - Section A
Karo
No ratings yet
LBM I Memory PDF
Document23 pages
LBM I Memory PDF
hpss77
No ratings yet
Learn Python in One Hour: Programming by Example
From Everand
Learn Python in One Hour: Programming by Example
Victor R. Volkman
Rating: 3 out of 5 stars
3/5 (2)
Getting started with php & mysql: Professional training
From Everand
Getting started with php & mysql: Professional training
Rémy Lentzner
No ratings yet