
TFRecords vs HDF5

The dataset contains 250 images for each of 40 classes, i.e. 10,000 images in total.
The original size of each image was 1920 × 1200.
These 10,000 original images required 37.3 GB of memory.
Each image was resized to 224 × 224 before being stored.
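
The preprocessing code that performs this resize is part of the dataset package and is not shown in the notebook; a minimal sketch, assuming OpenCV is used for reading and resizing, might look like this:

import cv2

def load_and_resize(path, size=(224, 224)):
    # read the image from disk (uint8, BGR) and shrink it from 1920 x 1200 to 224 x 224
    image = cv2.imread(path)
    return cv2.resize(image, size)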

Importing Dependencies

In [1]: # importing the FileWriter and DatasetGenerator classes from the dataset package


from dataset.writer import FileWriter
from dataset.reader import DatasetGenerator
...

Storing Images
In [2]: # Instantiating file writer object
writer = FileWriter()

[INFO] Preparing writer..

[INFO] Reading dataset..


[INFO] One Hot encoding the labels..

In [3]: # creating tf records file


writer.create_tfrecord()

Creating tfrecord file: 100% |##################################| Time: 0:30:01
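
create_tfrecord() belongs to the author's FileWriter class and its source is not included here. As a rough sketch of what such a method typically does (the feature names and helper below are assumptions, not the actual implementation), each resized image and its one-hot label are serialized into a tf.train.Example and appended to the TFRecord file:

import tensorflow as tf

def write_tfrecord(images, labels, path="images.tfrecord"):
    # images: uint8 arrays of shape (224, 224, 3); labels: one-hot float vectors
    with tf.io.TFRecordWriter(path) as writer:
        for image, label in zip(images, labels):
            feature = {
                "image": tf.train.Feature(
                    bytes_list=tf.train.BytesList(value=[image.tobytes()])),
                "label": tf.train.Feature(
                    float_list=tf.train.FloatList(value=label.tolist())),
            }
            example = tf.train.Example(features=tf.train.Features(feature=feature))
            writer.write(example.SerializeToString())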

In [4]: # creating hdf5 file


writer.create_hdf5()

Creating hdf5 file: 100% |######################################| Time: 0:29:20
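
create_hdf5() is likewise part of the dataset package. A comparable sketch, assuming h5py and in-memory arrays (dataset names and layout are illustrative only), would store the whole dataset as two arrays in a single file:

import h5py
import numpy as np

def write_hdf5(images, labels, path="images.h5"):
    # images: uint8 array of shape (10000, 224, 224, 3); labels: float32 one-hot array of shape (10000, 40)
    with h5py.File(path, "w") as f:
        f.create_dataset("images", data=np.asarray(images, dtype=np.uint8))
        f.create_dataset("labels", data=np.asarray(labels, dtype=np.float32))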

Memory used

Reading Images

Reading the full dataset, i.e. completing one epoch


In [5]: # Instantiating data generator object
dataGen = DatasetGenerator()

[INFO] Preparing Dataset Generator

In [6]: # reading tf records file


dataGen.read_tfrecord()

WARNING:tensorflow:From D:\shubham\Research\1. Handling Mass Data\dataset\reader.py:120: DatasetV1.make_one_shot_iterator (from tensorflow.python.data.ops.dataset_ops) is deprecated and will be removed in a future version.
Instructions for updating:
Use `for ... in dataset:` to iterate over a dataset. If using `tf.estimator`, return the `Dataset` object directly from your input function. As a last resort, you can use `tf.compat.v1.data.make_one_shot_iterator(dataset)`.

Reading tfrecord file: 100% |###################################| Time: 0:00:44
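
read_tfrecord() is also not shown. A hedged sketch of reading one full epoch from the TFRecord file, using the `for ... in dataset:` pattern recommended by the deprecation warning above (feature names, shapes, and batch size are assumptions), could look like this:

import tensorflow as tf

def read_tfrecord(path="images.tfrecord", batch_size=32):
    # feature spec matching the fields assumed in the writing sketch above
    feature_spec = {
        "image": tf.io.FixedLenFeature([], tf.string),
        "label": tf.io.FixedLenFeature([40], tf.float32),
    }

    def parse(serialized):
        parsed = tf.io.parse_single_example(serialized, feature_spec)
        image = tf.io.decode_raw(parsed["image"], tf.uint8)
        image = tf.reshape(image, (224, 224, 3))
        return image, parsed["label"]

    dataset = tf.data.TFRecordDataset(path).map(parse).batch(batch_size)
    for images, labels in dataset:   # one full pass = one epoch
        pass  # feed the batch to the model here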

In [7]: # reading hdf5 file


dataGen.read_hdf5()

Reading hdf5 file: 100% |#######################################| Time: 0:00:42
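
read_hdf5() follows the same idea for the HDF5 file; a minimal sketch, assuming h5py and the dataset names used in the writing sketch above:

import h5py

def read_hdf5(path="images.h5", batch_size=32):
    with h5py.File(path, "r") as f:
        images, labels = f["images"], f["labels"]
        for start in range(0, images.shape[0], batch_size):
            batch_x = images[start:start + batch_size]  # slicing reads only this batch from disk
            batch_y = labels[start:start + batch_size]
            # feed (batch_x, batch_y) to the model here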
