Cover image of Storage Developer Conference
(4)
Technology

Storage Developer Conference

Updated 3 days ago

Technology
Read more

Storage developer Podcast, created by developers for developers.

Read more

Storage developer Podcast, created by developers for developers.

iTunes Ratings

4 Ratings
Average Ratings
4
0
0
0
0

iTunes Ratings

4 Ratings
Average Ratings
4
0
0
0
0
Cover image of Storage Developer Conference

Storage Developer Conference

Updated 3 days ago

Read more

Storage developer Podcast, created by developers for developers.

Rank #1: #102: Achieving 10-Million IOPS from a single VM on Windows Hyper-V

Podcast cover
Read more
Many server workloads, for example OLTP database workloads, require high I/O throughput and low latency. With the industry trend of moving high-end scale-up workloads to virtualization environment, it is essential for cloud providers and on-premises servers to achieve near native performance by reducing I/O virtualization overhead which mainly comes from two sources: DMA operations and interrupt delivery mechanism for I/O completions. The direct PCIe NVMe device assign techniques allow a VM to interact with HW devices directly and avoid using traditional Hyper-V para-virtualized I/O path.

To improve interrupt handling in the virtualization environment, Intel introduces Posted Interrupts (PI) as an enhanced method to mitigate interrupt delivery overhead in a virtualized environment, bypassing hypervisor involvement completely. In this talk, we will present Microsoft implementation and optimization of Intel PI and Hyper-V direct PCIe NVMe access on Windows platform. The results showed that we were able to achieve more than 10-Million IOPS from a single VM for the first time in the industry using an Intel Skylake based HPE commodity server with these techniques.
Jul 15 2019
44 mins
Play

Rank #2: #101: Introduction to Persistent Memory Configuration and Analysis Tools

Podcast cover
Read more
Have you heard of non-volatile/persistent memory but don’t know how to get started with this disruptive technology? Memory is the new Storage. Next generation storage tiered architectures are evolving with persistent memory and hardware delivering NVDIMMs. Are you a Linux or Windows application developer familiar with C, C++, Java, or Python, keen to develop the next revolutionary application or modify an existing application, but not sure where to start? Do you know what performance and analysis tools can be used to identify optimizations in your app to take advantage of persistent memory? Are you a software, server, or cloud architect that wants to get a jump start on this disruptive technology? This presentation will get you started on the persistent memory solution path. The future is in your hands. The future is now!

Learning Objectives: 1) We’ll deliver an introductory understanding of persistent memory, introduce the SNIA Programming Model, Direct Access (DAX) filesystems, and show where persistent memory fits in the storage hierarchy; 2) We’ll provide several options for creating development environments (you don’t need physical modules to get started!); 3) We’ll introduce application programming using the Persistent Memory Developers Kit (PMDK); 4) We’ll introduce and describe how to create and manage Persistent Memory Regions, Namespaces, and Labels; 5) Describe existing analysis tools to identify applications that are good candidates for persistent memory.
Jul 08 2019
44 mins
Play

Rank #3: #99: SNIA Nonvolatile Memory Programming TWG - Remote Persistent Memory

Podcast cover
Read more
The SNIA NVMP Technical Workgroup (TWG) continues to make significant progress on defining the architecture for interfacing applications to PM. In this talk, we will focus on the important Remote Persistent Memory scenario, and how the NVMP TWG’s programming model applies. Application use of these interfaces, along with fabric support such as RDMA and platform extensions, are part of this, and the talk will describe how the larger ecosystem fits together to support PM as low-latency remote storage.

Learning Objectives: 1) Persistent Memory programming; 2) RDMA extensions; 3) SNIA PM initiatives.
Jun 17 2019
47 mins
Play

Rank #4: #98: Rethinking Ceph Architecture for Disaggregation Using NVMe-over-Fabrics

Podcast cover
Read more
Ceph protects data by making 2-3 copies of the same data but that means 2-3x more storage servers and related costs. It also means higher write latencies as data hops between OSD nodes. Customers are now starting to deploy Ceph using SSDs for high-performance workloads and for data lakes supporting real-time analytics. We describe a novel approach that eliminates the added server cost by creating Containerized, stateless OSDs and leveraging NVMe-over-fabrics to replicate data in server-less storage nodes. We propose redefining the boundaries of separation within SDS architectures to address disaggregation overheads. Specifically, we decouple control and data plane operations and transfer block ownership to execute on remote storage targets. It also dramatically reduces write latency to enable Ceph to be used for databases and to speed up large file writes. As part of the solution, we also describe how OSD node failover is preserved via a novel mechanism using standby stateless OSD nodes.

Learning Objectives: 1) Storage disaggregation; 2) NVMe over fabrics; 3) Ceph architecture.
Jun 10 2019
34 mins
Play

Rank #5: #97: Delivering Scalable Distributed Block Storage using NVMe over Fabrics

Podcast cover
Read more
NVMe and NVMe over Fabrics (NVMe-oF) protocols provide a highly efficient access to flash storage inside a server and over the network respectively. Current generation of distributed storage software stacks use proprietary protocols which are sub-optimal to deliver end to end low latency. Moreover it increases operational complexity to manage NVMe-oF managed flash storage and distributed flash storage in private cloud infrastructure. In this session, we present NVMe over Fabrics based high performance distributed block storage that combines the best of both worlds to deliver performance, elasticity and rich data services.

Learning Objectives: 1) NVMe, NVMe-oF for flash data path IO architecture; 2) Programming, architecture and optimization for flash; 3) Distributed storage, data services.
Jun 03 2019
48 mins
Play

Rank #6: #96: Solid State Datacenter Transformation

Podcast cover
Read more
Intel Fellow Amber Huffman has been at the center of Intel’s development for SSDs, with emphasis on SSD storage interfaces and next generation form factors. In this talk, she will discuss the rationale behind decisions that were made in advancing storage architecture leading to the emergence of solid state only Data Centers. Amber will also discuss the key factors influencing the future of the Data Center and the important role storage continues to play.
May 20 2019
48 mins
Play

Rank #7: #92: Fibre Channel – The Most Trusted Fabric Delivers NVMe

Podcast cover
Read more
As data-intensive workloads transition to low-latency NVMe flash-based storage to meet increasing user demand, the Fibre Channel industry is combining the lossless, highly deterministic nature of Fibre Channel with NVMe. FC-NVMe targets the performance, application response time, and scalability needed for next-generation data centers while leveraging existing Fibre Channel infrastructures. This presentation will provide an overview of why Fibre Channel’s inherent multi-queue capability, parallelism, deep queues, and battle-hardened reliability make it an ideal transport for NVMe across the fabric.

Learning Objectives: 1) A reminder of how Fibre Channel works; 2) A reminder of how NVMe over Fabrics work; 3) A high-level overview of Fibre Channel and NVMe, especially how they work together.
Apr 16 2019
32 mins
Play

Rank #8: #90: FPGA Accelerator Disaggregation Using NVMe-over-Fabrics

Podcast cover
Read more
Leveraging the NVMe standard to present FPGA accelerators as NVMe namespaces allows user space code to access accelerators as simple block devices via standard in-box drivers. NVMe-over-Fabrics (NVMe-oF) is a network protocol for NVMe that is used to communicate between a host machine and NVMe devices over high-performance Ethernet networks. Leveraging NVMe-oF allows an accelerator, that presents as a standard NVMe namespace, to be shared across existing transports such as RDMA, TCP/IP or Fibre Channel. With NVMe-oF, client machines can borrow accelerators over the fabrics connection which can then be accessed as if they were local to the system, allowing the same user code to run on direct attached accelerators and over-Fabrics accelerators. The ease and granularity available when setting up the over-fabrics connections allows servers to share accelerators on demand, allowing for the disaggregation of acceleration compute resources. Using NVMe Controller Memory Buffer (CMB), the fabrics connection is further improved by allowing devices to send and receive data directory from the accelerator without accessing host memory. We will discuss the advantages of accelerator disaggregation and its impact on resource usage.

Learning Objectives: 1) Advantages and performance of NVMe for storage workload acceleration; 2) Performance of NVMe over Fabrics for storage workloads and acceleration; 3) Advantages of using Controller Memory Buffer coupled with NVMe over Fabrics.
Mar 25 2019
37 mins
Play

Rank #9: #86: Emerging Interconnects: Open Coherent Accelerator Processor Interface (OpenCAPI) and Gen-Z

Podcast cover
Read more
Two emerging interconnect efforts, OpenCAPI and Gen-Z, are top-of-mind today in system architecture and storage, including advanced platform support of Persistent Memory. Come hear distinguished leads from each project give an update on the goals, progress, and futures of each. Expect to learn both the differences and commonalities between these potentially symbiotic standards, and how they may enable new platforms and new storage solutions as they move forward.
Feb 14 2019
1 hour 7 mins
Play

Rank #10: #85: Bulletproofing Stateful Applications on Kubernetes

Podcast cover
Read more
Kubernetes supports external storage through volume plugins, however, the framework was built using generic concepts so as to work with all types of storage including legacy SANs as well as newer Software Defined Storage (SDS) solutions. As a result of this approach, Kubernetes cannot natively take advantage of some of the benefits of cloud-native storage systems like hyperconvergence, and also lacks native support for advanced storage monitoring and snapshots.

Dinesh will present STORK (STorage Orchestrator Runtime for Kubernetes) a new open source extension for Kubernetes, that provides additional intelligence and control for your storage. He will show how STORK can be used to co-locate pods with their data, deal with storage failures seamlessly and take application consistent volume snapshots through Kubernetes.

This talk will show how users can overcome limitations of stateful services and bulletproof them in Kubernetes using STORK. Dinesh will demonstrate how, using STORK, Kubernetes based applications can respond automatically to common failure modes like network partitions, and node and driver failures without downtime. He will talk about how this model can be extended for other drivers and the future roadmap.

Learning Objectives: 1) Optimizing stateful applications when using Kubernetes; 2) Running highly available stateful applications without any user intervention; 3) Creating and using application consistent volume snapshots with Kubernetes.
Feb 06 2019
37 mins
Play

Rank #11: #78: Managing Disk Volumes in Kubernetes

Podcast cover
Read more
Kubernetes streamlines the deployment and orchestration of clustered workloads, both on-premise and in the cloud. The Container Storage Interface (CSI) is a newly-defined model for defining templates for mountable volumes and file systems, instantiating them, and binding them to individual containers. In this talk, we will present an overview of the CSI and how it enables rapid migration of workloads to cloud. We will also discuss future directions for extending Kubernetes storage interfaces.

Learning Objectives: 1) How Container Storage Interface (CSI) enables container workloads to easily migrate from on-premise to cloud; 2) How to control volume attributes such as redundancy and location; 3) Opportunities to extend Kubernetes to support new storage technologies and use cases.
Nov 05 2018
48 mins
Play

Rank #12: #75: Deep Sea Fishing: A Swordfish Deep-Dive

Podcast cover
Read more
Building on the concepts presented in the Introduction to Swordfish sessions, this session will go into more detail on the new Swordfish Scalable Storage Management API specification, including details of the Swordfish Class of Service concepts, structure and usage. It will also cover details of constructing file vs block systems. The deep-dive will also provide a look at the schema, and the ability to support both RESTful and OData clients.
Sep 18 2018
42 mins
Play

Rank #13: #71: Self-Optimizing Caches

Podcast cover
Read more
Caches in modern storage systems lack the ability to adapt automatically and optimize for dynamic workload mixes. Despite the potential for huge improvements in cost, performance, and predictability, such adaptability is extremely challenging, due to inherently complex, non-linear, and workload-dependent behavior. Even when manually-tunable controls are provided to support dynamic cache sizing, partitioning, and parameter tuning, administrators simply don’t have the information required to make good decisions.

In this talk, we will present an overview of the significant opportunity for self-optimizing caches by examining several examples from production systems. We will review recently-published research in this area, including robust, general methods for efficient cache modeling. Optimizations that leverage these models promise to improve the performance of most workloads and cache policies automatically.
Jun 13 2018
53 mins
Play

Rank #14: #70: SPDK Blobstore: A Look Inside the NVM Optimized Allocator

Podcast cover
Read more
The Storage Performance Development Kit (SPDK) is an open source set of tools and libraries for writing high performance, scalable, user-mode storage applications. It achieves high performance by moving all of the necessary drivers into userspace and operating in a polled mode instead of relying on interrupts. The Blobstore is a relative newcomer to SPDK and provides local, persistent, power-fail safe block allocator designed to replace filesystem usage in many popular databases. Most importantly, the Blobstore has been designed for the properties of flash and next-generation media from the start and directly leverages NVMe features. The team has already ported a popular embedded key/value database, RocksDB, to use the Blobstore which demonstrated a significant improvement for database queries under common workloads. In this session we will explore the basics of the Blobstore and review some of the latest exciting performance data!
May 30 2018
52 mins
Play

Rank #15: #68: Andromeda: Building the Next-Generation High-Density Storage Interface for Successful Adoption

Podcast cover
Read more
Open Channel describes a new interface to Solid State Drives (SSDs) which promises to increase usage of SSDs’ raw bandwidth from 40% to 95%, increase user-visible flash capacity from 50%-70% to 99%, increase I/O bandwidth by 3x and reduce per-GB hardware cost by 50%. Despite many proposals and implementations proving these benefits, industry has seen limited adoption and no standards body has integrated the concept. One of the largest hurdles to adoption is that the proposed changes permeate every layer in the storage stack, from device firmware to application. To reap the benefits, we need not only an end goal, but a pragmatic approach to introducing these changes to one or two layers at a time.
We present relevant information about host and drive architecture, the expected use cases for Open Channel, and a general-purpose, maintainable end target for Open Channel. The final architecture refactors Flash Translation Layer into Log-Management, handled in the host, and NAND Management, handled in the drive. This consolidates repeated software algorithms, and retains media-specific functionality in the drive.
The results of this design set up the storage community to innovate in traditionally independent areas. With a new interface in place, host-side software developers can apply a wide variety of software and technologies to further optimize their storage logs, and firmware and drive designers can focus on improvements in media management and density improvements.
Mar 27 2018
49 mins
Play

Rank #16: #61: Persistent Memory Security

Podcast cover
Read more
Persistent memory NVDIMMs create security challenges, some of which are familiar while others are new. The NVM Programming TWG has been working on a security threat model for Persistent Memory that exposes gaps where solutions need to evolve beyond those that met the needs of hard disks and SSD’s. This session describes the current work in progress of the TWG that will appear in a future white paper.

Learning Objectives: 1)Learn what how persistent memory security solutions may differ from hard disk and SSD solutions; 2)Learn how multi-tenancy in a cloud datacenter relates to Persistent Memory security features; 3)Show how protection of data at rest can facilitate secure reuse of NVDIMMs.
Jan 15 2018
43 mins
Play

Rank #17: #51: USB Cloud Storage Gateway

Podcast cover
Read more
Cloud block storage implementations, such as Ceph RADOS Block Device (RBD) and Microsoft Azure Page Blobs, are considered flexible, reliable and relatively performant.
Exposing these implementations for access via an embedded USB storage gadget can solve a number of factors limiting adoption, namely:

Interoperability - Cloud storage can now be consumed by almost any system with a USB port
Ease of use - Configure once, then plug and play
Security - Encryption can be performed on the USB device itself, reducing reliance on cloud storage providers

This presentation will introduce and demonstrate a USB cloud storage gateway prototype developed during SUSE Hack Week, running on an embedded Linux ARM board.

Learning Objectives: 1) Knowledge of existing Ceph and Azure block storage implementations; 2) Awareness of problems limiting cloud storage adoption; 3) Evaluate a USB cloud storage gateway device as a solution for factors limiting adoption.
Jul 18 2017
30 mins
Play

Rank #18: #45: Data Retention and Preservation: The IT Budget Killer is Tamed

Podcast cover
Read more
Data retention and preservation is rapidly becoming the most impacting requirement to data storage. Regulatory and corporate guidelines are causing stress on storage requirements. Cloud and Big Data environments are stressed even more by the growth of rarely touched data due to the need to improve margins in storage. There are many choices in the market for data retention, managing the data for decades must be as automated as possible. This presentation will outline the most effective storage for Long term data preservation, emphasizing Total Cost of Ownership, ease of use and management, and lowering the carbon footprint of the storage environment.
May 25 2017
42 mins
Play

Rank #19: #44: What Can One Billion Hours of Spinning Hard Drives Tell Us?

Podcast cover
Read more
Over the past 3 years we’ve been collecting daily SMART stats from the 60,000+ hard drives in our data center. These drives have over one billion hours of operation on them. We have data from over 20 drive models from all major hard drive manufacturers and we’d like to share what we’ve learned. We’ll start with annual failure rates of the different drive models. Then we’ll look at the failure curve over time, does it follow the “bathtub curve” as we expect. We’ll finish by looking a couple of SMART stats to see if they can reliably predict drive failure.

Learning Objectives: 1) What is the annual failure rate of commonly used hard drives?; 2) Do hard drives follow a predictable pattern of failure over time?; 3) How reliable are drive SMART stats in predicting drive failure?
May 09 2017
52 mins
Play

Rank #20: #42: The Role of Active Archive in Long-Term Data Preservation

Podcast cover
Read more
Anyone managing a mass storage infrastructure for HPC, Big Data, Cloud, research, etc., is painfully aware that the growth, access requirements and retention needs for data are relentless. At the heart of that problem is the need to rationalize the way that data is managed, and create online access to all that data without maintaining it in a continuous, power-consuming state. The solution lies in creating an active archive that enables straight-from-the-desktop access to data stored at any tier for rapid data access via existing file systems that expand over flash, disk and tape library storage technologies. Active archives provide organizations with a persistent view of the data and make it easier to access files whenever needed, regardless of the storage medium being utilized.

Learning Objectives: 1) Understand how active archive technologies work and how companies are using them to enable reliable, online and efficient access to archived data; 2) Learn the implications of data longevity and planning considerations for long-term retention and data integrity assurance; 3) Learn why active archive solutions can achieve unmatched efficiency and cost savings as data continues to grow much faster than storage budgets.
Apr 26 2017
53 mins
Play

Similar Podcasts