In an environment as virtual as the cloud, access control systems are needed to constrain users' direct or indirect actions that could lead to a security breach. In the cloud, apart from the owner's access to confidential data, the...
Smart cities aim to provide more digitalized, equitable, sustainable, and liveable cities. In smart cities, data has emerged as an important asset, and citizens' data in particular is being used to provide data-driven mobility services....
Cloud computing offers a powerful abstraction that provides a scalable, virtualized infrastructure as a service where the complexity of fine-grained resource management is hidden from the end-user. Running data analytics applications in...
Hadoop Distributed File System (HDFS) is the core component of the Apache Hadoop project. In HDFS, the computation is carried out in the nodes where the relevant data is stored. Hadoop also implements a parallel computational paradigm named...
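The paradigm named in that entry is MapReduce. Its map/shuffle/reduce flow can be sketched with a minimal word count in plain Python; this is an illustration of the paradigm only, not Hadoop's actual Java API, and all function names here are made up for the sketch.

```python
from collections import defaultdict

def map_phase(document):
    # Emit (word, 1) pairs for every word in the input split.
    return [(word, 1) for word in document.split()]

def shuffle(pairs):
    # Group intermediate values by key, as the framework does between phases.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(groups):
    # Sum the counts for each word.
    return {word: sum(counts) for word, counts in groups.items()}

splits = ["hadoop stores data", "hadoop computes where data lives"]
pairs = [pair for split in splits for pair in map_phase(split)]
result = reduce_phase(shuffle(pairs))
```

In a real Hadoop job the map and reduce phases run on the DataNodes holding the input splits, which is the data-locality point the abstract makes.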
File-sharing semantics are used by file systems to share data among concurrent client processes in a consistent manner. Session semantics is a widely used file-sharing semantics in Distributed File Systems (DFSs). The main...
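Under session semantics, a client's writes become visible to other clients only when the file is closed; sessions opened earlier keep seeing the old contents. A toy model of this rule (illustrative classes only, not any real DFS API):

```python
class SessionFile:
    """Toy model of session semantics: writes become visible at close()."""
    def __init__(self):
        self.committed = ""   # contents visible to newly opened sessions

    def open(self):
        # Each session gets a private snapshot of the committed contents.
        return Session(self, self.committed)

class Session:
    def __init__(self, file, snapshot):
        self.file = file
        self.buffer = snapshot

    def write(self, data):
        self.buffer += data   # only this session sees the change

    def read(self):
        return self.buffer

    def close(self):
        # Changes propagate to the shared file only on close.
        self.file.committed = self.buffer

f = SessionFile()
s1 = f.open()
s1.write("hello")
s2 = f.open()          # opened before s1 closed: still sees old contents
before = s2.read()
s1.close()
s3 = f.open()          # opened after s1 closed: sees s1's writes
after = s3.read()
```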
Service-based network infrastructure is a new network interface in which the flow of messages is controlled by the class of service that generated it, next by its content, and by an improved shipping address specified by the sender and attached to the...
Optimizing MapReduce Scheduling Using a Parallel Processing Model on Data Nodes in a Hadoop Environment
In recent years, there has been rapid progress in the cloud. With a growing number of organizations relying on resources in the cloud, there is a requirement for securing the data of various...
In the last few years an interest in native XML databases has surfaced. With other authors we argue that such databases need their own provisions for concurrency control since traditional methods are inadequate to capture the complicated...
The main characteristics of five distributed file systems required for big data: A comparative study
In recent years, the amount of data generated by information systems has exploded. It is not only the quantities of information, now measured in exabytes, but also the variety of these data, which is more and more structurally...
Modern-day systems are facing an avalanche of data, and they are being forced to handle more and more data-intensive use cases. These data come in many forms and shapes: Sensors (RFID, Near Field Communication, Weather Sensors),...
With the emergence of Cloud Computing, the amount of data generated in different fields such as physics, medicine, social networks, etc. is growing exponentially. This increase in the volume of data and their large scale make the problem...
In this era of developing technologies, one of the most promising is cloud computing, which has been in operation for years and is used by individuals and large enterprises to provide different kinds of services to the world. Cloud computing...
Resource management is a key factor in the performance and efficient utilization of cloud systems, and many research works have proposed efficient policies to optimize such systems. However, these policies have traditionally managed the...
Applications of the future will need to support large numbers of clients and will require scalable storage systems that allow state to be shared reliably. Recent research in distributed file systems provides technology that increases the...
This paper addresses the design and implementation of an adaptive document version management scheme. Existing schemes typically assume: (i) a priori expectations for how versions will be manipulated and (ii) fixed priorities between...
In the rapidly evolving Cloud market, the amount of data being generated is growing continuously and as a consequence storage as a service plays an increasingly important role. In this paper, we describe and compare two new approaches,...
Active Storage provides an opportunity for reducing the bandwidth requirements between the storage and compute elements of current supercomputing systems, and leveraging the processing power of the storage nodes used by some modern file...
This paper introduces the Sigma algorithm that solves fault-tolerant mutual exclusion problem in dynamic systems where the set of processes may be large and change dynamically, processes may crash, and the recovery or replacement of...
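Quorum-based mutual exclusion, the family of techniques such fault-tolerant algorithms build on, relies on the fact that any two majority quorums intersect, so two processes can never both collect a majority of grants. A minimal sketch of that invariant (illustrative code only, not the Sigma algorithm itself):

```python
def majority_quorum(n):
    # Smallest integer strictly greater than n/2.
    return n // 2 + 1

class Registrar:
    """Each registrar grants its single permission to at most one requester."""
    def __init__(self):
        self.granted_to = None

    def request(self, pid):
        if self.granted_to is None:
            self.granted_to = pid
            return True
        return False

def try_acquire(pid, registrars):
    # A process enters the critical section only with a majority of grants.
    grants = sum(r.request(pid) for r in registrars)
    return grants >= majority_quorum(len(registrars))

registrars = [Registrar() for _ in range(5)]
p1_in = try_acquire("p1", registrars)   # p1 collects all 5 grants
p2_in = try_acquire("p2", registrars)   # p2 collects none: exclusion holds
```

Because any two majorities of the five registrars share at least one member, at most one process can hold a quorum at a time, even if a minority of registrars crash.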
Database systems have been designed to manage business-critical information and provide this information on request to connected clients, a passive model. Increasingly, applications need to share information actively with clients and/or...
We present NeST, a flexible software-only storage appliance designed to meet the storage needs of the Grid. NeST has three key features that make it well-suited for deployment in a Grid environment. First, NeST provides a generic data...
MapReduce-tailored distributed filesystems, such as HDFS for Hadoop MapReduce, and parallel high-performance computing filesystems are tailored for considerably different workloads. The purpose of our work is to examine the performance of...
Grid Datafarm architecture is designed to facilitate reliable file sharing and high-performance distributed and parallel data computing in a Grid across administrative domains by providing a global virtual file system. Gfarm v2 is an...
Whereas traditional Desktop Grids rely on centralized servers for data management, some recent progress has been made to enable the distribution of large input data using peer-to-peer (P2P) protocols and Content Distribution Networks (CDNs)....
This paper addresses the problem of efficiently storing and accessing massive data blocks in a large-scale distributed environment, while providing efficient fine-grain access to data subsets. This issue is crucial in the context of...
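Fine-grain access to massive blocks is typically obtained by striping data into fixed-size chunks and touching only the chunks a requested range overlaps. An illustrative sketch of that idea (a tiny chunk size and made-up helper names, not the system described in the paper):

```python
CHUNK = 4  # chunk size in bytes (deliberately tiny for illustration)

def split_into_chunks(blob):
    # Stripe the blob into fixed-size chunks keyed by chunk index.
    return {i // CHUNK: blob[i:i + CHUNK] for i in range(0, len(blob), CHUNK)}

def read_range(chunks, offset, length):
    # Fetch only the chunks that overlap [offset, offset + length).
    out = b""
    for idx in range(offset // CHUNK, (offset + length - 1) // CHUNK + 1):
        out += chunks[idx]
    start = offset - (offset // CHUNK) * CHUNK
    return out[start:start + length]

chunks = split_into_chunks(b"abcdefghijkl")
piece = read_range(chunks, 5, 4)  # spans chunks 1 and 2 only
```

In a distributed setting the chunk map is the metadata; a client reading a small range contacts only the storage nodes holding the overlapping chunks instead of transferring the whole block.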
The recent explosion in data sizes manipulated by distributed scientific applications has prompted the need to develop specialized storage systems capable of dealing with specific access patterns in a scalable fashion. In this context, a...
Introspection is the prerequisite of autonomic behavior, the first step towards performance improvement and resource-usage optimization for large-scale distributed systems. In grid environments, the task of observing the application...
As grids become more and more attractive for solving complex problems with high computational and storage requirements, the need for adequate grid programming models is considerable. To this end, the GridRPC model has been proposed as...
In this paper, we show that, although P2P systems and DSM systems have been designed in rather different contexts, both can serve as major sources of inspiration for the design of a hybrid system, with intermediate hypotheses and...
Throughout the process of handling data on HPC systems, parallel file systems play a significant role. With more and more applications, the need for high-performance input/output (I/O) is rising. Different possibilities exist: General Parallel...
The actor model is popular for many types of server applications. Efficient snapshotting of applications is crucial in the deployment of pre-initialized applications or in moving running applications to different machines, e.g., for debugging...
As more and more large-scale applications need to generate and process very large volumes of data, the need for adequate storage facilities is growing. It becomes crucial to efficiently and reliably store and retrieve large sets of data...
Efficient namespace metadata management is increasingly important as next-generation file systems are designed for peta- and exascale. New schemes have been proposed; however, their evaluation has been insufficient due to a lack of...
People used to carry their documents about on CDs only a few years ago. Many people have recently turned to memory sticks. Cloud computing, in this case, refers to the capacity to access and edit data stored on remote servers from any...
Cloud-based file synchronization services are a worldwide resource for many millions of users. However, individual services often have tight resource limits, suffer from outages or shutdowns, and sometimes silently corrupt or leak user...
Workload consolidation, sharing physical resources among multiple workloads, is a promising technique to save cost and energy in cluster computing systems. This paper highlights a few challenges of workload consolidation for Hadoop as one...
Azure Data Lake Store (ADLS) is a fully-managed, elastic, scalable, and secure file system that supports Hadoop Distributed File System (HDFS) and Cosmos semantics. It is specifically designed and optimized for a broad spectrum of Big...
Elastic storage systems can be expanded or contracted to meet current demand, allowing servers to be turned off or used for other tasks. However, the usefulness of an elastic distributed storage system is limited by its agility: how...
The data link layer in a layered communication network is designed to ensure reliable data transfer over a noisy physical channel. Formal specifications are given for physical channels and data links, in terms of I/O automata. Based on...
An important function of communication networks is to implement reliable data transfer over an unreliable underlying network. Formal specifications are given for reliable and unreliable communication layers, in terms of I/O automata....
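The classic way a reliable layer is built on an unreliable one is stop-and-wait retransmission: the sender repeats each sequence-numbered packet until it gets through. A deterministic toy simulation of that idea (illustrative code, not the papers' I/O-automaton model; acknowledgements are assumed lossless for brevity):

```python
def make_lossy_channel(drop_pattern):
    # Drops the i-th transmission when drop_pattern[i] is True.
    pattern = list(drop_pattern)
    def send(packet):
        dropped = pattern.pop(0) if pattern else False
        return None if dropped else packet
    return send

def transfer(messages, channel):
    """Stop-and-wait sender: retransmit each message until it is delivered."""
    delivered = []
    attempts = 0
    for seq, msg in enumerate(messages):
        while True:
            attempts += 1
            packet = channel((seq, msg))
            if packet is not None:   # receiver got it and (reliably) acks
                delivered.append(packet[1])
                break
    return delivered, attempts

# Second message is dropped twice before getting through.
channel = make_lossy_channel([False, True, True, False])
delivered, attempts = transfer(["a", "b"], channel)
```

The sequence number lets the receiver discard duplicates when an ack, rather than the data packet, is lost; handling lossy acks as well is what the alternating-bit protocol adds on top of this sketch.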
In recent times, geospatial datasets are growing in terms of size, complexity, and heterogeneity. High-performance systems are needed to analyze such data to produce actionable insights in an efficient manner. For polygonal, a.k.a. vector,...
In applications ranging from radio telescopes to Internet traffic monitoring, our ability to generate data has outpaced our ability to effectively capture, mine, and manage it. These ultra-high-bandwidth data streams typically contain...
The significant growth of the Internet of Things (IoT) is revolutionizing the way people live by transforming everyday Internet-enabled objects into an interconnected ecosystem of digital and personal information accessible anytime and...