大数据 论文

Papers

  • Published in 2014
  • Published in 2013
  • Published in 2012
  • Published in 2011
  • Published in 2010
  • Published in 2009
  • Published in 2008
  • Published in 2007
  • Published in 2006
  • Published in 2005
  • Published in 2004
  • Published in 2003
  • Published in 2002
  • Published in 2001
  • Published in 2000
  • Published in 1999
  • Published in 1998
  • Published in 1997

2014

  • 2014 - 3D Object Manipulation in a Single Photograph using Stock 3D Models
  • 2014 - A Partitioning Framework for Aggressive Data Skipping
  • 2014 - A Self-Configurable Geo-Replicated Cloud Storage System
  • 2014 - Coordination Avoidance in Database Systems
  • 2014 - DeepFace: Closing the Gap to Human-Level Performance in Face Verification
  • 2014 - Execution Primitives for Scalable Joins and Aggregations in Map Reduce
  • 2014 - f4: Facebookâs Warm BLOB Storage System
  • 2014 - Fastpass: A Centralized "Zero-Queue" Datacenter Network
  • 2014 - First-person Hyper-lapse Videos
  • 2014 - Guess Who Rated This Movie: Identifying Users Through Subspace Clustering
  • 2014 - In Search of an Understandable Consensus Algorithm
  • 2014 - Log-structured Memory for DRAM-based Storage
  • 2014 - Logical Physical Clocks and Consistent Snapshots in Globally Distributed Databases
  • 2014 - MapGraph: A High Level API for Fast Development of High Performance Graph Analytics on GPUs
  • 2014 - Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing
  • 2014 - Orca A Modular Query Optimizer Architecture for Big Data
  • 2014 - Pigeon: A Spatial MapReduce Language
  • 2014 - Scalable Object Detection using Deep Neural Networks
  • 2014 - Sequence to Sequence Learning with Neural Networks
  • 2014 - Show and Tell: A Neural Image Caption Generator

2013

  • 2013 - A Demonstration of SpatailHadoop: An Efficient MapReduce Framework for Spatial Data
  • 2013 - CG_Hadoop: Computational Geometry in MapReduce
  • 2013 - Consistency-Based Service Level Agreements for Cloud Storage
  • 2013 - Dimension Independent Matrix Square using MapReduce
  • 2013 - Druid A Real-time Analytical Data Store
  • 2013 - Event labeling combining ensemble detectors and background knowledge
  • 2013 - Everything You Always Wanted to Know About Synchronization but Were Afraid to Ask
  • 2013 - F1: A Distributed SQL Database That Scales
  • 2013 - GraphX: A Resilient Distributed Graph System on Spark
  • 2013 - HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardinality 2013 Estimation Algorithm
  • 2013 - MillWheel: Fault-Tolerant Stream Processing at Internet Scale
  • 2013 - MLbase: A Distributed Machine-learning System
  • 2013 - Naiad: A Timely Dataflow System
  • 2013 - Online, Asynchronous Schema Change in F1
  • 2013 - Presto: Distributed Machine Learning and Graph Processing with Sparse Matrices
  • 2013 - Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank
  • 2013 - Rich feature hierarchies for accurate object detection and semantic segmentation
  • 2013 - Scalable Progressive Analytics on Big Data in the Cloud
  • 2013 - Scaling Memcache at Facebook
  • 2013 - Scuba: Diving into Data at Facebook
  • 2013 - Shark: SQL and Rich Analytics at Scale
  • 2013 - Some Improvements on Deep Convolutional Neural Network Based Image Classification
  • 2013 - TAO: Facebookâs Distributed Data Store for the Social Graph
  • 2013 - Toward Common Patterns for Distributed, Concurrent, Fault-Tolerant Code
  • 2013 - Unicorn: A System for Searching the Social Graph
  • 2013 - Warp: Lightweight Multi-Key Transactions for Key-Value Stores

2012

  • 2012 - A Few Useful Things to Know about Machine Learning
  • 2012 - A Sublinear Time Algorithm for PageRank Computations
  • 2012 - Avatara: OLAP for Web-scale Analytics Products
  • 2012 - Blink and It's Done. Interactive Queries on Very Large Data
  • 2012 - BlinkDB: Queries with Bounded Errors and Bounded Response Times on Very Large Data
  • 2012 - Dimension Independent Similarity Computation
  • 2012 - Earlybird: Real-Time Search at Twitter
  • 2012 - Fast and Interactive Analytics over Hadoop Data with Spark
  • 2012 - HyperDex: A Distributed, Searchable Key-Value Store
  • 2012 - ImageNet Classification with Deep Convolutional Neural Networks
  • 2012 - Large:Scale Machine Learning at Twitter
  • 2012 - Multi-Scale Matrix Sampling and Sublinear-Time PageRank Computation
  • 2012 - Paxos Made Parallel
  • 2012 - Paxos Replicated State Machines as the Basis of a High-Performance Data Store
  • 2012 - Processing a Trillion Cells per Mouse Click
  • 2012 - Shark: Fast Data Analysis Using Coarse-grained Distributed Memory
  • 2012 - Spanner: Google's Globally-Distributed Database
  • 2012 - The Unified Logging Infrastructure for Data Analytics at Twitter
  • 2012 - The Vertica Analytic Database- C-Store 7 Years Later

2011

  • 2011 - CrowdDB: Answering Queries with Crowdsourcing
  • 2011 - CrowdDB: Query Processing with the VLDB Crowd
  • 2011 - Fast Crash Recovery in RAMCloud
  • 2011 - Hogwild!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
  • 2011 - It's Time for Low Latency
  • 2011 - Matching Unstructured Product Offers to Structured Product Specifications
  • 2011 - Megastore: Providing Scalable, Highly Available Storage for Interactive Services
  • 2011 - Resilient Distributed Datasets- A Fault-Tolerant Abstraction for In-Memory Cluster Computing
  • 2011 - Scarlett: Coping with Skewed Content Popularity in MapReduce Clusters

2010

  • 2010 - Dapper, a Large-Scale Distributed Systems Tracing Infrastructure
  • 2010 - Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers
  • 2010 - Dremel: Interactive Analysis of Web-Scale Datasets
  • 2010 - Finding a needle in Haystack- Facebook's photo storage
  • 2010 - FlumeJava: Easy, Eff¥cient Data-Parallel Pipelines
  • 2010 - Large:scale Incremental Processing Using Distributed Transactions and Notifications
  • 2010 - Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center
  • 2010 - Pregel: A System for Large-Scale Graph Processing
  • 2010 - S4: Distributed Stream Computing Platform
  • 2010 - Spark: Cluster Computing with Working Sets
  • 2010 - The Learning Behind Gmail Priority Inbox
  • 2010 - ZooKeeper: Wait-free coordination for Internet-scale systems

2009

  • 2009 - Cassandra - A Decentralized Structured Storage System
  • 2009 - HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads
  • 2009 - Vertical Paxos and Primary-Backup Replication

2008

  • 2008 - Chukwa: A large-scale monitoring system
  • 2008 - Column:Stores vs. Row-Stores- How Different Are They Really?
  • 2008 - PNUTS: Yahoo!Õs Hosted Data Serving Platform
  • 2008 - Top 10 algorithms in data mining

2007

  • 2007 - Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks
  • 2007 - Dynamo: Amazon's Highly Available Key-value Store
  • 2007 - Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments
  • 2007 - Life beyond Distributed Transactions: an ApostateÕs Opinion
  • 2007 - Paxos Made Live - An Engineering Perspective

2006

  • 2006 - Bigtable: A Distributed Storage System for Structured Data
  • 2006 - Ceph: A Scalable, High-Performance Distributed File System
  • 2006 - Map-Reduce for Machine Learning on Multicore
  • 2006 - The Chubby lock service for loosely-coupled distributed systems

2005

  • 2005 - Fast Paxos

2004

  • 2004 - Cheap Paxos
  • 2004 - MapReduce: Simplified Data Processing on Large Clusters

2003

  • 2003 - The Google File System

2002

  • 2002 - Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services

2001

  • 2001 - Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications
  • 2001 - Paxos Made Simple
  • 2001 - Random Forrest

1999

  • 1999 - Pasting Small Votes for Classification in Large Databases and On-Line
  • 1999 - The PageRank Citation Ranking: Bringing Order to the Web

1997

  • 1997 - Application-Controlled Demand Paging for Out-of-Core Visualization

你可能感兴趣的:(大数据 论文)