mirror of
https://github.com/harvard-edge/cs249r_book.git
synced 2026-04-30 01:29:07 -05:00
225 lines
17 KiB
Plaintext
225 lines
17 KiB
Plaintext
Hallucinator reference check report
|
|
====================================
|
|
|
|
Sources: ['/Users/VJ/GitHub/mlsysbook-vols/book/quarto/contents/vol1/backmatter/references.bib', '/Users/VJ/GitHub/mlsysbook-vols/book/quarto/contents/vol2/backmatter/references.bib']
|
|
|
|
Summary
|
|
-------
|
|
Verified: 886
|
|
Not found: 186
|
|
Author mismatch: 23
|
|
Total: 1095
|
|
|
|
Not found (potential typos or non-indexed):
|
|
[abiresearch2024tinyml] Tiny ML: The Next Big Opportunity in Tech
|
|
[ansible] Automating Server Deployments with Ansible: Utilizing Automation in DevOps.
|
|
[anylogic_synthetic] Synthetic Data for Artificial Intelligence
|
|
[arm_bf16alt] BFloat16 floating-point widening multiply-add long
|
|
[ARM2020] Arm Cortex-M55 Processor Technical Reference Manual
|
|
[bai2019onnx] ONNX: Open Neural Network Exchange
|
|
[beazley2010understanding] Understanding the Python GIL
|
|
[ben2019cost] The Resource-as-a-Service (RaaS) Cloud
|
|
[brutlag2009speed] Speed Matters for Google Web Search
|
|
[cerebras_website] Cerebras Systems: Wafer-Scale AI Accelerators
|
|
[Cerebras2021] Wafer-Scale Deep Learning Acceleration with the Cerebras CS-2
|
|
[Cerebras2021wse2] The Wafer-Scale Engine 2: Scaling AI Compute Beyond GPUs
|
|
[chapman2000crisp] CRISP-DM 1.0: Step-by-step data mining guide
|
|
[choi2020dataechoing] Data Echoing for Efficient Training
|
|
[circleci] CircleCI: Continuous Integration and Delivery Platform
|
|
[cntk_website] Microsoft Cognitive Toolkit (CNTK)
|
|
[crowdflower2016data] Data Science Report 2016
|
|
[dbt] dbt (data build tool)
|
|
[deepbench_github] DeepBench: Benchmarking Deep Learning Operations on Different Hardware
|
|
[deepspeed_training_system_2021] DeepSpeed: Extreme-scale Model Training for Everyone
|
|
[dehghani2022data] Data Mesh: Delivering Data-Driven Value at Scale
|
|
[discord2020rust] Why Discord is switching from Go to Rust
|
|
[dvc] Data Version Control (DVC)
|
|
[elastic] Elasticsearch: Distributed Search and Analytics Engine
|
|
[farmbeats_website] FarmBeats: AI, Edge and IoT for Agriculture
|
|
[fastai_website] fast.ai: Making Neural Nets Uncool Again
|
|
[fayyad1996kdd] From Data Mining to Knowledge Discovery in Databases.
|
|
[fda2021artificial] Artificial Intelligence/Machine Learning (AI/ML)-Based Software as a Medical Device (SaMD) Action Plan
|
|
[Feldman2020] The Cerebras Wafer-Scale Engine: Opportunities and Challenges of Building an Accelerator at Wafer Scale
|
|
[fisher_8087_1981] The 8087 Numeric Data Processor
|
|
[gartner2024cloud] Gartner Forecasts Worldwide Public Cloud End-User Spending to Total \$679 Billion in 2024
|
|
[google_bfloat16] BFloat16: The secret to high performance on Cloud TPUs
|
|
[google_cloud] Google Cloud Platform Documentation
|
|
[google_crowdsource] Crowdsource by Google: A Platform for Collecting Inclusive and Representative Machine Learning Data
|
|
[google_litert] LiteRT (formerly TensorFlow Lite)
|
|
[google2024staticdynamic] Static vs. Dynamic Inference
|
|
[GoogleXLA] XLA: Optimizing Compiler for Machine Learning
|
|
[Graphcore2020] The Colossus MK2 IPU Processor
|
|
[gu2023deep] Deep Learning Model Compression (ii) by Ivy Gu Medium
|
|
[gudivada2017data] Data quality considerations for big data and machine learning: Going beyond data cleaning and transformations
|
|
[guo2019mobile] A Survey on Deep Learning Based Mobile and Online Payment Security
|
|
[harvard_law_chatgpt] Does ChatGPT Violate New York Times Copyrights?
|
|
[hermann2017meet] Meet Michelangelo: Uber's Machine Learning Platform
|
|
[hydra] Hydra: A Framework for Elegantly Configuring Complex Applications
|
|
[ibm_data_drift] IBM Watson OpenScale: Data Drift Detection
|
|
[ieee_2416_2019] IEEE 2416-2019: Standard for Power Modeling to Enable System-Level Analysis
|
|
[ieee_754_2019] IEEE 754-2019: Standard for Floating-Point Arithmetic
|
|
[ieee_spectrum_relu] The History of the ReLU
|
|
[ieee_working_groups] IEEE Standards Association: Working Groups for AI and ML
|
|
[intel2021amx] Intel Advanced Matrix Extensions (Intel AMX)
|
|
[iso_tc] ISO/IEC JTC 1/SC 42 Artificial Intelligence
|
|
[janapa2022mlperf] MLPerf Mobile v2. 0: An Industry-Standard Benchmark Suite for Mobile Machine Learning
|
|
[jax2018github] JAX: composable transformations of Python+NumPy programs
|
|
[kaggle2021state] State of Machine Learning and Data Science 2021
|
|
[keras_website] Keras: Deep Learning for Humans
|
|
[kserve] KServe: Highly Scalable and Standards-Based Model Inference Platform on Kubernetes
|
|
[label_studio] Label Studio: Open Source Data Labeling Platform
|
|
[li2020estimating] Estimating the Training Cost of GPT-3
|
|
[li2021survey] A Survey on Memory Management Strategies for Machine Learning Systems
|
|
[linden2006marissa] Marissa Mayer at Web 2.0
|
|
[little1961proof] A Proof for the Queuing Formula: <i>L</i> = \ensuremath\lambda<i>W</i>
|
|
[mccarthy1956dartmouth] A Proposal for the Dartmouth Summer Research Project on Artificial Intelligence
|
|
[mckinsey2021iot] The Internet of Things: Catching Up to an Accelerating Opportunity
|
|
[mlflow_website] MLflow: An Open Source Platform for the Machine Learning Lifecycle
|
|
[ng2021datacentric] MLOps: From Model-centric to Data-centric AI
|
|
[ntsb2019uber] Collision Between Vehicle Controlled by Developmental Automated Driving System and Pedestrian
|
|
[nvidia_cublas] cuBLAS: CUDA Basic Linear Algebra Subprograms
|
|
[nvidia_cusparse_block] Accelerating Matrix Multiplication with Block Sparse Format and NVIDIA Tensor Cores
|
|
[nvidia_nccl] NVIDIA Collective Communications Library (NCCL)
|
|
[nvidia_simulation] NVIDIA Omniverse and Simulation
|
|
[nvidia_tensorRT_2021] TensorRT: High-Performance Deep Learning Inference Library
|
|
[nvidia_tensors_fp16_2017] Training with Mixed Precision
|
|
[nvidia_tf32] TensorFloat-32 in the A100 GPU Accelerates AI Training, HPC up to 20x
|
|
[nvidia_triton] NVIDIA Triton Inference Server
|
|
[nvidia2017gpu] NVIDIA Tesla V100 GPU Architecture
|
|
[nvidia2020ampere] NVIDIA A100 Tensor Core GPU Architecture
|
|
[NVIDIA2020nvlink] NVLink: Scalable High-Performance Interconnect
|
|
[nvidia2021cudnn] NVIDIA cuDNN Developer Guide
|
|
[nvidia2024tensorrt] NVIDIA TensorRT: Programmable Inference Accelerator
|
|
[nvidia2024triton] NVIDIA Triton Inference Server: Developer Documentation
|
|
[oecd_ai_2021] Measuring the Geographic Distribution of AI Computing Capacity
|
|
[oneDNN2021] oneDNN: Intel's Deep Learning Neural Network Library
|
|
[onnxruntime2024] ONNX Runtime: Cross-Platform Inference and Training Machine-Learning Accelerator
|
|
[palmer_8087_1981] The INTEL\textregistered 8087 numeric data processor
|
|
[patrick_debois] DevOpsDays: The Birth of DevOps
|
|
[prefect] Prefect: Workflow Orchestration Framework for Python
|
|
[prometheus] Prometheus: Monitoring System and Time Series Database
|
|
[pytorch_sparsity_blog] Accelerating Neural Network Training with Sparse Tensors
|
|
[radford2018improving] Improving language understanding by generative pre-training
|
|
[rosenblatt1957perceptron] The Perceptron: A Perceiving and Recognizing Automaton
|
|
[sambanova_website] SambaNova: The Fastest AI Inference Platform and Hardware
|
|
[savage2009flaw] The Flaw of Averages: Why We Underestimate Risk in the Face of Uncertainty
|
|
[scardapane2020should] Why should I trust you? A survey of explainability of machine learning for healthcare
|
|
[schelter2018automating] Automating large-scale machine learning model management
|
|
[scikit_learn_confusion_matrix] sklearn.metrics.confusion\_matrix --- scikit-learn documentation
|
|
[scikit_learn_feature_selection] Feature selection --- scikit-learn documentation
|
|
[scikit_learn_metrics] Metrics and scoring: quantifying the quality of predictions --- scikit-learn documentation
|
|
[sec2013knight] Securities Exchange Act of 1934, Release No. 70694: Knight Capital Americas LLC
|
|
[Shang2018GenomicsAccel] Accelerating Genomic Data Analysis with Domain-Specific Architectures
|
|
[sifre2014rigid] Rigid-Motion Scattering for Image Classification
|
|
[sodani2017knl] Knights landing (KNL): 2nd Generation Intel\textregistered Xeon Phi processor
|
|
[standish2020chaos] CHAOS 2020: Beyond Infinity
|
|
[statista2024email] Number of sent and received e-mails per day worldwide from 2017 to 2027
|
|
[tang2020understanding] Primordial black holes and secondary gravitational waves from k/G inflation
|
|
[tensorflow_data_2015] TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems
|
|
[time_openai_kenya] Exclusive: OpenAI Used Kenyan Workers on Less Than \$2 Per Hour to Make ChatGPT Less Toxic
|
|
[uber2017michelangelo] Michelangelo: Uber's machine learning platform
|
|
[uci_repo] UCI Machine Learning Repository
|
|
[venturebeat_datasets] 3 Big Problems with Datasets in AI and Machine Learning
|
|
[vertex_ai_model_registry] Vertex AI Model Registry
|
|
[wandb] Weights \& Biases: The AI Developer Platform
|
|
[waymo_website] Waymo: Autonomous Driving Technology
|
|
[wikitext_website] The WikiText Long Term Dependency Language Modeling Dataset
|
|
[wolf2017we] Why We Use Bigtable: Googles Software Infrastructure for Storing Machine Learning Training Data
|
|
[wu_tensor_2019] Tensor Cores: Understanding, Programming, and Performance Analysis
|
|
[wu2019fast] Fast Neural Networks: Efficient and Adaptive Computation for Inference
|
|
[yu2022orca] Orca: A Distributed Serving System for Transformer-Based Generative Models.
|
|
[zaharia2018accelerating] Accelerating the Machine Learning Lifecycle with MLflow
|
|
[zhang2020optimizing] Optimizing Memory Access for Deep Learning Workloads
|
|
[zhang2021learning] Learning-based Efficient Sparsity and Quantization for Neural Network Compression
|
|
[amodei2016concrete] Concrete Problems in AI Safety
|
|
[autogpt2023] AutoGPT: An Autonomous GPT-4 Experiment
|
|
[aws2020s3] Amazon S3 Strong Consistency
|
|
[Balde_2017] The Global E-Waste Monitor 2017: Quantities, Flows and Resources
|
|
[Balde2017] The Global E-waste Monitor 2017: Quantities, Flows, and Resources
|
|
[barocas-hardt-narayanan] Fairness and Machine Learning: Limitations and Opportunities
|
|
[berger2014kolmogorov] Wiley StatsRef: Statistics Reference Online
|
|
[berkeley2024compound] The Shift from Models to Compound AI Systems
|
|
[brakerski2022federated] Federated Learning and the Rise of Edge Intelligence: Challenges and Opportunities
|
|
[brown2021longterm] Long-Term Software Support: A Key Factor in Sustainable AI Hardware
|
|
[cadwalladr2018cambridge] Revealed: 50 million Facebook profiles harvested for Cambridge Analytica in major data breach
|
|
[chollet2019measure] On the Measure of Intelligence
|
|
[cope2009pure] Pure water, semiconductors and the recession
|
|
[davies2011endangered] Endangered elements: Critical thinking
|
|
[davis2022uptime] Uptime Institute Global Data Center Survey 2022
|
|
[dean2024mlsys] AI Hypercomputer: Towards an Architecture for Exascale AI
|
|
[deng2022tinytrain] TinyTrain: Learning to Train Compact Neural Networks on the Edge
|
|
[diao2023sparse] Pruning and Sparse Training for On-Device Neural Network Optimization
|
|
[ellenmacarthur2017circular] Circular Economy and Sustainable AI: Designing Out Waste in the Tech Industry
|
|
[euaiact2024] Regulation (EU) 2024/1689 of the European Parliament and of the Council laying down harmonised rules on artificial intelligence (AI Act)
|
|
[EuropeanCommission2023sustainability] Sustainable Digital Markets Act: Environmental Transparency in AI
|
|
[evans2016deepmind] DeepMind AI Reduces Google Data Centre Cooling Bill by 40\%
|
|
[facebook2016fblearner] Introducing FBLearner Flow: Facebook's AI backbone
|
|
[feast2019] Feast: An open source feature store for machine learning
|
|
[framework2022modular] Modular Laptops: A New Approach to Sustainable Computing
|
|
[gdpr2016] Regulation (EU) 2016/679 of the European Parliament and of the Council on the protection of natural persons with regard to the processing of personal data and on the free movement of such data (General Data Protection Regulation)
|
|
[gdpr2016regulation] Regulation (EU) 2016/679 of the European Parliament and of the Council
|
|
[ghgprotocol2023] Greenhouse Gas Protocol: Corporate Standard
|
|
[google2023cooling] Efficiency: How We Do It
|
|
[greengard2021internet] The Internet of Things
|
|
[handlin1965science] Science and technology in popular culture
|
|
[harris2023semiconductor] The Environmental Cost of Next-Generation AI Chips: Energy, Water, and Carbon Impacts
|
|
[infiniband2000spec] InfiniBand Architecture Specification Volume 1
|
|
[johnson2018right] The Right to Repair Movement and Its Implications for AI Hardware Longevity
|
|
[Jones2021datacenters] The Environmental Impact of Data Centers: Challenges and Sustainable Solutions
|
|
[lecun2022path] A Path Towards Autonomous Machine Intelligence
|
|
[mckinsey2018ai] Notes from the AI Frontier: Modeling the Impact of AI on the World Economy
|
|
[meta2022rsc] Building the Most Powerful AI Supercomputer: Meta's AI Research SuperCluster
|
|
[nasa2000mpl] Report on the Loss of the Mars Polar Lander and Deep Space 2 Missions
|
|
[netflix2019interleaving] Innovating faster on personalization algorithms at Netflix using interleaving
|
|
[ntsb2017tesla] Collision Between a Car Operating With Automated Vehicle Control Systems and a Tractor-Semitrailer Truck Near Williston, Florida, May 7, 2016
|
|
[ntsb2019uber] Collision Between Vehicle Controlled by Developmental Automated Driving System and Pedestrian, Tempe, Arizona, March 18, 2018
|
|
[Patterson2022carbonaware] Carbon-Aware Computing for Sustainable AI
|
|
[pew2023ai] What Americans Know About AI, Cybersecurity and Big Tech
|
|
[pont2002using] Using watchdog timers to improve the reliability of single-processor embedded systems: Seven new patterns and a case study
|
|
[puckett2016e-waste] E-Waste and the Global Environment: The Hidden Cost of Discarded Electronics
|
|
[russell2022tech] Tech Industry Trends in Hardware Lock-In and Their Sustainability Implications
|
|
[savas2022ml] ML-ExRay: Visibility and explainability for monitoring ML model behavior
|
|
[schulman2017proximal] Proximal Policy Optimization Algorithms
|
|
[schwan2003lustre] Lustre: Building a File System for 1,000-node Clusters
|
|
[semianalysisGPT4] GPT-4 Architecture, Infrastructure, Training Dataset, Costs, Vision, MoE
|
|
[shafahi2019adversarial] Adversarial Training for Free!
|
|
[sharma2020industrial] Industrial AI and Vendor Lock-In: The Hidden Costs of Proprietary Ecosystems
|
|
[sigelman2010dapper] Dapper, a large-scale distributed systems tracing infrastructure
|
|
[sornin2015lora] Low Power Long Range Transmitter
|
|
[spotify2019mlinfra] The winding road to better machine learning infrastructure through TensorFlow Extended and Kubeflow
|
|
[Statista_2022] Number of Internet of Things (IoT) connected devices worldwide from 2019 to 2030
|
|
[TheBigHa77] The Big Hack: How China Used a Tiny Chip to Infiltrate U.S. Companies - Bloomberg
|
|
[thegreengrid2007pue] Green Grid Data Center Power Efficiency Metrics: PUE and DCIE
|
|
[triton.jit] triton.jit Decorator --- Triton Programming Model
|
|
[tsmc2023water] TSMC Arizona Fab 21 Water Usage Impact Assessment
|
|
[un2019circular] A New Circular Vision for Electronics, Time for a Global Reboot
|
|
[wan2023vpp] Vpp: The vulnerability-proportional protection paradigm towards reliable autonomous machines
|
|
[worldbank2016ebola] 2014-2015 West Africa Ebola Crisis: Impact Update
|
|
[Xu2021edge] Edge Intelligence: Architectures, Challenges, and Applications
|
|
[zhang2020efficient] Efficient Task-Specific Adaptation for Deep Models
|
|
|
|
Author mismatch:
|
|
[aws] Amazon Web Services (AWS)
|
|
[aws_s3] Amazon Simple Storage Service (S3)
|
|
[brynjolfsson2014second] The second machine age: work, progress, and prosperity in a time of brilliant technologies, 1st Edition.
|
|
[domingos2015master] The master algorithm: how the quest for the ultimate learning machine will remake our world
|
|
[google2024gemini] Gemini: A Family of Highly Capable Multimodal Models
|
|
[inmon2005building] Building the data warehouse
|
|
[lime_github] LIME (Local Interpretable Model-Agnostic Explanations)
|
|
[sculley2015hidden] Technical Debt in Machine Learning Systems
|
|
[Taylor2017ASICMining] The Evolution of Bitcoin Hardware
|
|
[terraform] Terraform: Infrastructure as Code
|
|
[who2019classification] International Classification of Diseases, 11th Revision (ICD-11)
|
|
[claude2022constitutional] Constitutional AI: Harmlessness from AI Feedback
|
|
[gibiansky2017baidu] BRINGING BACK OLD PHOTOS INTO LIFE USING DEEP LEARNING TECHNIQUES
|
|
[grossman2007high] High tech trash: Digital devices, hidden toxics, and human health
|
|
[hipaa1996health] The Health Insurance Portability and Accountability Act of 1996: understanding the anti-kickback laws.
|
|
[jain1991art] The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling
|
|
[miller2015remote] The Antivirus Hacker's Handbook
|
|
[poff2002aquatic] Aquatic ecosystems \& Global climate change
|
|
[quinonero2009dataset] Dataset Shift in Machine Learning
|
|
[schafer2023notorious] The Notorious GPT: science communication in the age of artificial intelligence
|
|
[wei2022emergent] Emergent Abilities of Large Language Models
|
|
[who2016ebola] Lessons learned from the World Health Organization's late initial response to the 2014-2016 Ebola outbreak in West Africa
|
|
[work_of_the_future_2020] The Work of the Future: Building Better Jobs in an Age of Intelligent Machines
|