Tag Archives: dataset

對 Open Data 的攻擊手段

前陣子看到的「Membership Inference Attacks against Machine Learning Models」,裡面試著做到的攻擊手法: [G]iven a data record and black-box access to a model, determine if the record was in the model's training dataset. 也就是拿到一組 Open Data 的存取權限,然後發展一套方法判斷某筆資料是否在裡面。而驗證攻擊的手法當然就是直接攻擊看效果: We empirically evaluate our inference techniques on classification models … Continue reading

Posted in Computer, Murmuring, Privacy, Programming, Search Engine, Security | Tagged , , , , , , , , , , | 1 Comment

Google 整理並公開出九百萬張圖片以及對應的 tag

Google 放出了九百萬張以 CC 授權釋出的圖片,標上 tag 後變成 Open Images dataset:「Introducing the Open Images Dataset」,像是這樣: Annotated images form the Open Images dataset. Left: Ghost Arches by Kevin Krejci. Right: Some Silverware by J B. Both images used under CC BY 2.0 license … Continue reading

Posted in Computer, Murmuring, Network, Programming | Tagged , , , , , , , | Leave a comment

Google BigQuery 提供的 Public Datasets

跟 AWS 的「AWS Public Data Sets」一樣,Google Cloud Platform 也提供了類似的服務給使用 Google BigQuery 的人使用:「Google BigQuery Public Datasets」。 目前資料看起來比較少 (因為最近才建立),包括了這六個項目: USA Names Data NYC TLC Trips Hacker News USA Disease Data GDELT Books Corpus NOAA GSOD Weather 在「Other Public Datasets」的地方就是不寫 AWS 的... XD

Posted in AWS, Cloud, Computer, Murmuring, Network, Science | Tagged , , , , , , , , | Leave a comment