Library – Page 12 – Gea-Suan Lin's BLOG

Intel 最新的 Ice Lake 系列對 AES 的加速

Twitter 上看到這篇，講 Intel 推出新的指令集，對 AES 的加速效果：

#ePrint Making AES great again: the forthcoming vectorized AES instruction: N Drucker, S Gueron https://t.co/kM0u6XHv3e

— IACR (@IACR_News) May 1, 2018

進去看以後發現是講四月推出的 Ice Lake，在上面新增的 VPCLMULQDQ 指令對效能的幫助：

The introduction of the processor instructions AES-NI and VPCLMULQDQ, that are designed for speeding up encryption, and their continual performance improvements through processor generations, has significantly reduced the costs of encryption overheads.

而他們發表出來的數據說 AES-GCM 的效率直接從 ~23 cycles/byte 降到 0.64 cycles/byte，大約是 35 倍的改進？

More and more applications and platforms encrypt all of their data and traffic. As an example, we note the world wide proliferation of the use of AES-GCM, with performance dropping down to 0.64 cycles per byte (from ~23 before the instructions), on the latest Intel processors.

就算不是 AES-GCM，而是其他的 AES 相關演算法，也是三倍以上的改善：

這效能差異...

Trac 1.2 上惡搞 TracTicketReferencePlugin 讓他會動...

TracTicketReferencePlugin 是個 Trac 上設定 ticket 之間關聯性的 plugin，與子母票有比較強烈的關係不同，有些人喜歡把相關的資料都掛在這邊。先前用都是在 Trac 1.0 上用，也沒什麼問題，最近在 Trac 1.2 上跑發現直接有 js error...

原因是 TracTicketReferencePlugin 用到 jQuery.live()，而這個函式在 jQuery 1.7 被宣告 deprecated，在 jQuery 1.9 被正式拔掉了。而 Trac 1.0 用的是 jQuery 1.7，所以不會有問題；Trac 1.2 用的是 jQuery 1.12，於是就爛掉了...

剛好就是昨天，有人在作者的 repository 上發 issue 出來 (另外也提供了 patch)：「t2y / trac.plugins.ticketref / issues / #9 - JavaScript errors with Trac 1.2 — Bitbucket」。雖然 patch 本身不算難，但我實在懶的查 Mercurial 的操作，於是決定用 workaround 解...

想法是讓 jQuery.live() 會動，這點在 jQuery 官方出的 Migrate 就可以達到 (需要用 1.4.1 版，而不是 3.0.1 版)，於是第一步就是在 site.html 內直接先下手為強，讀 jQuery 後馬上讀 Migrate：

  <!--! Add site-specific style sheet -->
  <head py:match="head" py:attrs="select('@*')">
    <script src="https://code.jquery.com/jquery-1.12.4.min.js"></script>
    <script src="https://code.jquery.com/jquery-migrate-1.4.1.min.js"></script>
    ${select('*|comment()|text()')}

但 Trac 還是會再載入一次的 jQuery 而蓋過去，所以我們要讓 Trac 不會載入，在 trac.ini 內的 jquery_location 設定讓他讀一個空的 js 檔：

jquery_location = site/null.js

然後生一個空的 trac/htdocs/null.js 讓他讀。

最後開一張票追蹤，看什麼時候官方放新版，再把這串 workaround 拔掉...

TLS 1.3 進入 Proposed Standard

最近蠻熱的一個新聞，TLS 1.3 的 draft-ietf-tls-tls13-28.txt 進入 Proposed Standard 了 (在「draft-ietf-tls-tls13-28 - The Transport Layer Security (TLS) Protocol Version 1.3」這邊可以看到歷史記錄)：「Protocol Action: 'The Transport Layer Security (TLS) Protocol Version 1.3' to Proposed Standard (draft-ietf-tls-tls13-28.txt)」。

沒意外的話這就會是最終版本了。如果要看 TLS 1.2 與 TLS 1.3 的差異，看維基百科上的 Transport Layer Security - TLS 1.3 會比較清楚。

大家等很久了... 像是 OpenSSL 1.1.1 其實一部分也是在等 TLS 1.3 正式推出：(出自「Using TLS1.3 With OpenSSL」)

OpenSSL 1.1.1 will not be released until (at least) TLSv1.3 is finalised. In the meantime the OpenSSL git master branch contains our development TLSv1.3 code which can be used for testing purposes (i.e. it is not for production use).

主要還是期待非 NSA 派系的 cipher (其實幾乎都是 djb 的戰果) 與 1-RTT handshake，後續等 TLS 1.3 變成 Standard Track 應該就會被各家瀏覽器開預設值了...

PyPI 支援 Markdown

在「Markdown Descriptions on PyPI」這邊看到 PyPI 的網站要支援 Markdown 語法了，目前有 markdown-description-example 這個專案可以看效果。

同一個 package，舊的在「markdown-description-example 0.0.1 : Python Package Index」這邊可以看到，新的則可以在「markdown-description-example · Warehouse Preview (PyPI)」這邊看到。

code block 的地方看起來還不太好看，也許之後還會調整？

拿來畫手繪感的 Rough.js

看到「Rough.js」這個專案，可以拿來產生手繪感的圖，像這些：

先記錄起來...

測試 TPUv2 的 C/P 值

有人用相同演算法實際測試 Google 的 TPUv2 與 NVIDIA 的 Tesla P100 的 C/P 值了：「Benchmarking Google’s new TPUv2」。

如果以 ResNet-50 當作計算的演算法，可以看到其實 C/P 值的差距沒有想像中大。主要原因是 GPU 可以使用較低的精度計算以加快速度，而非 Google 之前新聞稿故意使用較高精度比較 (TPU 使用 8-bit matrix engine，所以 GPU 使用較低的 fp16 版本比較會比較有參考價值)：

真正的差異是在 LSTM：

It turns out that the TPU is even faster on the LSTM model (21402 examples/s): ~12.9 times faster than a P100 (1658 examples/s) and ~7.7 times faster than a V100 (2778 examples/s)!

不過這邊就沒特別提到精度了...

Seam Carving (接縫裁剪)

看到有人實做 Seam Carving (接縫裁剪) 了，用 Golang 寫的，放在 GitHub 的 esimov/caire 這邊，副標題「Content aware image resize library」。實做了「Seam Carving for Content-Aware Image Resizing」這篇論文。

Seam Carving 指的是知道內容的 resize，像是把上面這張變成下面這張：

或是變大：

馬上可以想到的應用是需要保留資訊內容，但又想要大量提供資訊的地方，像是 Nuzzle 的縮圖 (或是以前的 Zite)，或是網路新聞媒體的首頁所用的縮圖。不知道還有沒有其他地方可以用...

2FA 的 QR code 與 CanvasFingerprintBlock

在「Rasmus Lerdorf 關於 VPS 的介紹測試...」這篇的留言裡，Jimmy 提到 Vultr 是有 2FA 可以用的 (當初沒找到...)，於是就花了點時間設定...

但設定的過程中發現 TOTP 的 QR code 出不來，但在 dev console 裡面卻看得到 img 元素。

這種情況前幾個月在另外一個網站上也遇過 (當下拿 Firefox 測也不行)，於是就認為他們網站的問題，開了 support ticket 也沒回，一直沒下文的情況下就丟著。現在在 Vultr 上又遇到同樣的問題的話，看起來有可能是我的問題 (或是他們兩個站台都用同樣的 library)，於是就仔細點找...

找的過程中間發現有 canvas 元素，然後 canvas 元素有個 inline css 是 display: none;，先試著把這條拿掉，就出現了... 接下來就好猜了。

在 2014 年的「用 Canvas Fingerprint 取代部份 Cookie」這篇就有提過可以用 Canvas 追蹤使用者的問題，於是就有介紹了 CanvasFingerprintBlock 這個在 Google Chrome 上的套件，阻擋 Canvas 的存取。一關掉這個套件就正常了 XDDD

為了隱私問題，套件本身還是掛著，但當遇到發現有 QR code 出不來的時候就知道去 dev console 內改掉 XDDD

然後回到原來本來以為有問題的那個網站，也是一樣進 dev console 改掉後就看得到可以掃了... 看起來這兩個站可能是用一樣的 library？找出來再去戳這兩個站好了...

用 Composer 的 require 限制，擋掉有安全漏洞的 library...

查資料的時候查到的，在 GitHub 上的 Roave/SecurityAdvisories 這個專案利用 Composer 的 require 條件限制，擋掉有安全漏洞的 library：

This package ensures that your application doesn't have installed dependencies with known security vulnerabilities.

看一下 composer.json 就知道作法了，裡面的 description 也說明了這個專案的用法：

Prevents installation of composer packages with known security vulnerabilities: no API, simply require it

這方法頗不賴的 XDDD

兩個 gperf...

翻資料的時候覺得怎麼跟印象中的不太一樣，多花些時間翻了一下，發現原來有兩個東西同名...

一個是 GNU 的 gperf，給定字串集合，產生 C 或 C++ 的 perfect hash function (i.e. no collision)：

GNU gperf is a perfect hash function generator. For a given list of strings, it produces a hash function and hash table, in form of C or C++ code, for looking up a value depending on the input string. The hash function is perfect, which means that the hash table has no collisions, and the hash table lookup needs a single string comparison only.

另外一個是 Google 弄出來的 gperftools，malloc() 的替代品以及效能分析工具：

gperftools is a collection of a high-performance multi-threaded malloc() implementation, plus some pretty nifty performance analysis tools.