跳到主要內容
One of the great perks of living in the San Francisco Bay Area is proximity to some amazing wine regions. Over the last couple years, I've visited vineyards in regions like Napa Valley, Sonoma Valley, Paso Robles, and even Malibu. I recently ran into a machine learning data set that has data on 6000 Portuguese wines that includes a 1-10 quality rating, which seems like a great excuse to build a neural network that can predict the 1-10 quality rating based on factors like residual sugar and alcohol content. Effectively, this neural network attempts to match the wine palate of whoever put this data set together.

Training a Neural Network with Brain.js

Brain.js is a simple npm module for building neural networks, a common machine learning model that you might see in an undergraduate AI class.
The wine data can be downloaded here. The file is a CSV that uses semi-colons (;) as a delimiter. The contents look like this:
The first 11 columns are various chemical properties of a given wine, and the 12th and final column is a "quality" score that represents how good this wine tastes according to the person who recorded this data.
"Training" is how you build a neural network. Given some training data, Brain.js builds a mathematical model for predicting the quality rating of a wine based on the chemical properties. Below is an example from the Brain.js docs about how to train and then use a neural network.
var net = new brain.NeuralNetwork();

net.train([{input: { r: 0.03, g: 0.7, b: 0.5 }, output: { black: 1 }},
           {input: { r: 0.16, g: 0.09, b: 0.2 }, output: { white: 1 }},
           {input: { r: 0.5, g: 0.5, b: 1.0 }, output: { white: 1 }}]);

var output = net.run({ r: 1, g: 0.4, b: 0 });  // { white: 0.99, black: 0.002 }
For the wine data, the input will be an object representing the chemical properties, and the output will contain one property, the quality. One key detail about Brain.js is that all inputs must be between 0 and 1, so you need to scale some of the inputs. Below is the first wine from the CSV converted into a format that Brain.js can use for training a neural network.
{ input:
   { 'fixed acidity': 0.7,
     'volatile acidity': 0.027000000000000003,
     'citric acid': 0.036,
     'residual sugar': 0.0207,
     chlorides: 0.0045,
     'free sulfur dioxide': 0.045,
     'total sulfur dioxide': 0.17,
     density: 0.1001,
     pH: 0.3,
     sulphates: 0.045,
     alcohol: 0.08800000000000001 },
  output: { quality: 0.6 } }
Below is the code for training a neural network on the first 1000 wines in the CSV.
const { NeuralNetwork } = require('brain.js');
const _ = require('lodash');
const fs = require('fs');

const raw = fs.readFileSync('./winequality-white.csv', 'utf8').split('\n');
const headers = raw[0].split(';').map(header => header.replace(/"/g, ''));

// Convert the raw data from a string into an array of objects where property
// names match the column headers.
const data = raw.
  slice(1).
  map(line => line.split(';').
  reduce((cur, v, i) => {
    // Ensure that numberic values are between 0 and 1
    // Admittedly this is a bit hacky, and I'd love to hear how machine
    // learning experts handle this.
    if (headers[i].includes('sulfur') || headers[i].includes('sugar')) {
      cur[headers[i]] = parseFloat(v) / 1000;
    } else if (headers[i].includes('alcohol')) {
      cur[headers[i]] = parseFloat(v) / 100;
    } else {
      // Quality will be 0.1-1 rather than 1-10
      cur[headers[i]] = parseFloat(v) / 10;
    }
    return cur;
  }, {}));

const net = new NeuralNetwork();
const numTrainingData = 1000;

const trainingData = data.
  slice(0, numTrainingData).
  map(obj => ({
    input: _.omit(obj, ['quality']),
    output: _.pick(obj, ['quality'])
  }));

console.log(trainingData[0]);

console.log('done training', net.train(trainingData));
Once you have trained a neural network, you can use it to estimate the quality of subsequent wines based on their chemical properties. Below is code that takes the neural network, runs it on the next 50 wines, and calculates the average difference between the neural network's prediction and the actual quality of the wine.
let error = 0;
for (let i = 0; i < 50; ++i) {
  const { quality } = net.run(_.omit(data[numTrainingData + i], ['quality']));
  error += Math.abs(quality - data[numTrainingData + i].quality);
  console.log(i, quality, data[numTrainingData + i].quality);
}
console.log('Average error', error / 50);

console.log('done');
Below is the truncated output. This rudimentary neural network gets within about 0.6 of the actual quality rating on average.
45 0.602045476436615 0.5
46 0.5928407311439514 0.5
47 0.4441471993923187 0.5
48 0.449766606092453 0.5
49 0.7137854695320129 0.6
Average error 0.06042885661125182

Serializing the Neural Network

In practice you don't want to recompute the neural network every time, because even in this simple example training the neural network takes approximately 20 seconds. You can serialize the neural network using the toJSON() function:
// Serialize the neural network as JSON to a file
fs.writeFileSync('./net.json', JSON.stringify(net.toJSON(), null, '  '));
Open up the net.json file to see what the neural network looks like. Neural networks consist of "nodes" or "neurons" that assign a weight to each input. When you train a neural network, brain.js searches to try to come up with weights that match the training data as closely as possible. Here's a sample node from the net.json file that shows the weights for each parameter.
{
  "bias": -5.532558917999268,
  "weights": {
    "fixed acidity": 1.0129427909851074,
    "volatile acidity": -3.8902039527893066,
    "citric acid": -0.4018211364746094,
    "residual sugar": -0.5149407386779785,
    "chlorides": -3.0765116214752197,
    "free sulfur dioxide": 2.4955267906188965,
    "total sulfur dioxide": -0.5537568926811218,
    "density": -1.1998544931411743,
    "pH": 3.0909314155578613,
    "sulphates": 2.17152738571167,
    "alcohol": 9.936287879943848
  }
}
You can then load the neural network from the JSON file and re-use it.
const net = new NeuralNetwork();

net.fromJSON(JSON.parse(fs.readFileSync('./net.json', 'utf8')));

// ...

let error = 0;
for (let i = 0; i < 50; ++i) {
  const { quality } = net.run(_.omit(data[numTrainingData + i], ['quality']));
  error += Math.abs(quality - data[numTrainingData + i].quality);
  console.log(i, quality, data[numTrainingData + i].quality);
}
console.log('Average error', error / 50);

Moving On

There's an npm module for just about everything, even machine learning. Brain.js is one of the older libraries. There's also a newer one by Google that supposedly has better performance called deeplearn. If you're interested in the theory of machine learning, I highly recommend Artificial Intelligence: A Modern Approach by Stuart Russell and Peter Norvig. R&N is the standard textbook for undergraduate AI courses and serves as an excellent introduction.

留言

這個網誌中的熱門文章

2017通訊大賽「聯發科技物聯網開發競賽」決賽團隊29強出爐!作品都在11月24日頒獎典禮進行展示

2017通訊大賽「聯發科技物聯網開發競賽」決賽團隊29強出爐!作品都在11月24日頒獎典禮進行展示 LIS   發表於 2017年11月16日 10:31   收藏此文 2017通訊大賽「聯發科技物聯網開發競賽」決賽於11月4日在台北文創大樓舉行,共有29個隊伍進入決賽,角逐最後的大獎,並於11月24日進行頒獎,現場會有全部進入決賽團隊的展示攤位,總計約為100個,各種創意作品琳琅滿目,非常值得一看,這次錯過就要等一年。 「聯發科技物聯網開發競賽」決賽持續一整天,每個團隊都有15分鐘面對評審團做簡報與展示,並接受評審們的詢問。在所有團隊完成簡報與展示後,主辦單位便統計所有評審的分數,並由評審們進行審慎的討論,決定冠亞季軍及其他各獎項得主,結果將於11月24日的「2017通訊大賽頒獎典禮暨成果展」現場公佈並頒獎。 在「2017通訊大賽頒獎典禮暨成果展」現場,所有入圍決賽的團隊會設置攤位,總計約為100個,展示他們辛苦研發並實作的作品,無論是想觀摩別人的成品、了解物聯網應用有那些新的創意、尋找投資標的、尋找人才、尋求合作機會或是單純有興趣,都很適合花點時間到現場看看。 頒獎典禮暨成果展資訊如下: 日期:2017年11月24日(星期五) 地點:中油大樓國光廳(台北市信義區松仁路3號) 我要報名參加「2017通訊大賽頒獎典禮暨成果展」>>> 在參加「2017通訊大賽頒獎典禮暨成果展」之前,可以先在本文觀看各團隊的作品介紹。 決賽29強團隊如下: 長者安全救星 可隨意描繪或書寫之電子筆記系統 微觀天下 體適能訓練管理裝置 肌少症之行走速率檢測系統 Sugar Robot 賽亞人的飛機維修輔助器 iTemp你的溫度個人化管家 語音行動冰箱 MR模擬飛行 智慧防盜自行車 跨平台X-Y視覺馬達控制 Ironmet 菸消雲散 無人小艇 (Mini-USV) 救OK-緊急救援小幫手 穿戴式長照輔助系統 應用於教育之模組機器人教具 這味兒很台味 Aquarium Hub 發展遲緩兒童之擴增實境學習系統 蚊房四寶 車輛相控陣列聲納環境偵測系統 戶外團隊運動管理裝置 懷舊治療數位桌曆 SeeM智能眼罩 觸覺點字學習系統
2019全台精選3+個燈會,週邊順遊景點懶人包 2019燈會要去哪裡看?全台精選3+個燈會介紹、週邊順遊景點整理給你。 東港小鎮燈區-鮪鮪到來。 2019-02-15 微笑台灣編輯室 全台灣 各縣市政府 1435 延伸閱讀 ►  元宵節不只看燈會!全台元宵祭典精選、順遊景點整理 [屏東]2019台灣燈會在屏東 2/9-3/3:屏東市 · 東港鎮 · 大鵬灣國家風景區 台灣燈會自1990年起開始辦理,至2019年邁入第30週年,也是首次在屏東舉辦,屏東縣政府與交通部觀光局導入創新、科技元素,融入在地特色文化設計,在東港大鵬灣國家風景區打造廣闊的海洋灣域燈區,東港鎮結合漁港及宗教文化的小鎮燈區,及屏東市綿延近5公里長的綵燈節河岸燈區,讓屏東成為璀璨的光之南國,迎向國際。 詳細介紹 ►  2019台灣燈會在屏東 第一次移師國境之南 大鵬灣燈區 主題樂園式燈會也是主燈所在區,區內分為農業海洋燈區、客家燈區、原住民燈區、綠能環保燈區、藝術燈區、宗教燈區、競賽花燈及317個社區關懷據點手作的萬歲光廊等。 客家燈籠隧道。 平日:周一~周四14:00-22:30(熄燈) 假日:周五~周六10:00-22:30(熄燈)  屏東燈區: 萬年溪畔 屏東綵燈節藍區-生態。 綵燈節--每日17:30 - 22:00(熄燈) 勝利星村--平日:14:00 - 22:30(熄燈) 假日:10:00 - 22:30(熄燈) 燈區以「彩虹」為主題,沿著蜿蜒市區的萬年溪打造近5公里長的光之流域,50組水上、音樂及互動科技等不同類型燈飾,呈現紅色熱情、橙色活力、黃色甜美、綠色雄偉、藍色壯闊、靛色神祕、紫色華麗等屏東風情。勝利星村另有懷舊風的燈飾,及屏東公園聖誕節燈飾。 東港小鎮燈區 東港小鎮燈區-鮪鮪到來。 小鎮燈區以海的屏東為主題,用漁港風情及宗教文化內涵規劃4個主題區,分別為張燈結綵趣、東津好風情、神遊幸福海、延平老街區。每日17:00~22:30(熄燈) 以上台灣燈會資料來源: 2019台灣燈會官網 、 i屏東~愛屏東 。 >> 順遊行程 小吃旅行-東港小鎮 東港小吃和東港人一樣,熱情澎湃而且誠意滿滿,從市街找到巷裡,早餐吃到宵夜,可惜
自製直播音源線 Bird Liang   October 6, 2016   in  View Bird Liang, Chief Engineer (梁子凌 / 技術長兼工程輔導長) 負責 AppWorks 技術策略與佈署,同時主導工程輔導。人生的第一份正職工作是創業,之後在外商圈電子業中闖蕩多年,經歷過 NXP、Sony、Newport Imagining、Crossmatch 等企業,從事無線通訊、影像系統、手機、面板、半導體、生物辨識等不同領域產品開發。熱愛學習新事物,協助團隊解決技術問題。放棄了幾近退休般的生活加入 AppWorks,為的是幫助更多在創業路上的人,並重新體驗創業的熱情。台大農機系、台科大電子所畢業,熱愛賞鳥、演奏管風琴,亦是不折不扣的熱血 Maker。 隨著 Facebook 開放一般帳號直播,現在我們只要拿起手機,隨時隨地都可以開始直播。回想幾年前 AppWorks 剛開始進行 Demo Day 直播時,還要將 HDMI 訊號接進 PC 中、再編碼打進 YouTube 的複雜度,實不可同日而語。 但用手機或平板直播最大的問題往往不是影像而是聲音。iPhone 或 iPad 上的攝影機,感度和解析度早已不輸數年前的專業攝影機,只要現場光不太差,大概都可以拍出令人滿意的畫面。但直播的聲音一直是個大問題,手機上的麥克風跟人耳所聽到的聲音其實有很大的差距,在比較大的場子裡,光是仰賴內建麥克風的收音多半無法有令人滿意的效果。 在大型的活動中,現場通常會有 PA 系統,最理想的方式還是想辦法將 PA 的訊號餵進 iPad 或 iPhone 中,保證聲音乾淨又清楚,絕對不會有其它有的沒的現場音。 iPhone 的耳機孔雖然可以插帶有麥克風的耳機 (如 Apple 原廠的 EarPods),但它的訊號位準是電容式麥克風的位準。PA 控台的輸出幾乎都是 line level 的,兩者的訊號電壓相差百倍以上,我們得做個小東西來解決這個差距。 Line 與 Mic 在 mixer 上,我們常會看到輸入可以在兩種規格中切換: line level 和 mic level。Mic level 顧名思義就是從麥克風來的訊號,這個訊號的規格是從不需供電的傳統動圈麥克風來的。因為不需供電,所有的訊號都來自於聲壓