読書会(Hadoop 第2版)第7回議事録

[ 戻る ]


=============================================
Java読書会BOF 「Hadoop 第2版」を読む会 第7回
=============================================

.. csv-table:: 開催概要

   "日時", "2012年4月14日(土) 10:00 - 17:00"
   "場所", "川崎市産業振興会館 第2会議室"
   "出席者(敬称略)", "高橋(徹)、村山、岩室、今井、門脇、吉本、小棚木(書記)"
   "会場費", "300円/ひとり"

議事
====

11章 Pig
-------------------------

11.7 実践Pig
------------

11.7.1 並列性
```````````````````````

11.7.2 パラメータ置換
```````````````````````

11.7.2.1 動的パラメータ
```````````````````````
* バッククォートで置換がWIndowsではどうなるのか気になる。

11.7.2.2 パラメータ置換の処理
`````````````````````````````

12章 Hive
----------

12.1 Hiveのインストール
-----------------------

12.1.1 Hiveシェル
``````````````````

12.2 例
---------------
* where句に in が使えるといいな。

例:
  WHERE temperature != 9999 AND (quality = 0 OR quality = 1 OR quality = 4 OR quality = 5 OR quality =9)
->
  WHERE temperature != 9999 AND quality IN (0, 1, 4, 5, 9)

12.3 Hiveの実行
----------------

12.3.1 Hiveの設定
``````````````````

12.3.1.1 ロギング
``````````````````

12.3.2 Hiveのサービス
`````````````````````

* 【誤記】 囲み記事のANT_LIBのパス名: /opt/and/lib -> /opt/ant/lib

12.3.2.1 Hiveのクライアント
```````````````````````````

12.3.3 メタストア
`````````````````

12.4 伝統的なデータベースとの比較
---------------------------------

12.4.1 読み込み時のスキーマ適用と書き込み時のスキーマ適用の比較
```````````````````````````````````````````````````````````````

12.4.2 更新、トランザクション、インデックス
```````````````````````````````````````````

12.5 HiveQL
-------------

* 表12-2のデータ型の「一時」とは何?
    -> 「日時」の間違いではないか。

* 【誤記】表示12-2 SQLのデータ型: 一時 -> 日時

12.5.1 データ型
````````````````

12.5.1.1 プリミティブ型
```````````````````````

12.5.1.2 変換
```````````````

12.5.1.3 複合型
````````````````

12.5.2 演算子と関数
```````````````````

12.6 テーブル
--------------

12.6.1 管理対象テーブルと外部テーブル
`````````````````````````````````````

12.6.2 パーティションとバケット
```````````````````````````````

12.6.2.1 パーティション
````````````````````````

12.6.2.2 バケット
```````````````````

12.6.3 ストレージフォーマット
`````````````````````````````

12.6.3.1 でフォルトのストレージフォーマット:デリミタで区切られたテキスト
`````````````````````````````````````````````````````````````````````````

12.6.3.2 バイナリストレージフォーマット:SequenceFileとRCFile
``````````````````````````````````````````````````````````````

12.6.3.3 例:RegexSerDe
`````````````````````````

12.6.4 データのインポート
`````````````````````````

12.6.4.1 INSERT OVERWRITE TABLE
``````````````````````````````````

12.6.4.2 複数テーブルへのインサート
```````````````````````````````````

12.6.4.3 CREATE TABLE ... AS SELECT
``````````````````````````````````````

12.6.5 テーブルの修正
`````````````````````

12.6.6 テーブルのドロップ
`````````````````````````

12.7. データのクエリ
---------------------

12.7.1 ソートと集計
````````````````````

12.7.2 MapReduceのスクリプト
`````````````````````````````

12.7.3 結合
```````````

12.7.3.1 内部結合
``````````````````

12.7.3.2 外部結合
`````````````````

12.7.3.3 セミ結合
`````````````````

12.7.3.4 map結合
````````````````

12.7.4 サブクエリ
``````````````````

12.7.5 ビュー
`````````````

12.8 ユーザー定義関数
---------------------

* 【誤記】 12.8.1の直前の3行目と2行目: LITERAL VIEW -> LATERAL VIEW

12.8.1 UDFの書き方
```````````````````

12.8.2 UDAFの書き方
```````````````````

12.8.2.1 より複雑なUDAF
```````````````````````

13章 HBase
----------

13.1 HBaseの基礎
-----------------

13.1.1 背景
```````````

13.2 概念
---------

13.2.1 データモデルに関する弾丸ツアー
`````````````````````````````````````

13.2.1.1 領域
`````````````
* 領域は原文では regions。

13.2.1.2 ロック
```````````````

13.2.2 実装
```````````

13.2.2.1 HBaseの操作
`````````````````````

13.3 インストール
-----------------

13.3.1 試運転
`````````````

13.4 クライアント
-----------------

13.4.1 Java
````````````

13.4.1.1 MapReduce
``````````````````

13.4.2 Avro, REST, Thrift
`````````````````````````

13.4.2.1 REST
`````````````

13.4.2.2 Thrift
```````````````

13.4.2.3 Avro
`````````````

13.5 例
-------

13.5.1 スキーマ
```````````````

13.5.2 データのロード
`````````````````````

.. note:: 次回は、p.456 13.5.2.1 最適化のための注意事項の数行前(直前)より


[ 戻る ]