============================================= Java読書会BOF 「Hadoop 第2版」を読む会 第7回 ============================================= .. csv-table:: 開催概要 "日時", "2012年4月14日(土) 10:00 - 17:00" "場所", "川崎市産業振興会館 第2会議室" "出席者(敬称略)", "高橋(徹)、村山、岩室、今井、門脇、吉本、小棚木(書記)" "会場費", "300円/ひとり" 議事 ==== 11章 Pig ------------------------- 11.7 実践Pig ------------ 11.7.1 並列性 ``````````````````````` 11.7.2 パラメータ置換 ``````````````````````` 11.7.2.1 動的パラメータ ``````````````````````` * バッククォートで置換がWIndowsではどうなるのか気になる。 11.7.2.2 パラメータ置換の処理 ````````````````````````````` 12章 Hive ---------- 12.1 Hiveのインストール ----------------------- 12.1.1 Hiveシェル `````````````````` 12.2 例 --------------- * where句に in が使えるといいな。 例: WHERE temperature != 9999 AND (quality = 0 OR quality = 1 OR quality = 4 OR quality = 5 OR quality =9) -> WHERE temperature != 9999 AND quality IN (0, 1, 4, 5, 9) 12.3 Hiveの実行 ---------------- 12.3.1 Hiveの設定 `````````````````` 12.3.1.1 ロギング `````````````````` 12.3.2 Hiveのサービス ````````````````````` * 【誤記】 囲み記事のANT_LIBのパス名: /opt/and/lib -> /opt/ant/lib 12.3.2.1 Hiveのクライアント ``````````````````````````` 12.3.3 メタストア ````````````````` 12.4 伝統的なデータベースとの比較 --------------------------------- 12.4.1 読み込み時のスキーマ適用と書き込み時のスキーマ適用の比較 ``````````````````````````````````````````````````````````````` 12.4.2 更新、トランザクション、インデックス ``````````````````````````````````````````` 12.5 HiveQL ------------- * 表12-2のデータ型の「一時」とは何? -> 「日時」の間違いではないか。 * 【誤記】表示12-2 SQLのデータ型: 一時 -> 日時 12.5.1 データ型 ```````````````` 12.5.1.1 プリミティブ型 ``````````````````````` 12.5.1.2 変換 ``````````````` 12.5.1.3 複合型 ```````````````` 12.5.2 演算子と関数 ``````````````````` 12.6 テーブル -------------- 12.6.1 管理対象テーブルと外部テーブル ````````````````````````````````````` 12.6.2 パーティションとバケット ``````````````````````````````` 12.6.2.1 パーティション ```````````````````````` 12.6.2.2 バケット ``````````````````` 12.6.3 ストレージフォーマット ````````````````````````````` 12.6.3.1 でフォルトのストレージフォーマット:デリミタで区切られたテキスト ````````````````````````````````````````````````````````````````````````` 12.6.3.2 バイナリストレージフォーマット:SequenceFileとRCFile `````````````````````````````````````````````````````````````` 12.6.3.3 例:RegexSerDe ````````````````````````` 12.6.4 データのインポート ````````````````````````` 12.6.4.1 INSERT OVERWRITE TABLE `````````````````````````````````` 12.6.4.2 複数テーブルへのインサート ``````````````````````````````````` 12.6.4.3 CREATE TABLE ... AS SELECT `````````````````````````````````````` 12.6.5 テーブルの修正 ````````````````````` 12.6.6 テーブルのドロップ ````````````````````````` 12.7. データのクエリ --------------------- 12.7.1 ソートと集計 ```````````````````` 12.7.2 MapReduceのスクリプト ````````````````````````````` 12.7.3 結合 ``````````` 12.7.3.1 内部結合 `````````````````` 12.7.3.2 外部結合 ````````````````` 12.7.3.3 セミ結合 ````````````````` 12.7.3.4 map結合 ```````````````` 12.7.4 サブクエリ `````````````````` 12.7.5 ビュー ````````````` 12.8 ユーザー定義関数 --------------------- * 【誤記】 12.8.1の直前の3行目と2行目: LITERAL VIEW -> LATERAL VIEW 12.8.1 UDFの書き方 ``````````````````` 12.8.2 UDAFの書き方 ``````````````````` 12.8.2.1 より複雑なUDAF ``````````````````````` 13章 HBase ---------- 13.1 HBaseの基礎 ----------------- 13.1.1 背景 ``````````` 13.2 概念 --------- 13.2.1 データモデルに関する弾丸ツアー ````````````````````````````````````` 13.2.1.1 領域 ````````````` * 領域は原文では regions。 13.2.1.2 ロック ``````````````` 13.2.2 実装 ``````````` 13.2.2.1 HBaseの操作 ````````````````````` 13.3 インストール ----------------- 13.3.1 試運転 ````````````` 13.4 クライアント ----------------- 13.4.1 Java ```````````` 13.4.1.1 MapReduce `````````````````` 13.4.2 Avro, REST, Thrift ````````````````````````` 13.4.2.1 REST ````````````` 13.4.2.2 Thrift ``````````````` 13.4.2.3 Avro ````````````` 13.5 例 ------- 13.5.1 スキーマ ``````````````` 13.5.2 データのロード ````````````````````` .. note:: 次回は、p.456 13.5.2.1 最適化のための注意事項の数行前(直前)より