从写filter步骤和注意事项

2012-12-21

自写filter步骤和注意事项hbase本身提供了很多filter来实现服务器端过滤的功能，诸如Filter, FilterBase, C

自写filter步骤和注意事项

hbase本身提供了很多filter来实现服务器端过滤的功能，诸如

Filter, FilterBase, CompareFilter;

一些Comparators类;

Comparison Filters;

Dedicated Filters;

Decorating Filters;

FilterList;

Custom Filters。

但是这些filter往往不能满足我们的实际需要，因此我们可以针对自己需要来自写filter，然后加到hbase集群中。

主要步骤：

1.新建类，继承自FilterBase或WritableByteArrayComparable，需求不同，需要创建的类不同，不详述

2.将相关类打成jar包

3.将jar包copy到hbase集群中各机器的 $HBASE_HOME/lib 目录下，并重启集群

4.在客户端开发时，添加此jar包到路经

OK，可以使用了。

注意事项：

1.新建类中，需要添加常量 LOG. 否则出现

can't seal package org.apache.hadoop.hbase.filter: already loaded

异常提示

2.必须将jar包copy到hbase集群中各机器的 $HBASE_HOME/lib 目录下，并重启hbase集群

参考代码

如下（网友写的，帮忙解决了点问题）：

/** * Copyright 2008 The Apache Software Foundation * * Licensed to the Apache Software Foundation (ASF) under one * or more contributor license agreements.  See the NOTICE file * distributed with this work for additional information * regarding copyright ownership.  The ASF licenses this file * to you under the Apache License, Version 2.0 (the * "License"); you may not use this file except in compliance * with the License.  You may obtain a copy of the License at * *     http://www.apache.org/licenses/LICENSE-2.0 * * Unless required by applicable law or agreed to in writing, software * distributed under the License is distributed on an "AS IS" BASIS, * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. * See the License for the specific language governing permissions and * limitations under the License. */package org.apache.hadoop.hbase.filter;import java.io.DataInput;import java.io.DataOutput;import java.io.IOException;import org.apache.commons.logging.Log;import org.apache.commons.logging.LogFactory;import org.apache.hadoop.hbase.KeyValue;import org.apache.hadoop.hbase.filter.FilterBase;/** * A filter, based on the PageFilter, takes two arguments: limit and offset. * This filter can be used for specified row count, in order to efficient * lookups and paginated results for end users. *  */public class RowPaginationFilter extends FilterBase {static final Log LOG = LogFactory.getLog(RowPaginationFilter.class);private int rowsAccepted = 0;private int limit = 0;private int offset = 0;/** * Default constructor, filters nothing. Required though for RPC * deserialization. */public RowPaginationFilter() {}/** * Constructor that takes a maximum page size. *  * get row from offset to offset+limit ( offset<= row<=offset+limit ) * @param offset start position * @param limit count from offset position */public RowPaginationFilter(final int offset, final int limit) {this.offset = offset;this.limit = limit;}@Overridepublic void reset() {// noop}@Overridepublic boolean filterAllRemaining() {return this.rowsAccepted > this.limit+this.offset;}@Overridepublic boolean filterRowKey(byte[] rowKey, int offset, int length) {return false;}@Overridepublic void readFields(final DataInput in) throws IOException {this.offset = in.readInt();this.limit = in.readInt();}@Overridepublic void write(final DataOutput out) throws IOException {out.writeInt(offset);out.writeInt(limit);}@Overridepublic ReturnCode filterKeyValue(KeyValue v) {return ReturnCode.INCLUDE;}//true to exclude row, false to include row.@Overridepublic boolean filterRow() {boolean isExclude = this.rowsAccepted < this.offset || this.rowsAccepted>=this.limit+this.offset;rowsAccepted++;return isExclude;}}

热点排行

云计算

从写filter步骤和注意事项