使用流收集数据

原创

一个爱听音乐的程序员 2020-10-30 17:39:00 ©著作权

©著作权归作者所有：来自51CTO博客作者一个爱听音乐的程序员的原创作品，请联系作者获取转载授权，否则将追究法律责任

在上一节中，我们了解到终端操作collect方法用于收集流中的元素，并放到不同类型的结果中，比如List、Set或者Map。其实collect方法可以接受各种Collectors接口的静态方法作为参数来实现更为强大的规约操作，比如查找最大值最小值，汇总，分区和分组等等。

准备工作

为了演示Collectors接口中的静态方法的使用，这里创建一个Dish类（菜谱类）：

 /**
 * @author BNTang
 */
public class Dish {

    public enum Type {MEAT, FISH, OTHER}

    /**
     * 食物名称
     */
    private final String name;

    /**
     * 是否是素食
     */
    private final boolean vegetarian;
    
    /**
     * 卡路里
     */
    private final int calories;

    /**
     * 类型：肉，海鲜，其他
     */
    private final Type type;

    public Dish(String name, boolean vegetarian, int calories, Type type) {
        this.name = name;
        this.vegetarian = vegetarian;
        this.calories = calories;
        this.type = type;
    }

    @Override
    public String toString() {
        return this.getName();
    }
    
    // get方法略
}

然后创建一个List集合，包含各种食材：

使用流收集数据_Java 8

List<Dish> list = Arrays.asList(
        new Dish("pork", false, 800, Dish.Type.MEAT),
        new Dish("beef", false, 700, Dish.Type.MEAT),
        new Dish("chicken", false, 400, Dish.Type.MEAT),
        new Dish("french fries", true, 530, Dish.Type.OTHER),
        new Dish("rice", true, 350, Dish.Type.OTHER),
        new Dish("season fruit", true, 120, Dish.Type.OTHER),
        new Dish("pizza", true, 550, Dish.Type.OTHER),
        new Dish("prawns", false, 300, Dish.Type.FISH),
        new Dish("salmon", false, 450, Dish.Type.FISH));

在测试类中导入所有Collectors接口的静态方法：

使用流收集数据_fish_02

import static java.util.stream.Collectors.*;

规约与汇总

最大最小值

Collectors.maxBy和Collectors.minBy用来计算流中的最大或最小值，比如按卡路里的大小来筛选出卡路里最高的食材：

/**
 * @author BNTang
 */
public class Demo {
    public static void main(String[] args) {
        List<Dish> list = Arrays.asList(
                new Dish("pork", false, 800, Dish.Type.MEAT),
                new Dish("beef", false, 700, Dish.Type.MEAT),
                new Dish("chicken", false, 400, Dish.Type.MEAT),
                new Dish("french fries", true, 530, Dish.Type.OTHER),
                new Dish("rice", true, 350, Dish.Type.OTHER),
                new Dish("season fruit", true, 120, Dish.Type.OTHER),
                new Dish("pizza", true, 550, Dish.Type.OTHER),
                new Dish("prawns", false, 300, Dish.Type.FISH),
                new Dish("salmon", false, 450, Dish.Type.FISH));

        list.stream()
                .collect(maxBy(Comparator.comparingInt(Dish::getCalories)))
                .ifPresent(System.out::println);
    }
}

结果输出为pork。

汇总

Collectors.summingInt可以用于求和，参数类型为int类型。相应的基本类型对应的方法还有Collectors.summingLong和Collectors.summingDouble。比如求所有食材的卡路里：

/**
 * @author BNTang
 */
public class Demo {
    public static void main(String[] args) {
        List<Dish> list = Arrays.asList(
                new Dish("pork", false, 800, Dish.Type.MEAT),
                new Dish("beef", false, 700, Dish.Type.MEAT),
                new Dish("chicken", false, 400, Dish.Type.MEAT),
                new Dish("french fries", true, 530, Dish.Type.OTHER),
                new Dish("rice", true, 350, Dish.Type.OTHER),
                new Dish("season fruit", true, 120, Dish.Type.OTHER),
                new Dish("pizza", true, 550, Dish.Type.OTHER),
                new Dish("prawns", false, 300, Dish.Type.FISH),
                new Dish("salmon", false, 450, Dish.Type.FISH));

        Integer count = list.stream().collect(summingInt(Dish::getCalories));
        System.out.println(count);
    }
}

Collectors.averagingInt方法用于求平均值，参数类型为int类型。相应的基本类型对应的方法还有Collectors.averagingLong和Collectors.averagingDouble。比如求所有食材的平均卡路里：

/**
 * @author BNTang
 */
public class Demo {
    public static void main(String[] args) {
        List<Dish> list = Arrays.asList(
                new Dish("pork", false, 800, Dish.Type.MEAT),
                new Dish("beef", false, 700, Dish.Type.MEAT),
                new Dish("chicken", false, 400, Dish.Type.MEAT),
                new Dish("french fries", true, 530, Dish.Type.OTHER),
                new Dish("rice", true, 350, Dish.Type.OTHER),
                new Dish("season fruit", true, 120, Dish.Type.OTHER),
                new Dish("pizza", true, 550, Dish.Type.OTHER),
                new Dish("prawns", false, 300, Dish.Type.FISH),
                new Dish("salmon", false, 450, Dish.Type.FISH));

        Double num = list.stream().collect(averagingInt(Dish::getCalories));
        System.out.println(num);
    }
}

Collectors.summarizingInt方法可以一次性返回元素的个数，最大值，最小值，平均值和总和：

/**
 * @author BNTang
 */
public class Demo {
    public static void main(String[] args) {
        List<Dish> list = Arrays.asList(
                new Dish("pork", false, 800, Dish.Type.MEAT),
                new Dish("beef", false, 700, Dish.Type.MEAT),
                new Dish("chicken", false, 400, Dish.Type.MEAT),
                new Dish("french fries", true, 530, Dish.Type.OTHER),
                new Dish("rice", true, 350, Dish.Type.OTHER),
                new Dish("season fruit", true, 120, Dish.Type.OTHER),
                new Dish("pizza", true, 550, Dish.Type.OTHER),
                new Dish("prawns", false, 300, Dish.Type.FISH),
                new Dish("salmon", false, 450, Dish.Type.FISH));

        IntSummaryStatistics iss = list.stream().collect(summarizingInt(Dish::getCalories));
        System.out.println(iss);
    }
}

同样，相应的summarizingLong和summarizingDouble方法有相关的LongSummaryStatistics和DoubleSummaryStatistics类型，适用于收集的属性是原始类型long或double的情况。

拼接

Collectors.joining方法会把流中每一个对象应用toString方法得到的所有字符串连接成一个字符串。如：

/**
 * @author BNTang
 */
public class Demo {
    public static void main(String[] args) {
        List<Dish> list = Arrays.asList(
                new Dish("pork", false, 800, Dish.Type.MEAT),
                new Dish("beef", false, 700, Dish.Type.MEAT),
                new Dish("chicken", false, 400, Dish.Type.MEAT),
                new Dish("french fries", true, 530, Dish.Type.OTHER),
                new Dish("rice", true, 350, Dish.Type.OTHER),
                new Dish("season fruit", true, 120, Dish.Type.OTHER),
                new Dish("pizza", true, 550, Dish.Type.OTHER),
                new Dish("prawns", false, 300, Dish.Type.FISH),
                new Dish("salmon", false, 450, Dish.Type.FISH));

        String str = list.stream().map(Dish::getName).collect(joining());
        System.out.println(str);
    }
}

内部拼接其实采用了StringBuilder。除此之外，也可以指定拼接符：

/**
 * @author BNTang
 */
public class Demo {
    public static void main(String[] args) {
        List<Dish> list = Arrays.asList(
                new Dish("pork", false, 800, Dish.Type.MEAT),
                new Dish("beef", false, 700, Dish.Type.MEAT),
                new Dish("chicken", false, 400, Dish.Type.MEAT),
                new Dish("french fries", true, 530, Dish.Type.OTHER),
                new Dish("rice", true, 350, Dish.Type.OTHER),
                new Dish("season fruit", true, 120, Dish.Type.OTHER),
                new Dish("pizza", true, 550, Dish.Type.OTHER),
                new Dish("prawns", false, 300, Dish.Type.FISH),
                new Dish("salmon", false, 450, Dish.Type.FISH));

        String str = list.stream().map(Dish::getName).collect(joining(", "));
        System.out.println(str);
    }
}

reducing

Collectors.reducing方法可以实现求和，最大值最小值的筛选，拼接等操作。上面介绍的方法在编程上更方便快捷，但reducing的可读性更高，实际使用哪种我觉得还是看个人喜好。举个使用reducing求最大值的例子：

/**
 * @author BNTang
 */
public class Demo {
    public static void main(String[] args) {
        List<Dish> list = Arrays.asList(
                new Dish("pork", false, 800, Dish.Type.MEAT),
                new Dish("beef", false, 700, Dish.Type.MEAT),
                new Dish("chicken", false, 400, Dish.Type.MEAT),
                new Dish("french fries", true, 530, Dish.Type.OTHER),
                new Dish("rice", true, 350, Dish.Type.OTHER),
                new Dish("season fruit", true, 120, Dish.Type.OTHER),
                new Dish("pizza", true, 550, Dish.Type.OTHER),
                new Dish("prawns", false, 300, Dish.Type.FISH),
                new Dish("salmon", false, 450, Dish.Type.FISH));

        Integer num = list.stream().collect(reducing(0, Dish::getCalories, Integer::max));
        System.out.println(num);
    }
}

或者：

Integer num = list.stream().map(Dish::getCalories).collect(reducing(0, Integer::max));
System.out.println(num);

分组

分组功能类似于SQL里的group by，可以对流中的元素按照指定分组规则进行分组。

普通分组

Collectors.groupingBy方法可以轻松的完成分组操作。比如现在对List中的食材按照类型进行分组，我这里就不在反复的粘贴一些重复的代码了，我接下来只给出改动了的代码：

Map<Dish.Type, List<Dish>> dishesByType = list.stream().collect(groupingBy(Dish::getType));
System.out.println(dishesByType);

我们也可以自定义分组规则，比如按照卡路里的高低，可以分为高热量，和正常和低热量：

首先定义一个卡路里高低的枚举类型

使用流收集数据_静态方法_03

public enum CaloricLevel {DIET, NORMAL, FAT}

然后编写分组规则：

Map<CaloricLevel, List<Dish>> dishesByCalories = list.stream().collect(
        groupingBy(d -> {
            if (d.getCalories() <= 400) {
                return CaloricLevel.DIET;
            } else if (d.getCalories() <= 700) {
                return CaloricLevel.NORMAL;
            } else {
                return CaloricLevel.FAT;
            }
        })
);
System.out.println(dishesByCalories);

多级分组

Collectors.groupingBy支持嵌套来实现多级分组，比如将食材按照类型分类，然后再按照卡路里的高低分类：

Map<Dish.Type, Map<CaloricLevel, List<Dish>>> dishesGroup = list.stream().collect(
        groupingBy(Dish::getType, groupingBy(d -> {
                    if (d.getCalories() <= 400) {
                        return CaloricLevel.DIET;
                    } else if (d.getCalories() <= 700) {
                        return CaloricLevel.NORMAL;
                    } else {
                        return CaloricLevel.FAT;
                    }
                })
        ));
System.out.println(dishesGroup);

返回结果是一个二级Map，实际上，第二个参数除了Collectors.groupingBy外，也可以传递其他规约操作，规约的结果类型对应Map里的第二个泛型。举些例子，将食材按照类型分，然后统计各个类型对应的数量：

Map<Dish.Type, Long> dishesCountByType = list.stream().collect(groupingBy(Dish::getType, counting()));
System.out.println(dishesCountByType);

因为Collectors.counting方法返回Long类型，所以Map第二个泛型也必须指定为Long。输出结果：{OTHER=4, FISH=2, MEAT=3}。

或者对食材按照类型分，然后选出卡路里最高的食物：

Map<Dish.Type, Optional<Dish>> map = list.stream().collect(groupingBy(
        Dish::getType, maxBy(Comparator.comparing(Dish::getCalories))
));
System.out.println(map);

输出结果：{FISH=Optional[salmon], OTHER=Optional[pizza], MEAT=Optional[pork]}。如果不希望输出结果包含Optional，可以使用Collectors.collectingAndThen方法：

Map<Dish.Type, Dish> map = list.stream().collect(groupingBy(
        Dish::getType, collectingAndThen(maxBy(Comparator.comparing(Dish::getCalories)), Optional::get)
));
System.out.println(map);

输出结果：{OTHER=pizza, FISH=salmon, MEAT=pork}。

常与Collectors.groupingBy组合使用的方法还有Collectors.mapping。Collectors.mapping方法接受两个参数：一个函数对流中的元素做变换，另一个则将变换的结果对象收集起来，比如对食材按照类型分类，然后输出各种类型食材下卡路里等级情况：

Map<Dish.Type, HashSet<CaloricLevel>> map = list.stream().collect(groupingBy(
        Dish::getType, mapping(
                d -> {
                    if (d.getCalories() <= 400) {
                        return CaloricLevel.DIET;
                    } else if (d.getCalories() <= 700) {
                        return CaloricLevel.NORMAL;
                    } else {
                        return CaloricLevel.FAT;
                    }
                }, toCollection(HashSet::new)
        )
));
System.out.println(map);

输出结果：{MEAT=[NORMAL, DIET, FAT], OTHER=[NORMAL, DIET], FISH=[NORMAL, DIET]}。Collectors.toCollection方法可以方便的构造各种类型的集合。

分区

分区类似于分组，只不过分区最多两种结果。Collectors.partitioningBy方法用于分区操作，接收一个Predicate<T>类型的Lambda表达式作为参数。比如将食材按照素食与否分类：

Map<Boolean, List<Dish>> map = list.stream().collect(partitioningBy(Dish::isVegetarian));
System.out.println(map);

输出结果：{false=[pork, beef, chicken, prawns, salmon], true=[french fries, rice, season fruit, pizza]}。

Collectors.partitioningBy方法还支持传入分组函数或者其他规约操作，比如将食材按照素食与否分类，然后按照食材类型进行分类：

Map<Boolean, Map<Dish.Type, List<Dish>>> map = list.stream().collect(
        partitioningBy(Dish::isVegetarian, groupingBy(Dish::getType)));
System.out.println(map);

输出结果：{false={MEAT=[pork, beef, chicken], FISH=[prawns, salmon]}, true={OTHER=[french fries, rice, season fruit, pizza]}}。

如再将食材按照素食与否分类，然后筛选出各自类型中卡路里含量最低的食材：

Map<Boolean, Dish> map = list.stream().collect(
        partitioningBy(Dish::isVegetarian, collectingAndThen(
                minBy(Comparator.comparing(Dish::getCalories)), Optional::get
        )));
System.out.println(map);