第2关:Select 操作


任务描述

本关任务:按照编程要求执行相应的select操作。

相关知识

为了完成本关任务,你需要掌握:1. select语法格式;2. 常用的select语法。

select 语法格式

Hive select操作的语法与SQL-92规范几乎没有区别,其格式语法为:

SELECT [ALL | DISTINCT] select_expr,select_expr,… FROM table_reference
[WHERE where_condition] [GROUP BY col_list] [CLUSTER BY col_list | [DISTRIBUTE BY col_list] [SORT BY col_list]] [LIMIT number]
select 与各种属性的组合
  • 简单的select查询操作,如下面的查询操作返回students表中所有的行和列:
    hive> select * from students;
  • WHERE子句的select条件查询操作,返回满足WHERE指定条件的行。如下面的查询操作返回用户信息表users中的年龄大于10岁且国籍为中国的所有用户:
hive>  select * from users where age > 10 and state = "China";
  • ALLDISTINCT关键字的查询操作作用于确定是否返回重复的行,默认为ALL,即select查询返回重复的行:
hive>  select coll,coll2 from t1;
      1 3
      1 3
      1 4
      2 5
hive>  select distinct coll,coll2 from t1;
      1 3
      1 4
      2 5
hive>  select distinct coll from t1;
      1
      2
  • HAVING关键字的查询操作用于代替复杂的子查询操作 如查询操作:
    hive> select coll from (select coll,sum(col2) as col2sum from t1 group by coll) t2 where t2.col2sum > 10;可以替换为:
hive>  select coll from t1 group by col1 having sum(col2) >10;
  • LIMIT关键字的查询操作用于返回指定数目的满足条件的行(常用于返回Top k 问题)。返回满足条件的5条记录,返回结果为从满足条件的记录中随机选取5条。
    select * from t1 limit 5;
  • Top k问题,返回满足条件的列按col1降序排列的前5条记录:
    select * from t1 sort by col1 desc limit 5;
编程要求

test2数据库中student表结构为:

INFO

TYPE

COMMENT

Sno

INT

student sno

name

STRING

student name

age

INT

student age

sex

STRING

student sex

score

STRUCT <Chinese:FLOAT,Math:FLOAT,English:FLOAT>

student score

表中的数据为:

[外链图片转存失败,源站可能有防盗链机制,建议将图片保存下来直接上传(img-UVxgJAba-1666387170509)(https://data.educoder.net/api/attachments/298456)]

  • 切换到test2数据库;
  • 查询student表中所有的行和列;
  • 查询年龄age > 17的女生female
  • 查询语文成绩Chinese > 90的记录;
  • student表中查询前3条记录;
  • 返回按年龄降序的前2条记录。
按照以上要求填写命令。每个要求对应一条命令,共6条命令,以;隔开。
由于hive启动时间较长,测评时请耐心等待,大概需要时间:1-2分钟。
测试说明

平台会对你编写的命令进行测试:

若操作成功,会显示如下信息:

1    Xiaohong    18    female    {"chinese":96.0,"math":88.0,"english":90.5}
2    Xiaoliang    17    male    {"chinese":95.0,"math":88.0,"english":93.5}
3    Xiaoming    19    male    {"chinese":86.5,"math":98.0,"english":91.0}
4    Xiaoguang    18    male    {"chinese":88.0,"math":80.0,"english":94.0}
5    Xiaohua    16    female    {"chinese":97.0,"math":58.5,"english":88.0}
1    Xiaohong    18    female    {"chinese":96.0,"math":88.0,"english":90.5}
1    Xiaohong    18    female    {"chinese":96.0,"math":88.0,"english":90.5}
2    Xiaoliang    17    male    {"chinese":95.0,"math":88.0,"english":93.5}
5    Xiaohua    16    female    {"chinese":97.0,"math":58.5,"english":88.0}
1    Xiaohong    18    female    {"chinese":96.0,"math":88.0,"english":90.5}
2    Xiaoliang    17    male    {"chinese":95.0,"math":88.0,"english":93.5}
3    Xiaoming    19    male    {"chinese":86.5,"math":98.0,"english":91.0}
3    Xiaoming    19    male    {"chinese":86.5,"math":98.0,"english":91.0}
4    Xiaoguang    18    male    {"chinese":88.0,"math":80.0,"english":94.0}

说明:

  • 1-5行返回的是查询student表中所有的行和列的结果
  • 6行返回的是查询年龄age > 17的女生female的结果
  • 7-9行返回的是查询语文成绩Chinese > 90的记录的结果
  • 10-12行返回的是从student表中查询前3条记录的结果
  • 12-13行返回的是返回按年龄降序的前2条记录的结果

开始你的任务吧,祝你成功!


--Begin
USE test2;
select * from student;
select * from student where age > 17 and sex = "female";
select * from student where score.Chinese > 90;
select * from student limit 3;
select * from student sort by age desc limit 2;

--End