javacv FFmpegFrameRecorder 设置分辨率 ffmpeg修改分辨率

转载

mob64ca1418aeab 2024-06-11 20:15:33

文章标签 file url buffer input struct 文章分类 Java 后端开发

先吐一下槽，Android下的多媒体框架真不好搞，感觉还是微软的DirectShow框架更容易扩展。如果有人知道其它更好的完成下面任务的方法，请告知我。

任务是在现有的音视频格式上做一层包装/加密，打开这种新格式文件时文件流要先经过一层自定义的解包处理，再把解包后的文件流交由解码器处理。

任务看似简单，在DirectShow框架中只需要做个源过滤器，随后即可调用现有的解码器等设施成功解码。但Android上就挣扎了。现有的App框架完全满足不了需求（或许是我没找到），使用opencore又似乎太庞大而且工作量不小，最终选择了FFMPEG。

通过FFMPEG完成上面的任务主要途径有两种：一是修改打开文件读文件流识别解码器部分和解码器读文件流部分；二是重新写一个解码器（这似乎和在opencore中实现一个解码器相当）。

分析一下打开文件读文件流识别解码器的部分吧，入口函数是av_open_input_file，其最重要的作用就是填充好AVFormatContext结构。

AVFormatContext结构是解析文件格式、找到对应的解码器的重要的结构之一。其定义略长，在此简述。

struct AVInputFormat *iformat;

struct AVOutputFormat *oformat;

ByteIOContext *pb;

iformat和oformat两者只有一个会生效，如果是播放iformat生效，录制则oformat生效。当然还有转码的过程中，输入iformat生效，输出oformat生效。

pb是对文件流操作的一个抽象，接下来讲解。

FFMPEG打开文件后会读一段文件流，根据文件流中的内容来找到一个对应AVInputFormat，即一种文件格式的抽象。比如mp4，它的AVInputFormat定义在/libffmpeg/libavformat/mov.c中：

AVInputFormat mov_demuxer = {
    "mov,mp4,m4a,3gp,3g2,mj2",
    NULL_IF_CONFIG_SMALL("QuickTime/MPEG-4/Motion JPEG 2000 format"),
    sizeof(MOVContext),
    mov_probe,
    mov_read_header,
    mov_read_packet,
    mov_read_close,
    mov_read_seek,
};

AVInputFormat定义在/libffmpeg/libavformat/avformat.h中，其定义略长，在此简述。

const char *name; // 格式别名

const char *long_name; // 格式的全名，方便人阅读

int priv_data_size; // 额外数据长度

int (*read_probe)(AVProbeData *); // 读取适配数据的函数指针

... // 其它函数指针

FFMPEG努力用C来实现面向对象，其中结构体中含函数指针的写法非常常见。所有格式的AVInputFormat（还有AVOutput协议）都会在/libffmpeg/libavformat/Allformats.c的av_register_all函数中注册，形成一个以

/** head of registered input format linked list */
AVInputFormat *first_iformat = NULL;

为链表头的链表。ff_probe_input_buffer会通过ByteIOContext读取一段适配数据然后调用av_probe_input_format2找到正确的AVInputFormat（即通过遍历first_iformat）。

那么实现上面的任务通过修改ff_probe_input_buffer，在其中先解包，再让其调用av_probe_input_format2即可以达到部分欺骗FFMPEG的目的。但事实并不那么容易，找到正确的AVInputFormat后FFMPEG还会在av_open_input_stream中调用它的read_header来读取文件头部信息。这下麻烦大了，如果包装的文件原来是mp4格式的，那么得改mp4的AVInputFormat的read_header函数，如果是rmvb的……这个复杂性就不用说了。

回过头来看FFMPEG读文件流的过程，依赖于ByteIOContext，在url_fopen--->url_fdopen中生成了该对象并通过

if (init_put_byte(*s, buffer, buffer_size,
                      (h->flags & URL_WRONLY || h->flags & URL_RDWR), h,
                      url_read, url_write, url_seek) < 0)

设置了其内容。ByteIOContext定义如下：

/**
 * Bytestream IO Context.
 * New fields can be added to the end with minor version bumps.
 * Removal, reordering and changes to existing fields require a major
 * version bump.
 * sizeof(ByteIOContext) must not be used outside libav*.
 */
typedef struct {
    unsigned char *buffer;
    int buffer_size;
    unsigned char *buf_ptr, *buf_end;
    void *opaque;
    int (*read_packet)(void *opaque, uint8_t *buf, int buf_size);
    int (*write_packet)(void *opaque, uint8_t *buf, int buf_size);
    int64_t (*seek)(void *opaque, int64_t offset, int whence);
    int64_t pos; /**< position in the file of the current buffer */
    int must_flush; /**< true if the next seek should flush */
    int eof_reached; /**< true if eof reached */
    int write_flag;  /**< true if open for writing */
    int is_streamed;
    int max_packet_size;
    unsigned long checksum;
    unsigned char *checksum_ptr;
    unsigned long (*update_checksum)(unsigned long checksum, const uint8_t *buf, unsigned int size);
    int error;         ///< contains the error code or 0 if no error happened
    int (*read_pause)(void *opaque, int pause);
    int64_t (*read_seek)(void *opaque, int stream_index,
                         int64_t timestamp, int flags);
	void *URLContext;
} ByteIOContext;

通过分析可以看出，url_read就是其读取文件流的方式。这里注意到url_read的实现：

int url_read(URLContext *h, unsigned char *buf, int size)
{
    int ret;
    if (h->flags & URL_WRONLY)
        return AVERROR(EIO);
    ret = h->prot->url_read(h, buf, size);
    return ret;
}

可见读取文件流的过程又与具体的协议内容（URLContext）有关。其实在这里已经能够完成任务了，如下：

int fd = h->prot->url_get_file_handle(h);    // 其实就是h->priv_data
long pos = lseek(fd, 0, SEEK_CUR);
__android_log_print(ANDROID_LOG_INFO, TAG, "url_read ftell pos: %d", pos);

pos代表文件流当前的位置，这样在哪个位置如何解包就一清二楚了。

下面还是简单说说URLContext与URLProtolcol。URLContext的定义如下：

/**
 * URL Context.
 * New fields can be added to the end with minor version bumps.
 * Removal, reordering and changes to existing fields require a major
 * version bump.
 * sizeof(URLContext) must not be used outside libav*.
 */
typedef struct URLContext {
#if LIBAVFORMAT_VERSION_MAJOR >= 53
    const AVClass *av_class; ///< information for av_log(). Set by url_open().
#endif
    struct URLProtocol *prot;
    int flags;
    int is_streamed;  /**< true if streamed (no seek possible), default = false */
    int max_packet_size;  /**< if non zero, the stream is packetized with this max packet size */
    void *priv_data;
    char *filename; /**< specified URL */
} URLContext;

其中最重要的就是URLProtocol了，打开文件的过程中会根据文件的URL生成对应的URLProtocol及URLContext，才有了后面url_read中的ret = h->prot->url_read(h, buf, size);这儿的调用。URLProtocol才是真正负责协议中文件流数据读取的底层数据结构。下面举file协议（即本地文件）的例子。

URLProtocol file_protocol = {
    "file",
    file_open,
    file_read,
    file_write,
    file_seek,
    file_close,
    .url_get_file_handle = file_get_handle,
};

其中file_read：

static int file_read(URLContext *h, unsigned char *buf, int size)
{
    int fd = (intptr_t) h->priv_data;
    return read(fd, buf, size);
}

可以看到了，就是Linux中read读文件。

本文章为转载内容，我们尊重原作者对文章享有的著作权。如有内容错误或侵权问题，欢迎原作者联系我们进行内容更正或删除文章。

上一篇：删除数据后索引节点内部还连续吗删除索引的sql

下一篇：数据库的AUTO_INCREMENT什么意思数据库中是什么意思

提问和评论都可以，用心的回复会被更多人看到评论

发布评论

相关文章

官方博客	全部文章	热门标签	班级博客
了解我们	网站地图	意见反馈

鸿蒙开发者社区	51CTO学堂
51CTO	软考资讯

javacv FFmpegFrameRecorder 设置分辨率 ffmpeg修改分辨率

javacv FFmpegFrameRecorder 设置分辨率 ffmpeg修改分辨率

51CTO博客