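Image Processing Research
Image processing is the field concerned with processing images electronically to extract useful information. Our research addresses a wide range of problems, from 3D human pose estimation to the automatic colorization of black-and-white photographs.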
-
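Globally and Locally Consistent Image Completion
In this work, we propose an image completion method that uses a convolutional neural network to produce results that are consistent with the scene both globally and locally. The proposed completion network is fully convolutional, so it can fill "holes" of arbitrary shape in images of any size. To train the completion network to respect scene consistency, we build a global discriminator network and a local discriminator network, each of which tries to distinguish real images from completed ones. The global discriminator assesses whether the image as a whole looks natural, while the local discriminator assesses the finer-grained consistency of the area around the completed region. Training the completion network to "fool" both discriminators yields completed images that are coherent across the whole scene and also look natural locally. The proposed method produces natural completions across a wide variety of scenes and, unlike previous patch-based approaches, can generate textures and objects that do not appear anywhere in the input image. This makes complex completions possible, such as filling in part of a human face.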
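As a rough illustration of the two discriminator inputs described above, the following sketch pastes a completion result into the hole and then cuts out a patch around the hole. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the toy 4x4 image, the patch size, and the choice to center the crop on the bounding box of the mask are all hypothetical.

```python
def composite(image, completed, mask):
    """Keep known pixels from `image`; take hole pixels (mask == 1) from `completed`."""
    return [
        [c if m else p for p, c, m in zip(img_row, cmp_row, mask_row)]
        for img_row, cmp_row, mask_row in zip(image, completed, mask)
    ]


def hole_center(mask):
    """Center of the bounding box of the hole, as (row, col)."""
    rows = [r for r, row in enumerate(mask) if any(row)]
    cols = [c for row in mask for c, m in enumerate(row) if m]
    if not rows:
        raise ValueError("mask contains no hole")
    return (min(rows) + max(rows)) // 2, (min(cols) + max(cols)) // 2


def local_patch(image, mask, size):
    """Crop a size x size window around the hole, clamped to the image bounds."""
    height, width = len(image), len(image[0])
    if size > height or size > width:
        raise ValueError("patch larger than image")
    center_row, center_col = hole_center(mask)
    top = min(max(center_row - size // 2, 0), height - size)
    left = min(max(center_col - size // 2, 0), width - size)
    return [row[left:left + size] for row in image[top:top + size]]


# Toy 4x4 grayscale image with a 2x2 hole in the lower-right corner.
image = [[1, 1, 1, 1], [1, 1, 1, 1], [1, 1, 0, 0], [1, 1, 0, 0]]
completed = [[9] * 4 for _ in range(4)]  # stand-in for the completion network output
mask = [[0, 0, 0, 0], [0, 0, 0, 0], [0, 0, 1, 1], [0, 0, 1, 1]]

global_input = composite(image, completed, mask)  # whole image, for the global discriminator
local_input = local_patch(global_input, mask, 3)  # hole plus context, for the local discriminator
```

In this toy setup the global input is the full composited image, while the local input covers only the hole and a thin border of surrounding pixels, mirroring the whole-image versus completed-region split described in the text.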
-
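Fully Automatic Colorization of Black-and-White Images
In this work, we propose a method that uses a deep network to automatically convert black-and-white images into color images. The method relies on a novel convolutional network model that takes both global and local image features into account, which lets it produce natural colorizations that respect the structure of the whole image. In the proposed model, global features are extracted from the entire image, while local features are computed from smaller image regions. These features are merged by a "fusion layer" and fed into the colorization network. This architecture does not fix the input size, so images of any size can be used as input. For training, we use an existing large-scale image classification dataset and learn from both the colors and the class label of each image simultaneously, which allows the global features to be learned effectively. The proposed method achieves natural colorization on a wide range of images, including black-and-white photographs taken 100 years ago. We evaluated the results with a user study, in which roughly 90% of the colorizations were judged to look natural.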
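One simple way to merge a single global feature vector with a spatial map of local features is to repeat the global vector at every position and concatenate it along the channel axis. The sketch below assumes that form of fusion; the function name `fuse` and the toy shapes are hypothetical, and any learned layers applied after the concatenation are omitted.

```python
def fuse(local_features, global_feature):
    """Append the global vector to the local feature vector at every position.

    local_features: H x W x C nested lists; global_feature: list of length G.
    Returns an H x W x (C + G) nested list.
    """
    return [
        [list(pixel) + list(global_feature) for pixel in row]
        for row in local_features
    ]


# The same global vector can be fused with local maps of different spatial sizes.
global_feature = [0.5, -1.0]
small = [[[1.0], [2.0]]]                                            # 1 x 2 x 1
large = [[[float(r * 3 + c)] for c in range(3)] for r in range(2)]  # 2 x 3 x 1

fused_small = fuse(small, global_feature)
fused_large = fuse(large, global_feature)
```

Because the global vector is broadcast to whatever spatial size the local map has, this step places no constraint on the input resolution, which is consistent with the size-independence noted in the text.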
-
3D Human Pose Estimation from Monocular Images
This line of research focuses on estimating the 3D pose of humans from single monocular images. This is an extremely difficult problem due to the large number of ambiguities that arise when 3D objects are projected onto the image plane. We consider image evidence obtained by running different detectors for the different parts of the body, which yields noisy 2D estimates whose uncertainty must be compensated for. To deal with these issues, we propose different approaches that use discriminative and generative models to enforce learnt anthropomorphic constraints. We show that by exploiting prior knowledge of human kinematics it is possible to overcome these ambiguities and obtain good pose estimation performance.
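The projection ambiguity mentioned above can be seen with a toy pinhole camera: two 3D points on the same ray through the camera center land on the same image location. The sketch below also shows how a simple kinematic prior, here a known bone length to a parent joint, can rule out one of the candidates. This is only an illustration under assumed values: the focal length, joint coordinates, and the bone-length check are hypothetical and much simpler than the discriminative and generative models described in the text.

```python
import math


def project(point, focal_length=1.0):
    """Pinhole projection of a 3D point (x, y, z) with z > 0 onto the image plane."""
    x, y, z = point
    if z <= 0:
        raise ValueError("point must be in front of the camera")
    return (focal_length * x / z, focal_length * y / z)


# Two hypothetical positions for the same joint, at different depths along one ray.
near = (0.5, 1.0, 2.0)
far = (1.0, 2.0, 4.0)
same_pixel = project(near) == project(far)  # the image alone cannot tell them apart

# A kinematic prior: the joint must lie one bone length away from its parent joint.
parent = (0.5, 0.0, 2.0)  # assumed 3D position of the parent joint
known_bone_length = 1.0   # assumed limb length
plausible = [
    candidate
    for candidate in (near, far)
    if abs(math.dist(parent, candidate) - known_bone_length) < 1e-9
]
```

Here both candidates explain the 2D observation equally well, but only `near` is compatible with the assumed bone length.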
Publications
@InProceedings{LinWACV2024, author = {Shan Lin and Edgar Simo-Serra}, title = {{Restoring Degraded Old Films with Recursive Recurrent Transformer Networks}}, booktitle = "Proceedings of the Winter Conference on Applications of Computer Vision (WACV)", year = 2024, }
@InProceedings{HaoSIGGRAPHASIA2023, author = {Guoqing Hao and Satoshi Iizuka and Kensho Hara and Edgar Simo-Serra and Hirokatsu Kataoka and Kazuhiro Fukui}, title = {{Diffusion-based Holistic Texture Rectification and Synthesis}}, booktitle = "ACM SIGGRAPH Asia 2023 Conference Papers", year = 2023, }
@InProceedings{CarrilloCVPRW2023, author = {Hernan Carrillo and Micha\"el Cl\'ement and Aur\'elie Bugeau and Edgar Simo-Serra}, title = {{Diffusart: Enhancing Line Art Colorization with Conditional Diffusion Models}}, booktitle = "Proceedings of the Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)", year = 2023, }
@InProceedings{SasugaMICCAIW2022, author = {Saeko Sasuga and Akira Kudo and Yoshiro Kitamura and Satoshi Iizuka and Edgar Simo-Serra and Atsushi Hamabe and Masayuki Ishii and Ichiro Takemasa}, title = {{Image Synthesis-based Late Stage Cancer Augmentation and Semi-Supervised Segmentation for MRI Rectal Cancer Staging}}, booktitle = "Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention Workshops (MICCAIW)", year = 2022, }
@InProceedings{YuanCVPRW2021, author = {Mingcheng Yuan and Edgar Simo-Serra}, title = {{Line Art Colorization with Concatenated Spatial Attention}}, booktitle = "Proceedings of the Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)", year = 2021, }
@InProceedings{TanakaCVPRW2021, author = {Tsunehiko Tanaka and Edgar Simo-Serra}, title = {{LoL-V2T: Large-Scale Esports Video Description Dataset}}, booktitle = "Proceedings of the Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)", year = 2021, }
@InProceedings{HoriuchiCVPRW2021, author = {Yusuke Horiuchi and Edgar Simo-Serra and Satoshi Iizuka and Hiroshi Ishikawa}, title = {{Differentiable Rendering-based Pose-Conditioned Human Image Generation}}, booktitle = "Proceedings of the Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)", year = 2021, }
@InProceedings{MasuzawaMICCAI2020, author = {Naoto Masuzawa and Yoshiro Kitamura and Keigo Nakamura and Satoshi Iizuka and Edgar Simo-Serra}, title = {{Automatic Segmentation, Localization and Identification of Vertebrae in 3D CT Images Using Cascaded Convolutional Neural Networks}}, booktitle = "Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI)", year = 2020, }
@InProceedings{KeshwaniMICCAI2020, author = {Deepak Keshwani and Yoshiro Kitamura and Satoshi Ihara and Satoshi Iizuka and Edgar Simo-Serra}, title = {{TopNet: Topology Preserving Metric Learning for Vessel Tree Reconstruction and Labelling}}, booktitle = "Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI)", year = 2020, }
@InProceedings{YokooCVPRW2020, author = {Shuhei Yokoo and Kohei Ozaki and Edgar Simo-Serra and Satoshi Iizuka}, title = {{Two-stage Discriminative Re-ranking for Large-scale Landmark Retrieval}}, booktitle = "Proceedings of the Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)", year = 2020, }
@Article{IizukaSIGGRAPHASIA2019, author = {Satoshi Iizuka and Edgar Simo-Serra}, title = {{DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement}}, journal = "ACM Transactions on Graphics (SIGGRAPH Asia)", year = 2019, volume = 38, number = 6, }
@InProceedings{ShinyaICCVW2019, author = {Yosuke Shinya and Edgar Simo-Serra and Taiji Suzuki}, title = {{Understanding the Effects of Pre-training for Object Detectors via Eigenspectrum}}, booktitle = "Proceedings of the International Conference on Computer Vision Workshops (ICCVW)", year = 2019, }
@InProceedings{KudoMICCAIW2019, author = {Akira Kudo and Yoshiro Kitamura and Yuanzhong Li and Satoshi Iizuka and Edgar Simo-Serra}, title = {{Virtual Thin Slice: 3D Conditional GAN-based Super-resolution for CT Slice Interval}}, booktitle = "Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention Workshops (MICCAIW)", year = 2019, }
@InProceedings{OmiyaCVPRW2019, author = {Mayu Omiya and Yusuke Horiuchi and Edgar Simo-Serra and Satoshi Iizuka and Hiroshi Ishikawa}, title = {{Optimization-Based Data Generation for Photo Enhancement}}, booktitle = "Proceedings of the Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)", year = 2019, }
@InProceedings{OmiyaSIGGRAPASIABRIEF2018, author = {Mayu Omiya and Edgar Simo-Serra and Satoshi Iizuka and Hiroshi Ishikawa}, title = {{Learning Photo Enhancement by Black-Box Model Optimization Data Generation}}, booktitle = "SIGGRAPH Asia 2018 Technical Briefs", year = 2018, }
@InProceedings{SasakiACPR2017, author = {Kazuma Sasaki and Yuya Nagahama and Zheng Ze and Satoshi Iizuka and Edgar Simo-Serra and Yoshihiko Mochizuki and Hiroshi Ishikawa}, title = {{Adaptive Energy Selection For Content-Aware Image Resizing}}, booktitle = "Proceedings of the Asian Conference on Pattern Recognition (ACPR)", year = 2017, }
@Article{IizukaSIGGRAPH2017, author = {Satoshi Iizuka and Edgar Simo-Serra and Hiroshi Ishikawa}, title = {{Globally and Locally Consistent Image Completion}}, journal = "ACM Transactions on Graphics (SIGGRAPH)", year = 2017, volume = 36, number = 4, }
@InProceedings{RubioICPR2016, author = {Antonio Rubio and Longlong Yu and Edgar Simo-Serra and Francesc Moreno-Noguer}, title = {{BASS: Boundary-Aware Superpixel Segmentation}}, booktitle = "Proceedings of the International Conference on Pattern Recognition (ICPR)", year = 2016, }
@InProceedings{IshiiICPR2016, author = {Tomohiro Ishii and Edgar Simo-Serra and Satoshi Iizuka and Yoshihiko Mochizuki and Akihiro Sugimoto and Hiroshi Ishikawa and Ryosuke Nakamura}, title = {{Detection by Classification of Buildings in Multispectral Satellite Imagery}}, booktitle = "Proceedings of the International Conference on Pattern Recognition (ICPR)", year = 2016, }
@Article{IizukaSIGGRAPH2016, author = {Satoshi Iizuka and Edgar Simo-Serra and Hiroshi Ishikawa}, title = {{Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic Image Colorization with Simultaneous Classification}}, journal = "ACM Transactions on Graphics (SIGGRAPH)", year = 2016, volume = 35, number = 4, }
@InProceedings{SimoSerraCVPR2013, author = {Edgar Simo-Serra and Ariadna Quattoni and Carme Torras and Francesc Moreno-Noguer}, title = {{A Joint Model for 2D and 3D Pose Estimation from a Single Image}}, booktitle = "Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR)", year = 2013, }
@InProceedings{SimoSerraCVPR2012, author = {Edgar Simo-Serra and Arnau Ramisa and Guillem Aleny\`a and Carme Torras and Francesc Moreno-Noguer}, title = {{Single Image 3D Human Pose Estimation from Noisy Observations}}, booktitle = "Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR)", year = 2012, }