The Tenyks BloggerThe Foundation Models Reshaping Computer VisionLearn about the Foundation Models — for object classification, object detection, and segmentation — that are redefining Computer Vision.Oct 26, 20234Oct 26, 20234
Jesus RodriguezThe New ImageNet: DeepMind’s New Perception Benchmark for Deep Learning ModelsA new benchmark focuses focuses on multimodal computer vision models.Oct 20, 2022Oct 20, 2022
InSyncedReviewbySyncedDeepMind Paper Provides a Mathematically Precise Overview of Transformer Architectures and…Since their 2017 debut as a novel approach to natural language processing (NLP), transformers have achieved epoch-making performance across…Jul 25, 20222Jul 25, 20222
InSyncedReviewbySyncedTsinghua & NKU’s Visual Attention Network Combines the Advantages of Convolution and…The powerful self-attention mechanisms in transformer architectures have significantly improved the state-of-the-art across a wide range of…Feb 23, 2022Feb 23, 2022
InDeep-Learning-For-Computer-VisionbyInside AI10. Introduction to Deep Learning with Computer Vision— Types of Convolutions & Atrous ConvolutionsWritten by Praveen Kumar.Feb 16, 2020Feb 16, 2020
InSyncedReviewbySyncedUC Berkeley & Google’s BoTNet Applies Self-Attention to CV BottlenecksResearchers from UC Berkeley and Google Research have introduced BoTNet, a “conceptually simple yet powerful” backbone architecture that…Feb 17, 2021Feb 17, 2021
InTowards AIbyLouis-François BouchardCreate 3D Models from Images! AI and Game Development, Design…This promising model called GANverse3D only needs an image to create a 3D figure that can be customized and animated!Apr 18, 2021Apr 18, 2021
InCodeXbyMobiDevSmall Datasets-Based Object Detection: How Much Data is Enough?Getting started with any machine learning project often starts with the question: “How much data is enough?”. The response depends on a…Sep 20, 20211Sep 20, 20211
InGeek CulturebyDickson WuA New Frontier of Machine LearningPaper Summary: “Deep Learning in Spiking Neural Networks”Sep 18, 20212Sep 18, 20212
InSyncedReviewbySyncedAre Patches All You Need?Vision transformer architectures (ViTs) have achieved compelling performance across many computer vision tasks, often outperforming…Oct 12, 20211Oct 12, 20211
InSyncedReviewbySyncedMicrosoft Asia’s Swin Transformer V2 Scales the Award-Winning ViT to 3 Billion Parameters and…In the ICCV (International Conference on Computer Vision) 2021 paper awards announced last month, a team from Microsoft Asia Research was…Nov 22, 2021Nov 22, 2021
Cambridge SparkCoordConv Layer: Deep LearningAn introduction to Uber’s new CoordConv architecture and its applicationsOct 25, 2019Oct 25, 2019
InSyncedReviewbySyncedDeepMind’s PoG Excels in Perfect and Imperfect Information Games, Advancing Research on General…Dec 8, 2021Dec 8, 2021
Sahil ChachraPaper Summary — MetaFormer is Actually What You Need for VisionIn recent times we have seen that Transformers (for vision) have performed very well, i.e., at par or at times surpassing the previously…Jan 7, 20221Jan 7, 20221
Sik-Ho TsangReview: Layer Normalization (LN)Stabilizing Training, Reduce Training TimeFeb 8, 2022Feb 8, 2022
InTDS ArchivebyWanshun WongWhat is Group Normalization?An alternative to Batch NormalizationJun 17, 20202Jun 17, 20202
InTDS ArchivebyLeon SickRegNet: The Most Flexible Network Architecture For Computer VisionA model design that scales for high-efficiency or high-accuracyDec 13, 20211Dec 13, 20211