Pix2PixGenerative-Models Generative-Adversarial-Networks Conditional-Image-to-Image-Translation-Models
InternVideo: General Video Foundation Models via Generative and Discriminative LearningVision-and-Language-Pre-Trained-Models
Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal AlignmentMulti-Modal-Methods
Language-driven Scene Synthesis using Multi-conditional Diffusion Model3D-Representations Diffusion-Models
LipGANGenerative-Adversarial-Networks Conditional-Image-to-Image-Translation-Models Face-to-Face-Translation
Extremely Efficient Spatial Pyramid of Depth-wise Dilated Separable ConvolutionsImage-Model-Blocks Skip-Connection-Blocks
Amplifying Sine Unit: An Oscillatory Activation Function for Deep Neural Networks to Recover Nonlinear Oscillations EfficientlyActivation-Functions
Hybrid Firefly and Particle Swarm OptimizationOptimization Hybrid-Optimization Heuristic-Search-Algorithms
Protagonist Antagonist Induced Regret Environment DesignAdversarial-Training Environment-Design-Methods
Distribution-induced Bidirectional Generative Adversarial Network for Graph Representation LearningGraph-Embeddings
Guided Language to Image Diffusion for Generation and EditingMulti-Modal-Methods Image-Generation-Models
Pansharpening by convolutional neural networks in the full resolution frameworkConvolutional-Neural-Networks
Gradient SparsificationDistributed-Methods Optimization Stochastic-Optimization Data-Parallel-Methods
YOLOPOne-Stage-Object-Detection-Models Object-Detection-Models Semantic-Segmentation-Models Lane-Detection-Models
Segmentation of patchy areas in biomedical images based on local edge density estimationImage-Segmentation-Models
Contour Proposal NetworkObject-Detection-Models Instance-Segmentation-Models One-Stage-Object-Detection-Models
DE-GAN: A Conditional Generative Adversarial Network for Document EnhancementGenerative-Adversarial-Networks
A Framework for Leader Identification in Coordinated ActivityTime-Series-Analysis Leadership-Inference
SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource SettingsMonocular-Depth-Estimation-Models
Distributed Any-Batch Mirror DescentDistributed-Methods Optimization Data-Parallel-Methods Replicated-Data-Parallel
Mirror-BERTSelf-Supervised-Learning Sentence-Embeddings Word-Embeddings Contextualized-Word-Embeddings Static-Word-Embeddings
Encoder-Decoder model with local and pairwise loss along with shared encoder and discriminator network (EDLPS)Document-Embeddings
Multi-source Sentiment Generative Adversarial NetworkGenerative-Adversarial-Networks Domain-Adaptation
Convolutional time-domain audio separation networkTemporal-Convolutions Speech-Separation-Models Music-source-separation Speech-enhancement
MyGym: Modular Toolkit for Visuomotor Robotic TasksRobotic-Manipulation-Models Reinforcement-Learning-Frameworks Policy-Gradient-Methods
Absolute Learning Progress and Gaussian Mixture Models for Automatic Curriculum LearningSelf-Supervised-Learning
Adaptive Content Generating and Preserving NetworkGenerative-Adversarial-Networks Augmented-Reality-Methods
Multi-Heads of Mixed AttentionAttention-Modules Attention Attention-Mechanisms Transformers Vision-Transformers Rendezvous
Factorization machines with cubic splines for numerical featuresFactorization-Machines Recommendation-Systems
Parts, Poses, and Occlusions in 3D Visual Question AnsweringMulti-Modal-Methods 6D-Pose-Estimation-Models
Transformer in TransformerTransformers Backbone-Architectures Image-Models Image-Model-Blocks Vision-Transformers
Learning Cross-Modality Encoder Representations from TransformersVision-and-Language-Pre-Trained-Models
Wavelet-integrated Identity Preserving Adversarial Network for face super-resolutionFace-Restoration-Models
Contextualized Topic ModelsTopic-Embeddings Contextualized-Word-Embeddings Clustering Document-Embeddings
Context-aware Visual Attention-based (CoVA) webpage object detection pipelineObject-Detection-Models Webpage-Object-Detection-Pipeline