Dan Ben Ami

AI Researcher & Algorithm Developer

Your Photo

About Me

AI researcher specializing in deep learning, computer vision, and vision-language models (VLMs).

I design and build innovative, scalable AI systems and complex pipelines, integrating creative problem-solving with expertise in multimodal learning, algorithm optimization, and end-to-end system design for real-world impact.

Work Experience

Elop - Computer Vision & AI Algorithm Developer

Feb/2023 - Present

  • Performed a diverse range of Computer Vision and Vision-Languege tasks and developed a deep understanding of various state-of-the-art (SOTA) models and architectures.
  • Received project requirements and system constraints, then designed, optimized, and integrated custom algorithmic pipelines into larger systems.
  • Conducted extensive literature reviews—evaluating over 70 research papers annually on customized data with numerical performance metrics.

Scene Understanding and Threat Recognition System

  • Built an end-to-end pipeline that ingests multi-object tracking outputs and leverages Video-LLMs for complex-scene understanding and per-object threat scoring.
  • Led a 3-person project; authored and executed a one-year roadmap (versioning, scope per release, task allocation) to deliver the project.
  • Benchmarked multimodal models from 3B–78B (qwen2.5-VL, LLaVA-OV, etc.) on multiple datasets, comparing accuracy and runtime/latency to select deployment candidates.

Multi-Object (Multi-Sensor) Tracking

  • Developed a tracking solution that increased precision from 72% (naive off-the-shelf YOLO+DeepSORT pipeline) to 93% using a customized pipeline.
  • Integrated an ensemble of video classification models (MViT, TimeSFormer, etc.) focused on reducing false alarm rates.
  • Implemented a three-sensor tracking system (RGB, SWIR, MWIR) to ensure robust performance across different imaging modalities.

Video Description: Visualization of simple basic off-the-shelf multi-object tracking in RGB (Vis) sensor without the full pipeline (Due to IP and confidentiality).

Zero-Shot Keypoint Tracking

  • Explored zero-shot tracking for keypoints in infrared videos characterized by extreme noise conditions.
  • Utilized simulative data generated with UE5 specifically for testing under high interference scenarios.
  • Rigorously evaluated SOTA models to determine limitations and ensure their suitability under adverse conditions.

Video Description: Visualization of tracking keypoints on a passenger airplane from standart RGB (no noise) video (Due to IP and confidentiality).

Video Restoration and Super Resolution

  • Investigated and compared multiple SOTA models, including a self-designed architecture and a self-trained model.
  • Applied knowledge distillation from a FeMaSR teacher model to optimize the chosen architecture.
  • Achieved performance improvements of 5–20% over classical methods, as measured by metrics such as MTF, SSIM, and ESF.

Video Description: Demonstration of video restoration and turbulence mitigation. This output is from the DATUM model (Zhang et al., CVPR 2024), as my own work’s data is confidential.

Video Classification for False Target Filtering

  • Designed a comprehensive data management infrastructure for experimental model training and evaluation.
  • Developed tools for dynamic data integration, version management, attribute-based filtering, and complex augmentations (e.g., smart copy-paste and 3D rotations relative to the camera plane).
  • Conducted hundreds of training experiments across both lightweight and heavy SOTA architectures, rigorously evaluating each component (data version, augmentations, architecture, transfer learning, etc.).
  • Increased overall accuracy by 22%.

Ben-Gurion University - Teaching Assistant

2020 - 2023

  • Algorithms and Graph Theory (2023)
  • Digital Design (2022)
  • Digital Computers Structure (2022)
  • Linear Algebra (2020-2021)

Education

Ph.D. in Electrical and Computer Engineering

Ben-Gurion University, 2021 – Present

Lab: InsightLab

GPA: 97

Advisors: Prof. Kobi Cohen (2021–present), Dr. Chaim Baskin (2024–present)

Accelerated Ph.D. program for top-performing students.

Main courses: Deep learning, Sequential learning, Statistical inference and Data Mining, Multivariate statistical data analysis, Game theory.

B.Sc in Computer Engineering

Ben-Gurion University, 2018 – 2022

GPA: 92

Honors: Summa Cum Laude (in the dean's list)

Ranking: Ranked #1 in my class (2018–2022 BGU CE)

Research: Researched computational modeling of protein-DNA/RNA interactions using deep learning models, guided by Dr. Yaron Orenstein.

Publications

On Progress - Innovative Benchmark and Plug-and-Play Frame Selection Method for Long-Video Question Answering

Dan Ben Ami, Gabrielle Serrusi, Kobi Cohen, Chaim Baskin

In preparation, 2025

We are developing a novel Video Question Answering (VideoQA) benchmark. Alongside, we are designing a plug-and-play frame selection module that can integrate into existing VideoQA models to improve efficiency and accuracy by dynamically selecting the most relevant frames during inference.

A Stable Polygamy Approach to Spectrum Access with Channel Reuse

Dan Ben Ami, Kobi Cohen

Under review, IEEE Transactions on Communications (TCOM), 2024

I introduced the "Stable Polygamy Problem" (SPP) for spectrum access with channel reuse, developed efficient algorithms including RP&R, and proved their performance in specific interference regimes with strong simulation results.

Client Selection for Generalization in Accelerated Federated Learning: A Multi-Armed Bandit Approach

Dan Ben Ami, Kobi Cohen, Qing Zhao

IEEE Access, 2025

I developed a multi-armed bandit–based algorithm (BSFL) for federated learning that balances training latency and model generalization, achieving logarithmic regret and outperforming prior methods on synthetic and real datasets.

Client Selection Publication Image

A Universal System for Boosting Gene Expression in Eukaryotic Cell-Lines

Inbal Vaknin, Or Willinger, Jonathan Mandl, Hadar Heuberger, Dan Ben-Ami, Yi Zeng, Sarah Goldberg, Yaron Orenstein, Roee Amit

Nature Communications, 2024

Designed and trained deep learning models to predict protein expression from DNA sequences, guiding motif selection for cross-species promoter boosting.

Characterizing Regulatory Grammar Rules in S. cerevisiae Using a Library of Conserved and Unknown Motifs

Inbal Vaknin, Hadar Heuberger, Dan Ben-Ami, Or Wilinger, Yi Zeng, Leon Anavy, Zohar Yakhini, Sarah Goldberg, Yaron Orenstein, Roee Amit

Cerevisiae, 2023

Developed convolutional neural network models to predict gene expression from DNA sequences, enabling validation of conserved transcriptional grammar rules.

Programming

  • Programming Languages: Python, C++, MATLAB, Bash
  • Deep Learning Frameworks: PyTorch, TensorFlow, Keras
  • Computer Vision & VLMs: OpenCV, Hugging Face Transformers, vLLM, CLIP, LLaVA, Qwen, SAM, YOLO
  • Data Processing & Analysis: NumPy, Pandas, scikit-learn, SciPy
  • Visualization: Matplotlib, Plotly, TensorBoard
  • Optimization & MLOps: PyTorch Lightning, Hydra, ONNX, Torch-TensorRT
  • Tools & Environments: Docker, Git, Linux, Conda, Jupyter

Soft Skills

  • Creative problem-solving and innovation in AI system design
  • Strong analytical and critical thinking abilities
  • Cross-disciplinary collaboration and communication
  • Rapid learning and adaptation to emerging technologies
  • Translating research concepts into practical, scalable solutions
  • Project planning and execution in complex, multi-stage pipelines
  • Mentoring and knowledge sharing in technical teams
  • Attention to detail while maintaining big-picture perspective

Contact Information

Email: danbenami3@gmail.com

Phone: +972 549981819

LinkedIn: Dan Ben Ami