Sound Source Localization Using Neural Networks

We created a neural network capable of identifying the position of a sound source in 3D space, mimicking human auditory perception. Using only two microphones, the model extracts sound features by convolving audio signals with human ear impulse responses. It also isolates the sound source from background noise, achieving high accuracy in localization.


Core Features

  • 3D Sound Localization: Accurately identifies the position of sound sources in three-dimensional space.
  • Human Auditory Model: Uses human ear impulse responses to extract aural features.
  • Noise Isolation: Isolates the target sound source from background noise.
  • Minimal Hardware: Requires only two microphones for input, mimicking human hearing.
  • Deep Learning: Leverages neural networks for robust and adaptive sound processing.

Technical Contributions

  • Neural Network Design: Developed a custom neural network architecture for sound localization.
  • Feature Extraction: Implemented convolution with human ear impulse responses to mimic auditory perception.
  • Noise Reduction: Integrated noise isolation techniques to improve localization accuracy.
  • GPU Acceleration: Utilized Nvidia CUDA for efficient training and inference.
  • Containerization: Used Singularity to ensure reproducibility and portability of the solution.
  • Development Environment: Managed dependencies and workflows with Pipenv for seamless collaboration.

Business Value

  • Innovative Solution: Provides a novel approach to 3D sound localization using minimal hardware.
  • High Accuracy: Delivers precise sound source identification even in noisy environments.
  • Cost Efficiency: Reduces hardware requirements by leveraging advanced algorithms.
  • Scalability: Designed to be adaptable for various applications, from robotics to audio engineering.
  • Reproducibility: Containerized for easy deployment and integration into existing systems.

Involved Technologies

  • Python
  • PyTorch
  • Pipenv
  • Singularity
  • Nvidia CUDA

Other Projects

Sound Source Localization Using Neural Networks

Neural network development inspired by human auditory perception to locate sound sources in 3D space using just two microphones. The model extracts aural features, isolates sound from noise, and leverages human ear impulse responses for accurate localization.

Read More »

Custom Board Support

Modification of the Linux kernel, bootloader, and OS to support a custom hardware board based on BalenaOS. The project included creating a new device tree, integrating network drivers, updating u-boot, and adding custom services for hardware initialization during boot.

Read More »

Cloud Migration and Kubernetes Integration

Plan and migration of a customer’s services from Digital Ocean to AWS, integrating Kubernetes (K8S) for scalable and efficient management. The project involved containerizing services, implementing K8S, and automating infrastructure with Terraform, Ansible, and GitLab CI/CD pipelines for a smooth transition.

Read More »
Scroll to Top