Large-Scale Visual SLAM with 3D Gaussian Splatting
LSG-SLAM: a stereo visual SLAM using 3D Gaussian splatting for large-scale outdoor reconstruction, improving tracking stability and mapping quality.
LSG-SLAM: a stereo visual SLAM using 3D Gaussian splatting for large-scale outdoor reconstruction, improving tracking stability and mapping quality.
Explains why ML models can't reach zero error, detailing irreducible error, bias-variance tradeoff, model complexity, overfitting, and MSE for prediction accuracy.
LLM-driven system automates embodied intelligence skill generation for robotic arms, converting natural-language tasks into MuJoCo scenes, actions, and reward code.
Practical deep learning tuning guide covering learning rate selection, batch size effects, weight initialization, optimizers, regularization, data augmentation and training tips.
Technical overview of why GPUs outperform CPUs for deep learning training: neural networks' matrix operations, parallelism, GPU architecture and GPGPU benefits.
Explore deep learning for defect detection in industries, offering accurate solutions for quality control with advanced frameworks.
Analysis of large-model scaling: how parameter count and training tokens drive compute requirements, showing compute grows ~quadratically with model size.
Technical guide to scaling LLM training: analyzes memory usage, gradient accumulation, ZeRO, and tensor/data parallelism to improve throughput and GPU utilization.
Technical guide to installing RKLLM-Toolkit and converting/deploying the DeepSeek-R1 LLM on EASY-EAI-Orin-Nano (RK3576), covering env setup, conversion, and on-device inference.
Technical overview of RNNs and LSTM architectures, how they model sequential data, application areas like signal and text processing, and MATLAB-based implementation.
OMGEval presents an open-source multilingual open-ended QA benchmark (804 Chinese prompts) localized from AlpacaEval, using Text-Davinci-003 baseline and GPT-4 evaluation.
Summary of terahertz sub-THz testing for 6G: spectrum use, RF front-end modules, signal generation, and channel measurement tools for terahertz communications research.
OpenAI's study unveils an instruction hierarchy to boost LLM security against attacks like prompt injections, enhancing model safety.
Analysis of deep learning in computer vision: strengths, limits, dataset biases, comparison with classical vision methods, interpretability and risks in safety-critical applications.
Technical overview and setup of the Raspberry Pi AI kit with Hailo 8L NPU, covering M.2 HAT+ installation, thermal management, and software setup for Pi 5.
PrefixRL uses deep reinforcement learning to optimize parallel prefix circuits, producing smaller, lower-latency adders and mapping Pareto trade-offs between area and latency.
Summary of TensorNODE, a TensorWave bare-metal AI cloud using AMD MI300X GPUs and a PCIe Gen5 memory fabric to enable petabyte-scale GPU memory pools.
Survey of hyperparameter optimization methods - grid/random search, Bayesian optimization, simulated annealing, genetic algorithms and successive halving for ML tuning.
Analysis of an IoT smart classroom solution - hardware connectivity, data interoperability and scenario intelligence for unified, energy-efficient device management.
Overview of FPGA applications in machine learning: accelerating neural network inference, hardware quantization, algorithm optimization, and efficiency for edge AI deployments.