Developed a multi-scale graph embedding approach for extreme multi-label classification (XMLC), aimed at selecting relevant items from a large number of possible outputs, while automatically categorizing the outputs into hierarchically nested groups. Apart from demonstrating superior performance compared to other factorization machine-based models on public benchmark datasets, the approach was also leveraged for joint conversion prediction across hundreds of predictive audiences.