1. Briefly describe the functioning of a gate. (2 sentences) 2. What is the Gumbel-Max trick? Why is it used? (4 sentences) 3. Do you think reduction in FLOPS would necessarily lead to reduction in inference time? Explain. (2 sentences)