Company: Rystad_MCQ_
Difficulty: medium
You have a DataFrame with 10 million rows. You need to filter for rows where column 'status' equals 'active'. Which approach is fundamentally most efficient?
- df[df['status'] == 'active']
- df.query("status == 'active'")
- df.loc[df['status'] == 'active']
- They're all essentially equivalent for this operation

How do you create a line plot of 'consumption' over time using matplotlib?
- plt.plot(df['date'], df['consumption'])
- plt.scatter(df['date'], df['consumption'])
- plt.bar(df['date'], df['consumption'])
- plt.hist(df['consumption'])

How do you handle API pagination for renewable energy project data when you don't know the total number of pages?
- Guess a large number of pages
- Loop until empty response is received
- Request all pages at once
- Use a fixed range of 1-1000

Given a dataset where RANK() is applied over the Score column in descending order, how does RANK() handle tied scores?
- It assigns a unique rank to each row regardless of ties.
- It randomly assigns ranks to tied values.
- It assigns the s