A manufacturing company uses Amazon Connect to manage its contact center and Salesforce to manage its customer relationship management (CRM) data. The data engineering team must build a pipeline to ingest data from the contact center and CRM system into a data lake that is built on Amazon S3. What is the MOST efficient way to collect data in the data lake with the LEAST operational overhead?
Correct Answer: C. Use Amazon Kinesis Data Firehose to ingest Amazon Connect data and Amazon AppFlow to ingest Salesforce data.
Why C is correct: Amazon Kinesis Data Firehose is a fully managed delivery service to which Amazon Connect can stream contact records directly. Firehose then writes the records to S3 with automatic batching, compression, and optional transformation. Amazon AppFlow is designed for SaaS application integration and provides native Salesforce connectivity through a pre-built connector, with automatic schema detection and data transfer directly to S3. Because both services are fully managed, the pipeline needs no custom consumers or servers, which keeps operational overhead minimal.

Why the others are wrong:
A. Kinesis Data Streams requires additional consumers and code to write to S3, which adds operational overhead.
B. Kinesis Data Streams does not natively ingest Salesforce data.
D. AppFlow does not directly support Amazon Connect; Firehose is the native integration.
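To make the two halves of answer C concrete, here is a minimal Python sketch that builds the request parameters for the boto3 calls `firehose.create_delivery_stream` and `appflow.create_flow`. The helper names, S3 prefixes, buffering values, and the choice of an on-demand trigger are illustrative assumptions, not part of the question. The sketch only builds dictionaries and does not call AWS.

```python
def firehose_stream_params(stream_name, bucket_arn, role_arn):
    """Build kwargs for boto3 firehose.create_delivery_stream (hypothetical helper)."""
    return {
        "DeliveryStreamName": stream_name,
        # DirectPut: producers such as Amazon Connect write straight to Firehose.
        "DeliveryStreamType": "DirectPut",
        "ExtendedS3DestinationConfiguration": {
            "RoleARN": role_arn,
            "BucketARN": bucket_arn,
            "Prefix": "connect/contact-records/",  # assumed prefix
            # Firehose batches records until either limit is reached (assumed values).
            "BufferingHints": {"SizeInMBs": 64, "IntervalInSeconds": 300},
            "CompressionFormat": "GZIP",
        },
    }


def appflow_flow_params(flow_name, connector_profile, sf_object, bucket_name):
    """Build kwargs for boto3 appflow.create_flow (hypothetical helper)."""
    return {
        "flowName": flow_name,
        # OnDemand is assumed here; a scheduled trigger is also possible.
        "triggerConfig": {"triggerType": "OnDemand"},
        "sourceFlowConfig": {
            "connectorType": "Salesforce",
            # Name of an existing AppFlow connector profile holding the credentials.
            "connectorProfileName": connector_profile,
            "sourceConnectorProperties": {"Salesforce": {"object": sf_object}},
        },
        "destinationFlowConfigList": [
            {
                "connectorType": "S3",
                "destinationConnectorProperties": {
                    "S3": {
                        "bucketName": bucket_name,
                        "bucketPrefix": "salesforce",  # assumed prefix
                        "s3OutputFormatConfig": {"fileType": "PARQUET"},
                    }
                },
            }
        ],
        # Map_all copies every source field to the destination unchanged.
        "tasks": [
            {
                "sourceFields": [],
                "taskType": "Map_all",
                "connectorOperator": {"Salesforce": "NO_OP"},
                "taskProperties": {},
            }
        ],
    }
```

With AWS credentials configured, the dictionaries could be passed as `boto3.client("firehose").create_delivery_stream(**params)` and `boto3.client("appflow").create_flow(**params)`. The Amazon Connect instance would then be pointed at the delivery stream through its data streaming settings.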