Kotlin Coroutines and Flow: Dispatching, Concurrency, Streams | Kai

Introduction: Leave Callback Hell Behind and Embrace Structured Concurrency

In Android development, asynchronous programming is unavoidable. We need to perform time-consuming tasks such as network requests, database operations, and complex computation without blocking the main thread, or UI thread. Traditional concurrency models, such as using Thread directly, AsyncTask, or callback-based designs, often produce complex and hard-to-maintain code, callback hell, difficult resource management such as memory and thread leaks, and complicated cancellation and exception handling logic.

Kotlin Coroutines emerged to solve these problems. They provide a new concurrency programming paradigm that is lighter, easier to understand, and easier to manage. Coroutines let us write asynchronous code in a style that is close to synchronous code, greatly simplifying async logic. Kotlin Flow is the Reactive Streams implementation built on top of coroutines for handling asynchronous sequences of data.

For Android specialists, simply knowing how to start a coroutine with launch or collect a simple Flow with collect is far from enough. You need a deep understanding of how coroutines work internally, including suspension and resumption, state machines, structured concurrency concepts such as scopes, Job hierarchies, and supervision; how dispatcher choices affect execution; cold and hot Flow behavior; advanced Flow operators; backpressure strategies; the internals and use cases of StateFlow and SharedFlow; subtle exception handling and cancellation rules; coroutine and Flow testing; and common traps. Mastering these areas lets you use coroutines confidently in complex scenarios and write efficient, robust, maintainable concurrent code.

This article explores advanced Coroutine and Flow usage and internals:

Coroutine fundamentals: the internal mechanics of suspend functions and Continuation Passing Style, or CPS.
Structured concurrency: CoroutineScope, Job, and SupervisorJob hierarchy and lifecycle management.
Dispatchers: the mechanics of thread switching across IO, Default, Main, and Unconfined.
Flow in depth: cold streams, powerful intermediate operators, flowOn, and context.
Backpressure handling: strategies such as buffer, conflate, and collectLatest.
StateFlow and SharedFlow: hot-flow applications and configuration details.
Channel: communication between coroutines.
Advanced exception handling: try-catch, CoroutineExceptionHandler, supervisorScope, and Flow’s catch operator.
Cancellation: cooperative cancellation and NonCancellable.
Testing: runTest and TestDispatcher.
Comparisons and pitfalls: comparison with RxJava, common mistakes, and best practices.

1. Coroutine Fundamentals: The Magic of Suspension - suspend and Continuations

1. Coroutines vs. threads

Coroutines are not threads. They are units of computation that can suspend execution at specific points and later resume execution on the same or a different thread.
Coroutines are very lightweight. Thousands of coroutines can run on a small number of threads. Switching coroutine context is usually far cheaper than switching thread context.

2. The suspend keyword

Marks that a function can suspend without blocking a thread. Examples include waiting for a network response, calling delay(), or waiting for another coroutine’s result with await().
A suspend function can only be called from another suspend function or inside a coroutine builder such as launch, async, or runBlocking.

3. Internal suspension mechanics - Continuation Passing Style, or CPS

When the Kotlin compiler encounters a suspend function, it transforms the function at compile time. Developers do not need to manage the details, but understanding the mechanism helps with deeper reasoning:

State Machine: the function body is transformed into a finite state machine. Local variables become fields in the state machine, and each suspension point, meaning each call to another suspend function, becomes a state.
Continuation object: the compiler implicitly adds a Continuation<T> parameter to the suspend function. This Continuation object encapsulates the next segment of logic to run after the coroutine resumes, similar to a callback, and also holds the current state of the state machine.
Suspension point: when another suspend function is called, the current coroutine: (1) saves current state, such as local variables, into the Continuation object; (2) calls the target suspend function and passes the Continuation object to it; (3) suspends current coroutine execution, possibly yielding the thread.
Resumption: when the called suspend function completes, for example after a network response returns, it calls resumeWith(Result) on the saved Continuation object. This: (1) restores the state machine to the previous state; (2) continues execution from the next state after the suspension point. Resumption may happen on the original thread or may switch to another thread depending on the dispatcher.
Effect: this compile-time transformation lets developers write asynchronous logic in code that appears synchronous, while the compiler and coroutine library handle state saving and callback complexity.

Diagram: suspend function CPS transformation

// Original suspend function
suspend fun fetchData(url: String): String {
    val request = prepareRequest(url) // Normal code
    val response = networkCall(request) // Suspend point 1
    val processed = processData(response) // Normal code after resume 1
    saveToDb(processed) // Suspend point 2
    return "Success" // Final result after resume 2
}

// Compiled State Machine (Conceptual)
class FetchDataStateMachine(private val continuation: Continuation<String>) : ContinuationImpl {
    var label = 0 // State indicator
    var result: Any? = null
    // Fields to store local variables like 'request', 'response', 'processed'

    override fun invokeSuspend(result: Result<Any?>): Any? {
        this.result = result.getOrThrow() // Store result from previous suspension
        while(true) {
            when (label) {
                0 -> { // Initial state
                    val request = prepareRequest(url)
                    // Save state ('request')
                    label = 1 // Set next state
                    val responseResult = networkCall(request, this) // Call suspend func, pass 'this' as continuation
                    if (responseResult == COROUTINE_SUSPENDED) return COROUTINE_SUSPENDED // Suspend successful
                    // If networkCall completed immediately (rare), fall through
                    this.result = responseResult // Store immediate result
                    // Fall through to state 1 (simulates immediate resume)
                }
                1 -> { // Resumed after networkCall
                    val response = this.result as ResponseType
                    val processed = processData(response)
                    // Save state ('processed')
                    label = 2 // Set next state
                    val dbResult = saveToDb(processed, this) // Call suspend func
                    if (dbResult == COROUTINE_SUSPENDED) return COROUTINE_SUSPENDED
                    this.result = dbResult
                    // Fall through to state 2
                }
                2 -> { // Resumed after saveToDb
                    // Final processing / return calculation
                    return "Success"
                }
                // ... other states for error handling etc. ...
            }
        }
    }
}

2. Structured Concurrency: Avoid Coroutine Leaks and Chaos

This is the core advantage of coroutines over other concurrency models such as raw threads or GlobalScope.

1. Core idea

The lifecycle of a coroutine should be bound to the Scope that executes it. When the Scope ends, all coroutines launched inside it should be automatically cancelled. This greatly simplifies resource management and avoids coroutine leaks.

2. Core concepts

CoroutineScope: defines the scope of coroutines. Each scope has an associated CoroutineContext, which usually contains a Job.
Job: represents a cancellable unit of work with lifecycle states such as Active, Completing, Completed, Cancelling, and Cancelled. Jobs can be organized into parent-child hierarchies.

3. Key principles of structured concurrency

Scope constraint: coroutines must be launched inside a CoroutineScope, using builders such as launch and async.
Lifecycle binding: a coroutine’s lifecycle is controlled by the Job in its CoroutineScope. Cancelling the Scope’s Job recursively cancels all child Jobs and their coroutines.
Parent-child relationship:
- Parents wait for children: a parent Job can only complete after all its child Jobs complete.
- Child failure cancels the parent by default: if a child coroutine that is not a direct child of a SupervisorJob fails with an uncaught exception, it cancels its parent Job. The parent then cancels all other child Jobs.

4. Common scopes and builders

GlobalScope: Use with caution. This is a global singleton Scope whose lifecycle is tied to the entire application process. Coroutines launched in GlobalScope are easy to leak because they are not automatically cancelled when a specific UI component or business operation ends. It is mainly suitable for a small number of top-level long-lived background tasks, with very careful manual management.
runBlocking { … }: starts a coroutine and blocks the current thread until all tasks inside it complete. It is mainly used to bridge blocking code and the suspend world, such as calling suspend functions from a main function or tests. Never use it on the Android main thread or inside coroutines unless you clearly understand the consequences.
coroutineScope { … }, a suspend function: creates a structured nested scope. It inherits the outer scope’s context but has its own Job. It suspends the caller until all child coroutines launched inside it complete. If any child coroutine fails, coroutineScope itself fails and rethrows the exception, while cancelling the other children. It is commonly used to split one unit of work into multiple parallel subtasks and wait for all of them.
supervisorScope { … }, a suspend function: similar to coroutineScope, it creates a nested scope and waits for child tasks to complete. Key difference: it uses SupervisorJob. Failure of a direct child coroutine does not make supervisorScope fail and does not cancel sibling child coroutines. Exceptions need to be handled by the child coroutine itself, or through CoroutineExceptionHandler. It is useful when child task failures should be isolated, such as multiple independent data-loading regions on the same UI screen.

5. Android Jetpack scopes

viewModelScope, a ViewModel extension property: a Scope built into ViewModel and bound to the ViewModel lifecycle. It is automatically cancelled when ViewModel onCleared() runs. Internally it uses SupervisorJob plus Dispatchers.Main.immediate. It is the preferred way to launch coroutines for business logic and data loading inside a ViewModel.
lifecycleScope, a LifecycleOwner extension property: a Scope built into Activity and Fragment and bound to the component Lifecycle. It is automatically cancelled when the Lifecycle reaches DESTROYED. It also uses SupervisorJob plus Dispatchers.Main.immediate. It provides methods such as launchWhenCreated, launchWhenStarted, and launchWhenResumed, which start coroutines at specific lifecycle states and automatically pause or cancel when leaving those states.

6. Job() vs. SupervisorJob()

Job(): child task failure cancels the parent task and all sibling tasks, which is the default failure propagation behavior.
SupervisorJob(): child task failure does not affect the parent or sibling tasks, providing failure isolation. viewModelScope and lifecycleScope use it by default. You can also create CoroutineScope(SupervisorJob() + ...) directly for custom scopes.

Diagram: structured concurrency - Job hierarchy and cancellation

+---------------------------------------------+
| CoroutineScope (Parent Job)                 |
|---------------------------------------------|
|    launch { // Child Job 1                 |
|      ...                                    |
|      launch { // Grandchild Job 1.1        | --------+
|        ...                                  |         | Cancellation
|      }                                      |         | propagates down
|    } // Child Job 1 completes when 1.1 done |         V
|                                             |
|    async { // Child Job 2 (using default Job) |
|      ...                                    |
|      if (error) throw Exception() --------->|--- X (Failure)
|      ...                                    |
|    } // Failure here cancels Parent & Child 1|
|                                             |
|    launch(SupervisorJob()) { // Child Job 3 |
|      launch { // Grandchild Job 3.1        |
|        if (error) throw Exception() ------->|--- X (Failure) - Only 3.1 fails, 3 survives
|      }                                      |
|    } // Child Job 3 unaffected by 3.1 failure|
+---------------------------------------------+
   |
   | Parent Job completes only when ALL (non-failing or supervised) children complete.
   | If Parent Job is cancelled, ALL children are cancelled.

3. Dispatchers: Where Coroutines Run

CoroutineDispatcher determines which thread or thread pool actually executes coroutine code. It is part of CoroutineContext.

1. Standard dispatchers

Dispatchers.Default:
- Thread pool: a JVM-shared background thread pool whose size is usually equal to the number of CPU cores, with a minimum of 2.
- Best for: CPU-intensive computation, such as sorting, parsing complex data, and image processing, where blocking IO is not involved. Do not run blocking IO here.
Dispatchers.IO:
- Thread pool: a JVM-shared background thread pool that can create more threads on demand, with a high upper limit such as 64 or more.
- Best for: blocking IO operations such as network requests, file reads and writes, and database access. IO operations spend much of their time blocking while waiting, so more threads improve concurrent throughput.
Dispatchers.Main:
- Thread: the Android application’s main thread, or UI thread.
- Best for: any operation that needs to interact with UI, such as updating Views or showing Toasts, and Android APIs that must run on the main thread.
- .immediate: Dispatchers.Main.immediate is an optimization. If the current thread is already the main thread, it tries to execute coroutine code immediately instead of posting it to the event queue first, reducing a small amount of latency. Its behavior in complex cases needs careful attention.
Dispatchers.Unconfined:
- Behavior: the coroutine starts on the current caller thread, but after the first suspension point, it resumes on the thread that resumes the coroutine, meaning the thread that executes continuation.resumeWith. The execution thread may change across suspend and resume.
- Best for: very limited situations. Normal application code usually does not need it. It may be used in some low-latency scenarios that do not care about execution thread, or inside framework/library implementations. It can easily produce confusing thread behavior and is not recommended for routine use.

2. Switching dispatchers with withContext(Dispatcher) { … }

Role: temporarily switch to a specified Dispatcher inside a coroutine, run a block of code, then automatically switch back to the original Dispatcher. It is a suspend function.
Core use: encapsulate tasks that must run on a specific thread, such as IO work or CPU computation, while keeping caller code simple. For example, a coroutine launched in viewModelScope on the main thread can use withContext(Dispatchers.IO) for network or database work.
Return value: withContext returns the result of its block.

viewModelScope.launch { // Starts on Dispatchers.Main.immediate
    val userData = fetchUserData() // Calls suspend function below
    updateUi(userData) // Back on Main thread automatically
}

suspend fun fetchUserData(): UserData {
    // Switch to IO dispatcher for network call
    return withContext(Dispatchers.IO) {
        // This block runs on an IO thread
        val response = RetrofitClient.api.getUser()
        processResponse(response) // Still on IO thread
    } // Switches back to the original caller's dispatcher (Main) after block completes
}

4. Flow in Depth: Reactive Streams for the Coroutine Era

Flow is the core tool in the Kotlin coroutines ecosystem for handling asynchronous data streams.

1. Cold stream behavior

Flow is cold by default. This means code inside a Flow builder such as flow { ... } does not start running until a terminal operator, such as collect, subscribes to it.
Each terminal operator triggers an independent Flow execution. If there are multiple collect calls, the code inside flow { ... } runs multiple times.

2. Core components

Builders: create Flow instances, such as flow { emit(T) }, flowOf(T...), List<T>.asFlow(), channelFlow, and callbackFlow.
Intermediate Operators: transform, filter, and combine Flow values, returning a new Flow without triggering execution by themselves. Examples include map, filter, transform, zip, combine, flatMapConcat, flatMapMerge, flatMapLatest, flowOn, buffer, conflate, catch, and onCompletion. These operators are usually suspend functions or inline functions.
Terminal Operators: consume Flow and trigger execution, usually as suspend functions. Examples include collect, first, single, toList, toSet, count, reduce, fold, and launchIn(scope).

3. flowOn(Dispatcher) - specify the upstream execution thread

Key role: changes the CoroutineContext, especially the Dispatcher, used to execute the Flow builder and all intermediate operators before it.
Usage: myRepository.getData().map { ... }.flowOn(Dispatchers.IO).collect { ... }. Here, getData(), assuming it is flow { ... }, and map run on Dispatchers.IO, while collect runs in the caller’s context, such as the main thread.
Compared with withContext: withContext changes the context for a small code block. flowOn affects the execution context of the entire upstream Flow.

4. Backpressure: what if the producer is faster than the consumer?

Default behavior: suspend. When collect cannot keep up, upstream emit calls suspend until collect finishes processing the current element.
Buffering with buffer(capacity):
- Adds a buffer between producer and consumer, internally using a Channel. The producer can quickly emit into the buffer until it fills, while the consumer reads data from the buffer.
- Allows producer and consumer to run concurrently, improving throughput.
- capacity: buffer size. Examples include Channel.BUFFERED, default 64; Channel.CONFLATED, keep only the latest; and Channel.RENDEZVOUS, capacity 0, similar to no buffer.
- The buffer may consume additional memory.
Conflation with conflate():
- When the producer emits a new value while the consumer is still processing an old one, all unprocessed values in the buffer are dropped, and only the latest value is kept for the consumer.
- Suitable when only the latest state matters, such as UI updates.
Processing the latest with collectLatest { action }:
- A terminal operator. When Flow emits a new value, if the action suspend function for the previous value is still running, that previous action is cancelled, and a new action starts for the new value.
- Suitable when only the latest event should be handled, such as search requests triggered by rapid user typing where only the final query should run.

Diagram: Flow backpressure strategies

Producer:  --E1---E2---E3---E4---E5--> emit()

Collector:  | Process(E1) | Process(E2) | Process(E3) | ... -> collect() (Slow)

Default (Suspend):
Producer:  --E1(sus)E2(sus)E3(sus)E4---> emit() waits for collector

buffer(1): (Producer runs ahead slightly)
Buffer:    [E2] [E3]
Producer:  --E1---E2---E3---E4-------> emit() fills buffer then suspends
Collector:  | Process(E1) | Process(E2) | Process(E3) | ... -> collect() takes from buffer

conflate(): (Only latest matters)
Producer:  --E1---E2---E3---E4---E5--> emit() continuously
Collector:  | Process(E1) | Process(E3) | Process(E5) | ... -> collect() gets latest when ready

collectLatest(): (Cancel previous processing)
Producer:  --E1---E2---E3---E4---E5--> emit()
Collector Action: | Prc(E1) | Prc(E2) | Prc(E3)-cancel| Prc(E4)-cancel| Process(E5) | ...

5. StateFlow and SharedFlow: Hot-Flow State and Event Buses

Cold flows are not suitable for every scenario. Sometimes multiple subscribers need to receive the same stream instead of re-executing it, or the stream needs to represent an observable current state.

1. StateFlow<T> - a hot flow for state representation

Characteristics:

Hot flow: has a value as long as it exists, independent of whether there are collectors.
State container: holds a single latest state value.
Initial value: must be created with an initial value.
Immediate access: a new collector immediately receives the current latest state value.
Value comparison: only emits a new value if equals() with the previous value returns false, providing distinctness.
Conflation: its internal mechanism ensures collectors receive only the latest state. Rapid intermediate states are conflated, similar to conflate().

MutableStateFlow<T>: a mutable StateFlow. It provides a value property for reads and writes, update { ... } for atomic updates, and tryEmit() for non-suspending emission attempts.

Use case: one of the best practices for exposing state from ViewModel to the UI layer. The UI layer can convert a cold flow into StateFlow with stateIn(scope, SharingStarted.WhileSubscribed(5000), initialValue), or use MutableStateFlow directly, then collect it with collectAsStateWithLifecycle.

2. SharedFlow<T> - a general-purpose hot flow / event bus

Characteristics:

Hot flow: exists independently of collectors.
Broadcast: broadcasts values to all current collectors.
Highly configurable: constructor parameters control its behavior:
- replay: an Int specifying how many recently emitted values to replay to new collectors. 0 means no replay, so new collectors receive only future values. Values greater than 0 are used to cache history.
- extraBufferCapacity: an Int specifying additional buffer space beyond the replay cache for active collectors, used for backpressure. 0 means no extra buffer.
- onBufferOverflow: a BufferOverflow strategy for new emit operations when the buffer, meaning replay plus extraBufferCapacity, is full: SUSPEND, the default; DROP_OLDEST; or DROP_LATEST.

MutableSharedFlow<T>: a mutable SharedFlow. It provides emit() for suspending emission and tryEmit() for non-suspending emission. subscriptionCount: StateFlow<Int> observes the current number of active collectors.

Use cases:

Event bus: broadcast events, such as user actions or background notifications, to multiple listeners. Pay special attention to “single-shot event” handling, discussed below.
Hot flows that need to replay historical data.
Custom StateFlow-like behavior through specific configuration.

Key considerations:

Parameter selection: replay, extraBufferCapacity, and onBufferOverflow are critical to SharedFlow behavior and must be chosen carefully for the specific requirement. Poor choices can cause event loss, memory leaks if caches are too large, or deadlocks if SUSPEND is used and no collector consumes values.
Single-shot events: if SharedFlow is used to deliver events that must be handled exactly once, such as showing a Toast or navigating, use extra care. Configuration changes may cause the UI layer to resubscribe. If replay > 0, it may receive and process an old event again. Common solutions include: (1) use Channel instead; (2) mark events as consumed downstream; (3) use an Event wrapper class.

6. Channel: The Messenger Between Coroutines

Channel is another hot stream, or communication primitive, provided by coroutines. It is similar to a thread-safe BlockingQueue, but uses suspend functions for sending and receiving.

1. Characteristics

It is hot, and data is passed between senders and receivers.
send(element): suspends the sender until the element is received by receive() or placed into the buffer.
receive(): suspends the receiver until an element is available in the Channel.
Supports different capacities and strategies: RENDEZVOUS, capacity 0, where send and receive must pair; BUFFERED, finite capacity; CONFLATED, keep only the latest; and UNLIMITED, unlimited capacity, which requires care with memory.

2. Builders

Channel<E>(capacity), produce<E>(context, capacity) { ... }, which returns ReceiveChannel<E>.

3. callbackFlow<T> { … } / channelFlow<T> { … }

Purpose: bridge callback-based APIs, or logic that actively pushes data, into cold Flow.
Mechanism: internally creates a Channel. It provides a ProducerScope, which inherits from CoroutineScope and implements SendChannel. Inside it, you can safely call send() or trySend() to emit data. You must call awaitClose { cleanup_logic }, which runs when the Flow is cancelled or closed, to unregister callbacks or clean up resources and prevent leaks.
Difference: channelFlow is a more general version of callbackFlow; callbackFlow adds optimizations for callback APIs, such as safe sending.

4. Use cases

Producer-consumer patterns between coroutines, scenarios requiring precise control over send/receive synchronization, and bridging callback APIs.

7. Advanced Exception Handling: Handle Failure Gracefully

Structured concurrency has important consequences for coroutine exception handling.

1. Basic try-catch

Using try-catch inside a launch or async block catches exceptions thrown by code inside it, including called suspend functions. This is the most direct handling approach.

2. Structured concurrency propagation

Default Job: if a coroutine that is not a direct child of a SupervisorJob throws an uncaught exception, the exception propagates upward to the parent Job, causing the parent Job and all other sibling Jobs to be cancelled. The exception may eventually crash the application.
async exceptions: if a coroutine launched by async throws an exception, the exception is held until await() is called, at which point it is thrown. If await() is never called, the exception may appear to be lost, unless the parent scope is cancelled.

3. CoroutineExceptionHandler - the final line of defense

Role: a CoroutineContext element used to handle uncaught exceptions, meaning exceptions that were not caught by try-catch and have propagated to a Scope boundary or GlobalScope.
Purpose: log and report errors. It cannot stop the coroutine or its scope from being cancelled.
Installation location: usually installed on a top-level scope, such as GlobalScope, viewModelScope, or lifecycleScope, or on a top-level launch builder. Installing it on child coroutines or intermediate non-SupervisorJob scopes usually has no effect, because the exception cancels the parent first.

4. supervisorScope and exception isolation

Inside supervisorScope { ... }, failure of a direct child coroutine does not propagate to supervisorScope itself or to sibling child coroutines. The failed child coroutine must handle the exception itself, through try-catch or a CoroutineExceptionHandler installed on its launch builder.

5. Flow exception handling

Downstream try-catch: a caller of collect can use try-catch to catch exceptions thrown by collect itself, all upstream operators, and the Flow builder.
catch { e -> emit(…) } operator:
- Only catches upstream exceptions: it only catches exceptions from its upstream, including the builder and previous operators.
- Recovery or transformation: inside catch, you can emit a fallback value, log the error, or throw a different exception.
- Does not catch downstream exceptions: the catch operator cannot catch exceptions thrown downstream, such as inside collect.
onCompletion { cause: Throwable? -> … } operator:
- The onCompletion block runs whether the Flow completes normally or terminates because of an exception.
- The cause parameter is non-null on exceptional termination and null otherwise.
- Use case: perform cleanup logic, similar to finally, regardless of success or failure.

8. Cancellation: Stopping Gracefully

Coroutine cancellation is also based on structured concurrency and cooperative mechanisms.

1. Propagation

Cancellation requests propagate downward from parent Jobs to all child Jobs. Use scope.cancel() or job.cancel().

2. Cooperative cancellation

Coroutine code must actively check cancellation state and respond to it in order to be cancelled effectively.

Built-in checkpoints: all suspending functions in the kotlinx.coroutines library, such as delay, yield, withContext, channel.receive, and Flow operators, check internally whether the current coroutine has been cancelled. If it has, they throw CancellationException.
Manual checks: for long-running CPU-intensive loops that do not contain suspending function calls, you must manually check cancellation state:
- if (!isActive) return or if (!isActive) throw CancellationException().
- ensureActive(): throws CancellationException if already cancelled.
- yield(): suspends the current coroutine and lets other coroutines run, while also checking cancellation state.
CancellationException: this is the standard signal for coroutine cancellation. It is usually considered normal control flow, and default exception handlers such as CoroutineExceptionHandler usually ignore it.

3. NonCancellable Context

Scenario: perform cleanup work in a finally block or onCompletion that must complete even if the coroutine has already been cancelled, such as releasing file handles or closing network connections.
Usage: withContext(NonCancellable) { // cleanup code }. Inside this block, the coroutine temporarily does not respond to cancellation requests. Use it carefully and avoid long-running operations.

:::danger Be very careful:

Do not catch too broadly: if you write catch (e: Exception), make sure to rethrow CancellationException, for example with throw e; otherwise the coroutine will ignore the cancellation signal and keep running, wasting resources or causing leaks.
Non-cancellable suspend functions: if a coroutine is stuck inside a suspend function that does not check cancellation state, such as some IO operations, it may not respond to cancellation promptly. :::

9. Testing Coroutines and Flow

The kotlinx-coroutines-test library provides strong testing support.

1. runTest { … }

Core test builder: replaces runBlockingTest. It provides a TestScope and runs on a TestDispatcher.
Virtual time: by default, time-related suspending functions such as delay complete immediately through virtual-time advancement, so tests do not need to wait in real time.
Dispatcher control: you can inject and control TestDispatchers, such as StandardTestDispatcher and UnconfinedTestDispatcher, to manage coroutine execution order and time.

2. TestCoroutineScheduler

Provides more precise virtual-time control, such as advanceTimeBy and runCurrent.

3. Testing Flow

Collect Flow directly inside runTest.
Use the third-party library Turbine, app.cash.turbine:turbine, for a simpler and more powerful Flow testing API, such as flow.test { awaitItem(), expectNoEvents(), awaitComplete(), ... }.

4. Dependency injection

Best practice is to inject CoroutineDispatcher into classes such as ViewModel and Repository, rather than using Dispatchers.IO directly, so tests can easily replace it with a TestDispatcher.

10. Coroutines vs. RxJava vs. Threads

Coroutines vs. Threads: coroutines are lighter, easier to manage, avoid leaks through structured concurrency, and produce simpler code.
Coroutines vs. RxJava:
- Similarities: both handle asynchronous data streams.
- Differences:
  - Paradigm: coroutines are based on suspend functions and code closer to synchronous style. RxJava is based on the observer pattern and chained operators.
  - Simplicity: coroutines usually require less boilerplate.
  - Structured concurrency: coroutines have stronger built-in support.
  - Operators: RxJava has an extremely rich operator ecosystem. Flow’s operators are also continuing to improve.
  - Learning curve: coroutines are generally considered easier to learn.
- Interoperability: the kotlinx-coroutines-rx3 or rx2 libraries provide conversion methods between the two, such as Flow.asObservable() and Observable.asFlow().

For new projects, especially Kotlin-based projects, Coroutines plus Flow are usually the preferred choice. For projects that already contain a large amount of RxJava code, gradual migration or mixed use is reasonable.

11. Common Pitfalls and Best Practices

Blocking the wrong dispatcher: running blocking IO on Dispatchers.Default, or long CPU-intensive computation on Dispatchers.IO.
Overusing GlobalScope: creates potential leaks and makes testing difficult. Prefer viewModelScope, lifecycleScope, or a custom Scope.
Forgetting cancellation checks: not checking isActive in long loops without suspending function calls.
Incorrect exception handling: assuming CoroutineExceptionHandler can prevent cancellation; failing to catch async exceptions through await; leaving child coroutine exceptions unhandled under supervisorScope.
Incorrect SharedFlow configuration: misunderstanding replay, buffer, or overflow behavior, causing event loss or behavior that does not match expectations, especially for single-shot events.
Forgetting awaitClose in callbackFlow/channelFlow: causes callback or listener leaks.
Overusing Dispatchers.Unconfined: makes thread behavior difficult to predict.
Hardcoding Dispatchers: hurts testability. Inject Dispatchers through DI.
Collecting Flow in the UI layer without collectAsStateWithLifecycle: may cause unnecessary background resource consumption.
Choosing StateFlow and SharedFlow poorly: using StateFlow for event streams that require guaranteed delivery, which may lose events, or using SharedFlow for state where only the latest value matters, which is less efficient and may replay old values.

12. Conclusion: A Modern Tool for Concurrent Programming

Kotlin Coroutines and Flow provide Android developers with a powerful, modern, and elegant async programming solution. Through lightweight coroutines, structured concurrency, flexible dispatchers, and feature-rich reactive streams with Flow, they greatly simplify the writing, management, and testing of concurrent code and address many pain points of traditional approaches.

Mastering Coroutines and Flow means more than being comfortable with the APIs. It means deeply understanding the nature of suspension and resumption, state-machine transformation, structured-concurrency lifecycle and exception propagation, appropriate dispatcher use and performance impact, Flow’s cold and hot stream model and backpressure strategies, the subtle configuration and use cases of StateFlow and SharedFlow, and the many details of coroutine cancellation and exception handling.

With these advanced principles and practices, you can confidently build Android applications with high performance, high concurrency, and high stability even as business logic and user experience requirements grow more complex. You can reduce async complexity to a manageable model. This is a core skill for modern senior Android engineers.