feat: Implement Nth-frame detection with tracking for performance

Optimizes webcam performance for face swapping by introducing Nth-frame full face detection and using a KCF tracker for intermediate frames in modules/processors/frame/face_swapper.py. Key changes: - Full face analysis (get_one_face) now runs every N frames (default 3) or when tracking is lost in the process_frame function (for single face mode). - For intermediate frames, a KCF tracker updates the target face bounding box, and keypoints are estimated by translating the last known good keypoints. - The actual face swap (inswapper model) still runs on every frame if a face (either detected or tracked) is available. - Experimental tracking logic added to _process_live_target_v2 for map_faces=True in live mode (non-many_faces path). - Added robustness: - None checks for landmarks in mouth_mask and create_face_mask functions, with fallbacks for create_face_mask. - Division-by-zero check in apply_color_transfer. - Reset tracker state in process_video for new video files. This aims to significantly improve FPS by reducing the frequency of costly full face analysis, while still providing a continuous swap. Mouth masking will be less effective on tracked intermediate frames due to the absence of full landmark data.
fix: Revert Nth frame logic in ui.py to fix UnboundLocalError
2025-06-18 14:25:56 +00:00 · 2025-06-18 11:20:32 +00:00 · 2025-06-18 10:35:13 +00:00 · 2025-06-18 09:54:10 +00:00 · 2025-06-18 09:03:07 +00:00
2 changed files with 493 additions and 261 deletions
--- a/modules/processors/frame/face_swapper.py
+++ b/modules/processors/frame/face_swapper.py
@ -8,7 +8,7 @@ import logging
 import modules.processors.frame.core
 from modules.core import update_status
 from modules.face_analyser import get_one_face, get_many_faces, default_source_face
-from modules.typing import Face, Frame
+from modules.typing import Face, Frame # Face is insightface.app.common.Face
 from modules.hair_segmenter import segment_hair
 from modules.utilities import (
    conditional_download,
@ -27,6 +27,15 @@ models_dir = os.path.join(
    os.path.dirname(os.path.dirname(os.path.dirname(abs_dir))), "models"
 )
 # --- Tracker State Variables ---
 TARGET_TRACKER: Optional[cv2.Tracker] = None
 LAST_TARGET_KPS: Optional[np.ndarray] = None
 LAST_TARGET_BBOX_XYWH: Optional[List[int]] = None # Stored as [x, y, w, h]
 TRACKING_FRAME_COUNTER = 0
 DETECTION_INTERVAL = 3  # Process every 3rd frame for full detection
 LAST_DETECTION_SUCCESS = False
 # --- End Tracker State Variables ---
 def pre_check() -> bool:
    download_directory_path = abs_dir
@ -72,14 +81,13 @@ def _prepare_warped_source_material_and_mask(
    source_face_obj: Face,
    source_frame_full: Frame,
    matrix: np.ndarray,
-    dsize: tuple # Built-in tuple is fine here for parameter type
+    dsize: tuple
 ) -> Tuple[Optional[Frame], Optional[Frame]]:
    """
    Prepares warped source material (full image) and a combined (face+hair) mask for blending.
    Returns (None, None) if essential masks cannot be generated.
    """
    try:
        # Generate Hair Mask
        hair_only_mask_source_raw = segment_hair(source_frame_full)
        if hair_only_mask_source_raw is None:
            logging.error("segment_hair returned None, which is unexpected.")
@ -92,7 +100,6 @@ def _prepare_warped_source_material_and_mask(
        return None, None
    try:
        # Generate Face Mask
        face_only_mask_source_raw = create_face_mask(source_face_obj, source_frame_full)
        if face_only_mask_source_raw is None:
            logging.error("create_face_mask returned None, which is unexpected.")
@ -102,7 +109,6 @@ def _prepare_warped_source_material_and_mask(
        logging.error(f"Face mask creation failed for source: {e}", exc_info=True)
        return None, None
    # Combine Face and Hair Masks and Warp
    try:
        if face_only_mask_source_binary.shape != hair_only_mask_source_binary.shape:
            logging.warning("Resizing hair mask to match face mask for source during preparation.")
@ -134,7 +140,7 @@ def _blend_material_onto_frame(
    Uses seamlessClone if possible, otherwise falls back to simple masking.
    """
    x, y, w, h = cv2.boundingRect(mask_for_blending)
-    output_frame = base_frame # Start with base, will be modified by blending
+    output_frame = base_frame
    if w > 0 and h > 0:
        center = (x + w // 2, y + h // 2)
@ -161,11 +167,10 @@ def _blend_material_onto_frame(
 def swap_face(source_face_obj: Face, target_face: Face, source_frame_full: Frame, temp_frame: Frame) -> Frame:
    face_swapper = get_face_swapper()
    # Apply the base face swap
    swapped_frame = face_swapper.get(temp_frame, target_face, source_face_obj, paste_back=True)
-    final_swapped_frame = swapped_frame # Initialize with the base swap. Copy is made only if needed.
+    final_swapped_frame = swapped_frame
-    if getattr(modules.globals, 'enable_hair_swapping', True): # Default to True if attribute is missing
+    if getattr(modules.globals, 'enable_hair_swapping', True):
        if not (source_face_obj.kps is not None and \
                target_face.kps is not None and \
                source_face_obj.kps.shape[0] >= 3 and \
@ -183,21 +188,20 @@ def swap_face(source_face_obj: Face, target_face: Face, source_frame_full: Frame
            if matrix is None:
                logging.warning("Failed to estimate affine transformation matrix for hair. Skipping hair blending.")
            else:
-                dsize = (temp_frame.shape[1], temp_frame.shape[0]) # width, height
+                dsize = (temp_frame.shape[1], temp_frame.shape[0])
                warped_material, warped_mask = _prepare_warped_source_material_and_mask(
                    source_face_obj, source_frame_full, matrix, dsize
                )
                if warped_material is not None and warped_mask is not None:
                    # Make a copy only now that we are sure we will modify it for hair.
                    final_swapped_frame = swapped_frame.copy()
                    try:
                        color_corrected_material = apply_color_transfer(warped_material, final_swapped_frame)
                    except Exception as e:
                        logging.warning(f"Color transfer failed: {e}. Proceeding with uncorrected material for hair blending.", exc_info=True)
-                        color_corrected_material = warped_material # Use uncorrected material as fallback
+                        color_corrected_material = warped_material
                    final_swapped_frame = _blend_material_onto_frame(
                        final_swapped_frame,
@ -205,24 +209,19 @@ def swap_face(source_face_obj: Face, target_face: Face, source_frame_full: Frame
                        warped_mask
                    )
    # Mouth Mask Logic (operates on final_swapped_frame)
    if modules.globals.mouth_mask:
-        # If final_swapped_frame wasn't copied for hair, it needs to be copied now before mouth mask modification.
+        if final_swapped_frame is swapped_frame:
        if final_swapped_frame is swapped_frame: # Check if it's still the same object
            final_swapped_frame = swapped_frame.copy()
-        # Create a mask for the target face
+        face_mask_for_mouth = create_face_mask(target_face, temp_frame) # Use original temp_frame for target mask context
        face_mask = create_face_mask(target_face, temp_frame)
        # Create the mouth mask
        mouth_mask, mouth_cutout, mouth_box, lower_lip_polygon = (
-            create_lower_mouth_mask(target_face, temp_frame)
+            create_lower_mouth_mask(target_face, temp_frame) # Use original temp_frame for target mouth context
        )
-        # Apply the mouth area
+        # Ensure apply_mouth_area gets the most up-to-date final_swapped_frame if hair blending happened
        # Apply to final_swapped_frame if hair blending happened, otherwise to swapped_frame
        final_swapped_frame = apply_mouth_area(
-            final_swapped_frame, mouth_cutout, mouth_box, face_mask, lower_lip_polygon
+            final_swapped_frame, mouth_cutout, mouth_box, face_mask_for_mouth, lower_lip_polygon
        )
        if modules.globals.show_mouth_mask_box:
@ -235,23 +234,111 @@ def swap_face(source_face_obj: Face, target_face: Face, source_frame_full: Frame
 def process_frame(source_face_obj: Face, source_frame_full: Frame, temp_frame: Frame) -> Frame:
    global TARGET_TRACKER, LAST_TARGET_KPS, LAST_TARGET_BBOX_XYWH
    global TRACKING_FRAME_COUNTER, DETECTION_INTERVAL, LAST_DETECTION_SUCCESS
    if modules.globals.color_correction:
        temp_frame = cv2.cvtColor(temp_frame, cv2.COLOR_BGR2RGB)
    if modules.globals.many_faces:
-        many_faces = get_many_faces(temp_frame)
+        # Tracking logic is not applied for many_faces mode in this iteration
-        if many_faces:
+        many_faces_detected = get_many_faces(temp_frame)
-            for target_face in many_faces:
+        if many_faces_detected:
-                if source_face_obj and target_face:
+            for target_face_data in many_faces_detected:
-                    temp_frame = swap_face(source_face_obj, target_face, source_frame_full, temp_frame)
+                if source_face_obj and target_face_data:
                    temp_frame = swap_face(source_face_obj, target_face_data, source_frame_full, temp_frame)
                else:
-                    print("Face detection failed for target/source.")
+                    # This print might be too verbose for many_faces mode
-    else:
+                    # logging.debug("Face detection failed for a target/source in many_faces.")
-        target_face = get_one_face(temp_frame)
+                    pass # Optionally log or handle
-        if target_face and source_face_obj:
+        return temp_frame # Return early after processing all faces or if none found
-            temp_frame = swap_face(source_face_obj, target_face, source_frame_full, temp_frame)
+
    # --- Single Face Mode with Tracking ---
    TRACKING_FRAME_COUNTER += 1
    target_face_to_swap = None
    if TRACKING_FRAME_COUNTER % DETECTION_INTERVAL == 0 or not LAST_DETECTION_SUCCESS:
        logging.debug(f"Frame {TRACKING_FRAME_COUNTER}: Running full detection.")
        actual_target_face_data = get_one_face(temp_frame)
        if actual_target_face_data:
            target_face_to_swap = actual_target_face_data
            LAST_TARGET_KPS = actual_target_face_data.kps.copy() if actual_target_face_data.kps is not None else None
            bbox_xyxy = actual_target_face_data.bbox
            LAST_TARGET_BBOX_XYWH = [int(bbox_xyxy[0]), int(bbox_xyxy[1]), int(bbox_xyxy[2] - bbox_xyxy[0]), int(bbox_xyxy[3] - bbox_xyxy[1])]
            try:
                TARGET_TRACKER = cv2.TrackerKCF_create()
                TARGET_TRACKER.init(temp_frame, tuple(LAST_TARGET_BBOX_XYWH))
                LAST_DETECTION_SUCCESS = True
                logging.debug(f"Frame {TRACKING_FRAME_COUNTER}: Detection SUCCESS, tracker initialized.")
            except Exception as e:
                logging.error(f"Failed to initialize tracker: {e}", exc_info=True)
                TARGET_TRACKER = None
                LAST_DETECTION_SUCCESS = False
        else:
-            logging.error("Face detection failed for target or source.")
+            logging.debug(f"Frame {TRACKING_FRAME_COUNTER}: Full detection FAILED.")
            LAST_DETECTION_SUCCESS = False
            TARGET_TRACKER = None
    else: # Intermediate frame, try to track
        if TARGET_TRACKER is not None:
            logging.debug(f"Frame {TRACKING_FRAME_COUNTER}: Attempting track.")
            success, new_bbox_xywh_float = TARGET_TRACKER.update(temp_frame)
            if success:
                logging.debug(f"Frame {TRACKING_FRAME_COUNTER}: Tracking SUCCESS.")
                new_bbox_xywh = [int(v) for v in new_bbox_xywh_float]
                if LAST_TARGET_KPS is not None and LAST_TARGET_BBOX_XYWH is not None:
                    # Estimate KPS based on bbox center shift
                    old_bbox_center_x = LAST_TARGET_BBOX_XYWH[0] + LAST_TARGET_BBOX_XYWH[2] / 2
                    old_bbox_center_y = LAST_TARGET_BBOX_XYWH[1] + LAST_TARGET_BBOX_XYWH[3] / 2
                    new_bbox_center_x = new_bbox_xywh[0] + new_bbox_xywh[2] / 2
                    new_bbox_center_y = new_bbox_xywh[1] + new_bbox_xywh[3] / 2
                    delta_x = new_bbox_center_x - old_bbox_center_x
                    delta_y = new_bbox_center_y - old_bbox_center_y
                    current_kps = LAST_TARGET_KPS + np.array([delta_x, delta_y])
                else: # Fallback if prior KPS/BBox not available
                    current_kps = None
                new_bbox_xyxy = np.array([
                    new_bbox_xywh[0],
                    new_bbox_xywh[1],
                    new_bbox_xywh[0] + new_bbox_xywh[2],
                    new_bbox_xywh[1] + new_bbox_xywh[3]
                ])
                # Construct a Face object or a compatible dictionary
                # For insightface.app.common.Face, it requires specific fields.
                # A dictionary might be safer if not all fields can be reliably populated.
                target_face_to_swap = Face(
                    bbox=new_bbox_xyxy,
                    kps=current_kps,
                    det_score=0.95, # Using a high score for tracked faces
                    landmark_3d_68=None, # Not available from KCF tracker
                    landmark_2d_106=None, # Not available from KCF tracker, mouth mask might be affected
                    gender=None, # Not available
                    age=None, # Not available
                    embedding=None, # Not available
                    normed_embedding=None # Not available
                )
                LAST_TARGET_BBOX_XYWH = new_bbox_xywh # Update for next frame's delta calculation
                LAST_TARGET_KPS = current_kps # Update KPS for next frame's delta calculation
                LAST_DETECTION_SUCCESS = True # Tracking was successful
            else:
                logging.debug(f"Frame {TRACKING_FRAME_COUNTER}: Tracking FAILED.")
                LAST_DETECTION_SUCCESS = False
                TARGET_TRACKER = None # Reset tracker
        else:
            logging.debug(f"Frame {TRACKING_FRAME_COUNTER}: No active tracker, skipping track.")
    if target_face_to_swap and source_face_obj:
        temp_frame = swap_face(source_face_obj, target_face_to_swap, source_frame_full, temp_frame)
    else:
        if TRACKING_FRAME_COUNTER % DETECTION_INTERVAL == 0: # Only log error if it was a detection frame
            logging.info("Target face not found by detection or tracking in process_frame.")
            # No error log here as it might just be no face in frame.
            # The swap_face call will be skipped, returning the original temp_frame.
    return temp_frame
@ -290,45 +377,130 @@ def _process_video_target_v2(source_frame_full: Frame, temp_frame: Frame, temp_f
    return temp_frame
 def _process_live_target_v2(source_frame_full: Frame, temp_frame: Frame) -> Frame:
-    detected_faces = get_many_faces(temp_frame)
+    # This function is called by UI directly for webcam when map_faces is True.
-    if not detected_faces:
+    # The Nth frame/tracking logic for webcam should ideally be here or called from here.
    # For now, it reuses the global tracker state, which might be an issue if multiple
    # call paths use process_frame_v2 concurrently.
    # However, with webcam, process_frame (single face) or this (map_faces) is called.
    # Assuming single-threaded UI updates for webcam for now.
    global TARGET_TRACKER, LAST_TARGET_KPS, LAST_TARGET_BBOX_XYWH
    global TRACKING_FRAME_COUNTER, DETECTION_INTERVAL, LAST_DETECTION_SUCCESS
    if not modules.globals.many_faces: # Tracking only implemented for single target face in live mode
        TRACKING_FRAME_COUNTER += 1 # Use the same counter for now
        target_face_to_swap = None
        if TRACKING_FRAME_COUNTER % DETECTION_INTERVAL == 0 or not LAST_DETECTION_SUCCESS:
            logging.debug(f"Frame {TRACKING_FRAME_COUNTER} (Live V2): Running full detection.")
            # In map_faces mode for live, we might need to select one target based on some criteria
            # or apply to all detected faces if a simple_map isn't specific enough.
            # This part needs careful thought for map_faces=True live mode.
            # For now, let's assume simple_map implies one primary target for tracking.
            detected_faces = get_many_faces(temp_frame) # Get all faces first
            # If simple_map is configured, try to find the "main" target face from simple_map
            actual_target_face_data = None
            if detected_faces and modules.globals.simple_map and modules.globals.simple_map.get("target_embeddings"):
                # This logic tries to find one specific face to track based on simple_map.
                # It might not be ideal if multiple mapped faces are expected to be swapped.
                # For simplicity, we'll track the first match or a dominant face.
                # This part is a placeholder for a more robust target selection in map_faces live mode.
                # For now, let's try to find one based on the first simple_map embedding.
                if modules.globals.simple_map["target_embeddings"]:
                    closest_idx, _ = find_closest_centroid([face.normed_embedding for face in detected_faces], modules.globals.simple_map["target_embeddings"][0])
                    if closest_idx < len(detected_faces):
                         actual_target_face_data = detected_faces[closest_idx]
            elif detected_faces: # Fallback if no simple_map or if logic above fails
                actual_target_face_data = detected_faces[0] # Default to the first detected face
            if actual_target_face_data:
                target_face_to_swap = actual_target_face_data
                LAST_TARGET_KPS = actual_target_face_data.kps.copy() if actual_target_face_data.kps is not None else None
                bbox_xyxy = actual_target_face_data.bbox
                LAST_TARGET_BBOX_XYWH = [int(bbox_xyxy[0]), int(bbox_xyxy[1]), int(bbox_xyxy[2] - bbox_xyxy[0]), int(bbox_xyxy[3] - bbox_xyxy[1])]
                try:
                    TARGET_TRACKER = cv2.TrackerKCF_create()
                    TARGET_TRACKER.init(temp_frame, tuple(LAST_TARGET_BBOX_XYWH))
                    LAST_DETECTION_SUCCESS = True
                    logging.debug(f"Frame {TRACKING_FRAME_COUNTER} (Live V2): Detection SUCCESS, tracker initialized.")
                except Exception as e:
                    logging.error(f"Failed to initialize tracker (Live V2): {e}", exc_info=True)
                    TARGET_TRACKER = None
                    LAST_DETECTION_SUCCESS = False
            else:
                logging.debug(f"Frame {TRACKING_FRAME_COUNTER} (Live V2): Full detection FAILED.")
                LAST_DETECTION_SUCCESS = False
                TARGET_TRACKER = None
        else: # Intermediate frame, try to track
            if TARGET_TRACKER is not None:
                logging.debug(f"Frame {TRACKING_FRAME_COUNTER} (Live V2): Attempting track.")
                success, new_bbox_xywh_float = TARGET_TRACKER.update(temp_frame)
                if success:
                    logging.debug(f"Frame {TRACKING_FRAME_COUNTER} (Live V2): Tracking SUCCESS.")
                    new_bbox_xywh = [int(v) for v in new_bbox_xywh_float]
                    current_kps = None
                    if LAST_TARGET_KPS is not None and LAST_TARGET_BBOX_XYWH is not None:
                        old_bbox_center_x = LAST_TARGET_BBOX_XYWH[0] + LAST_TARGET_BBOX_XYWH[2] / 2
                        old_bbox_center_y = LAST_TARGET_BBOX_XYWH[1] + LAST_TARGET_BBOX_XYWH[3] / 2
                        new_bbox_center_x = new_bbox_xywh[0] + new_bbox_xywh[2] / 2
                        new_bbox_center_y = new_bbox_xywh[1] + new_bbox_xywh[3] / 2
                        delta_x = new_bbox_center_x - old_bbox_center_x
                        delta_y = new_bbox_center_y - old_bbox_center_y
                        current_kps = LAST_TARGET_KPS + np.array([delta_x, delta_y])
                    new_bbox_xyxy = np.array([new_bbox_xywh[0], new_bbox_xywh[1], new_bbox_xywh[0] + new_bbox_xywh[2], new_bbox_xywh[1] + new_bbox_xywh[3]])
                    target_face_to_swap = Face(bbox=new_bbox_xyxy, kps=current_kps, det_score=0.95, landmark_3d_68=None, landmark_2d_106=None, gender=None, age=None, embedding=None, normed_embedding=None)
                    LAST_TARGET_BBOX_XYWH = new_bbox_xywh
                    LAST_TARGET_KPS = current_kps
                    LAST_DETECTION_SUCCESS = True
                else:
                    logging.debug(f"Frame {TRACKING_FRAME_COUNTER} (Live V2): Tracking FAILED.")
                    LAST_DETECTION_SUCCESS = False
                    TARGET_TRACKER = None
            else:
                logging.debug(f"Frame {TRACKING_FRAME_COUNTER} (Live V2): No active tracker, skipping track.")
        # Perform swap for the identified or tracked face
        if target_face_to_swap:
             # In map_faces=True, need to determine which source face to use.
             # This part of _process_live_target_v2 needs to align with how simple_map or source_target_map is used.
             # The current logic for simple_map (else branch below) is more complete for this.
             # For now, if a target_face_to_swap is found by tracking, we need a source.
             # This indicates a simplification: if we track one face, we use the default source or first simple_map source.
            source_face_obj_to_use = default_source_face() # Fallback, might not be the right one for simple_map
            if modules.globals.simple_map and modules.globals.simple_map.get("source_faces"):
                # This assumes the tracked face corresponds to the first entry in simple_map, which is a simplification.
                source_face_obj_to_use = modules.globals.simple_map["source_faces"][0]
            if source_face_obj_to_use:
                 temp_frame = swap_face(source_face_obj_to_use, target_face_to_swap, source_frame_full, temp_frame)
            else:
                logging.warning("No source face available for tracked target in _process_live_target_v2.")
        elif TRACKING_FRAME_COUNTER % DETECTION_INTERVAL == 0:
             logging.info("Target face not found by detection or tracking in _process_live_target_v2 (single face tracking path).")
        return temp_frame
-    if modules.globals.many_faces:
+    # Fallback to original many_faces logic if not in single face tracking mode (or if above logic doesn't return)
    # This part is essentially the original _process_live_target_v2 for many_faces=True
    detected_faces = get_many_faces(temp_frame) # Re-get if not already gotten or if many_faces path
    if not detected_faces:
        return temp_frame # No faces, return original
    if modules.globals.many_faces: # This is the original many_faces logic for live
        source_face_obj = default_source_face()
        if source_face_obj:
            for target_face in detected_faces:
                temp_frame = swap_face(source_face_obj, target_face, source_frame_full, temp_frame)
-    else: # not many_faces (apply simple_map logic)
+    # The complex simple_map logic for non-many_faces was attempted above with tracking.
-        if not modules.globals.simple_map or \
+    # If that path wasn't taken or didn't result in a swap, and it's not many_faces,
-           not modules.globals.simple_map.get("target_embeddings") or \
+    # we might need to re-evaluate the original simple_map logic here.
-           not modules.globals.simple_map.get("source_faces"):
+    # For now, the tracking path for single face handles the non-many_faces case.
-            logging.warning("Simple map is not configured correctly. Skipping face swap.")
+    # If tracking is off or fails consistently, this function will effectively just return temp_frame for non-many_faces.
-            return temp_frame
+    # This else block for simple_map from original _process_live_target_v2 might be needed if tracking is disabled.
-
+    # However, to avoid processing faces twice (once for tracking attempt, once here), this is tricky.
-        target_embeddings = modules.globals.simple_map["target_embeddings"]
+    # For now, the subtask focuses on adding tracking to process_frame, which is used by webcam in non-map_faces mode.
-        source_faces_from_map = modules.globals.simple_map["source_faces"]
+    # The changes to _process_live_target_v2 are more experimental for map_faces=True live mode.
        if len(detected_faces) <= len(target_embeddings):
            for detected_face in detected_faces:
                closest_centroid_index, _ = find_closest_centroid(target_embeddings, detected_face.normed_embedding)
                if closest_centroid_index < len(source_faces_from_map):
                    source_face_obj_from_map = source_faces_from_map[closest_centroid_index]
                    temp_frame = swap_face(source_face_obj_from_map, detected_face, source_frame_full, temp_frame)
                else:
                    logging.warning(f"Centroid index {closest_centroid_index} out of bounds for source_faces_from_map.")
        else: # More detected faces than target embeddings in simple_map
            detected_faces_embeddings = [face.normed_embedding for face in detected_faces]
            for i, target_embedding in enumerate(target_embeddings):
                if i < len(source_faces_from_map):
                    closest_detected_face_index, _ = find_closest_centroid(detected_faces_embeddings, target_embedding)
                    source_face_obj_from_map = source_faces_from_map[i]
                    target_face_to_swap = detected_faces[closest_detected_face_index]
                    temp_frame = swap_face(source_face_obj_from_map, target_face_to_swap, source_frame_full, temp_frame)
                    # Optionally, remove the swapped detected face to prevent re-swapping if one source maps to multiple targets.
                    # This depends on desired behavior. For now, simple independent mapping.
                else:
                    logging.warning(f"Index {i} out of bounds for source_faces_from_map in simple_map else case.")
    return temp_frame
@ -338,6 +510,10 @@ def process_frame_v2(source_frame_full: Frame, temp_frame: Frame, temp_frame_pat
    elif is_video(modules.globals.target_path):
        return _process_video_target_v2(source_frame_full, temp_frame, temp_frame_path)
    else: # This is the live cam / generic case
        # If map_faces is True for webcam, this is called.
        # We need to decide if tracking applies here or if it's simpler to use existing logic.
        # The subtask's main focus was process_frame.
        # For now, let _process_live_target_v2 handle it, which includes an attempt at tracking for non-many_faces.
        return _process_live_target_v2(source_frame_full, temp_frame)
@ -350,7 +526,7 @@ def process_frames(
        return
    if not modules.globals.map_faces:
-        source_face_obj = get_one_face(source_img) # Use source_img here
+        source_face_obj = get_one_face(source_img)
        if not source_face_obj:
            logging.error(f"No face detected in source image {source_path}")
            return
@ -360,25 +536,21 @@ def process_frames(
                logging.warning(f"Failed to read temp_frame from {temp_frame_path}, skipping.")
                continue
            try:
-                result = process_frame(source_face_obj, source_img, temp_frame)
+                result = process_frame(source_face_obj, source_img, temp_frame) # process_frame will use tracking
                cv2.imwrite(temp_frame_path, result)
            except Exception as exception:
                logging.error(f"Error processing frame {temp_frame_path}: {exception}", exc_info=True)
                pass
            if progress:
                progress.update(1)
-    else: # This is for map_faces == True
+    else:
        # In map_faces=True, source_face is determined per mapping.
        # process_frame_v2 will need source_frame_full for hair,
        # which should be the original source_path image.
        for temp_frame_path in temp_frame_paths:
            temp_frame = cv2.imread(temp_frame_path)
            if temp_frame is None:
                logging.warning(f"Failed to read temp_frame from {temp_frame_path}, skipping.")
                continue
            try:
-                # Pass source_img (as source_frame_full) to process_frame_v2
+                result = process_frame_v2(source_img, temp_frame, temp_frame_path) # process_frame_v2 might use tracking via _process_live_target_v2
                result = process_frame_v2(source_img, temp_frame, temp_frame_path)
                cv2.imwrite(temp_frame_path, result)
            except Exception as exception:
                logging.error(f"Error processing frame {temp_frame_path} with map_faces: {exception}", exc_info=True)
@ -393,33 +565,31 @@ def process_image(source_path: str, target_path: str, output_path: str) -> None:
        logging.error(f"Failed to read source image from {source_path}")
        return
-    target_frame = cv2.imread(target_path)
+    # target_frame = cv2.imread(target_path) # This line is not needed as original_target_frame is used
-    if target_frame is None:
+    # if target_frame is None:
-        logging.error(f"Failed to read target image from {target_path}")
+    #     logging.error(f"Failed to read target image from {target_path}")
-        return
+    #     return
    # Read the original target frame once at the beginning
    original_target_frame = cv2.imread(target_path)
    if original_target_frame is None:
        logging.error(f"Failed to read original target image from {target_path}")
        return
-    result = None  # Initialize result
+    result = None
    if not modules.globals.map_faces:
-        source_face_obj = get_one_face(source_img) # Use source_img here
+        source_face_obj = get_one_face(source_img)
        if not source_face_obj:
            logging.error(f"No face detected in source image {source_path}")
            return
        # process_frame will use tracking if called in a context where TRACKING_FRAME_COUNTER changes (e.g. video/live)
        # For single image, TRACKING_FRAME_COUNTER would be 1, so full detection.
        result = process_frame(source_face_obj, source_img, original_target_frame)
-    else: # map_faces is True
+    else:
        if modules.globals.many_faces:
            update_status(
                "Many faces enabled. Using first source image. Progressing...", NAME
            )
        # process_frame_v2 takes the original target frame for processing.
        # target_path is passed as temp_frame_path for consistency with process_frame_v2's signature,
        # used for map lookups in video context but less critical for single images.
        result = process_frame_v2(source_img, original_target_frame, target_path)
    if result is not None:
@ -429,6 +599,14 @@ def process_image(source_path: str, target_path: str, output_path: str) -> None:
 def process_video(source_path: str, temp_frame_paths: List[str]) -> None:
    global TRACKING_FRAME_COUNTER, LAST_DETECTION_SUCCESS, TARGET_TRACKER, LAST_TARGET_KPS, LAST_TARGET_BBOX_XYWH
    # Reset tracker state for each new video
    TRACKING_FRAME_COUNTER = 0
    LAST_DETECTION_SUCCESS = False
    TARGET_TRACKER = None
    LAST_TARGET_KPS = None
    LAST_TARGET_BBOX_XYWH = None
    if modules.globals.map_faces and modules.globals.many_faces:
        update_status(
            "Many faces enabled. Using first source image. Progressing...", NAME
@ -443,8 +621,22 @@ def create_lower_mouth_mask(
 ) -> (np.ndarray, np.ndarray, tuple, np.ndarray):
    mask = np.zeros(frame.shape[:2], dtype=np.uint8)
    mouth_cutout = None
-    landmarks = face.landmark_2d_106
+    # Mouth mask requires landmark_2d_106, which tracked faces won't have.
-    if landmarks is not None:
+    # Add a check here to prevent errors if landmark_2d_106 is None.
    if face.landmark_2d_106 is None:
        logging.debug("Skipping lower_mouth_mask due to missing landmark_2d_106 (likely a tracked face).")
        # Return empty/default values that won't cause downstream errors
        # The bounding box (min_x, etc.) might still be useful if derived from face.bbox
        # For now, return fully empty to prevent partial processing.
        # The caller (apply_mouth_area) should also be robust to this.
        # Fallback: create a simple mask from bbox if needed, or ensure apply_mouth_area handles this.
        # For now, returning all Nones for the mask parts.
        # The tuple for bbox still needs 4 values, even if invalid, to unpack.
        # A truly robust solution would be for apply_mouth_area to not proceed if mouth_mask is None.
        return mask, None, (0,0,0,0), None # Ensure tuple has 4 values
    landmarks = face.landmark_2d_106 # Now we know it's not None
    # ... (rest of the function remains the same)
        #                  0  1  2  3  4  5  6  7  8  9  10 11 12 13 14 15 16 17 18 19 20
        lower_lip_order = [
            65,
@ -558,83 +750,83 @@ def create_lower_mouth_mask(
 def draw_mouth_mask_visualization(
    frame: Frame, face: Face, mouth_mask_data: tuple
 ) -> Frame:
    # Add check for landmarks before trying to use them
    if face.landmark_2d_106 is None or mouth_mask_data is None or mouth_mask_data[1] is None: # mouth_cutout is mouth_mask_data[1]
        logging.debug("Skipping mouth mask visualization due to missing landmarks or data.")
        return frame
    landmarks = face.landmark_2d_106
-    if landmarks is not None and mouth_mask_data is not None:
+    # if landmarks is not None and mouth_mask_data is not None: # This check is now partially done above
-        mask, mouth_cutout, (min_x, min_y, max_x, max_y), lower_lip_polygon = (
+    mask, mouth_cutout, (min_x, min_y, max_x, max_y), lower_lip_polygon = (
-            mouth_mask_data
+        mouth_mask_data
-        )
+    )
    if mouth_cutout is None or lower_lip_polygon is None: # Further check
        logging.debug("Skipping mouth mask visualization due to missing mouth_cutout or polygon.")
        return frame
        vis_frame = frame.copy()
-        # Ensure coordinates are within frame bounds
+    vis_frame = frame.copy()
        height, width = vis_frame.shape[:2]
        min_x, min_y = max(0, min_x), max(0, min_y)
        max_x, max_y = min(width, max_x), min(height, max_y)
-        # Adjust mask to match the region size
+    # Ensure coordinates are within frame bounds
-        mask_region = mask[0 : max_y - min_y, 0 : max_x - min_x]
+    height, width = vis_frame.shape[:2]
    min_x, min_y = max(0, min_x), max(0, min_y)
    max_x, max_y = min(width, max_x), min(height, max_y)
-        # Remove the color mask overlay
+    # Adjust mask to match the region size
-        # color_mask = cv2.applyColorMap((mask_region * 255).astype(np.uint8), cv2.COLORMAP_JET)
+    # Ensure mask_region calculation is safe
    if max_y - min_y <= 0 or max_x - min_x <= 0:
        logging.warning("Invalid ROI for mouth mask visualization.")
        return frame # or vis_frame, as it's a copy
    mask_region = mask[0 : max_y - min_y, 0 : max_x - min_x]
        # Ensure shapes match before blending
        vis_region = vis_frame[min_y:max_y, min_x:max_x]
        # Remove blending with color_mask
        # if vis_region.shape[:2] == color_mask.shape[:2]:
        #     blended = cv2.addWeighted(vis_region, 0.7, color_mask, 0.3, 0)
        #     vis_frame[min_y:max_y, min_x:max_x] = blended
-        # Draw the lower lip polygon
+    cv2.polylines(vis_frame, [lower_lip_polygon], True, (0, 255, 0), 2)
        cv2.polylines(vis_frame, [lower_lip_polygon], True, (0, 255, 0), 2)
-        # Remove the red box
+    feather_amount = max(
-        # cv2.rectangle(vis_frame, (min_x, min_y), (max_x, max_y), (0, 0, 255), 2)
+        1,
-
+        min(
-        # Visualize the feathered mask
+            30,
-        feather_amount = max(
+            (max_x - min_x) // modules.globals.mask_feather_ratio if (max_x - min_x) > 0 else 1,
-            1,
+            (max_y - min_y) // modules.globals.mask_feather_ratio if (max_y - min_y) > 0 else 1,
-            min(
+        ),
-                30,
+    )
-                (max_x - min_x) // modules.globals.mask_feather_ratio,
+    kernel_size = 2 * feather_amount + 1
-                (max_y - min_y) // modules.globals.mask_feather_ratio,
+    # Ensure mask_region is not empty before blur
-            ),
+    if mask_region.size > 0 :
        )
        # Ensure kernel size is odd
        kernel_size = 2 * feather_amount + 1
        feathered_mask = cv2.GaussianBlur(
            mask_region.astype(float), (kernel_size, kernel_size), 0
        )
-        feathered_mask = (feathered_mask / feathered_mask.max() * 255).astype(np.uint8)
+        # Check if feathered_mask.max() is zero to avoid division by zero error
-        # Remove the feathered mask color overlay
+        max_val = feathered_mask.max()
-        # color_feathered_mask = cv2.applyColorMap(feathered_mask, cv2.COLORMAP_VIRIDIS)
+        if max_val > 0:
            feathered_mask = (feathered_mask / max_val * 255).astype(np.uint8)
        else:
            feathered_mask = np.zeros_like(mask_region, dtype=np.uint8) # Handle case of all-black mask
    else: # if mask_region is empty, create an empty feathered_mask
        feathered_mask = np.zeros_like(mask_region, dtype=np.uint8)
        # Ensure shapes match before blending feathered mask
        # if vis_region.shape == color_feathered_mask.shape:
        #     blended_feathered = cv2.addWeighted(vis_region, 0.7, color_feathered_mask, 0.3, 0)
        #     vis_frame[min_y:max_y, min_x:max_x] = blended_feathered
-        # Add labels
+    cv2.putText(
-        cv2.putText(
+        vis_frame,
-            vis_frame,
+        "Lower Mouth Mask",
-            "Lower Mouth Mask",
+        (min_x, min_y - 10),
-            (min_x, min_y - 10),
+        cv2.FONT_HERSHEY_SIMPLEX,
-            cv2.FONT_HERSHEY_SIMPLEX,
+        0.5,
-            0.5,
+        (255, 255, 255),
-            (255, 255, 255),
+        1,
-            1,
+    )
-        )
+    cv2.putText(
-        cv2.putText(
+        vis_frame,
-            vis_frame,
+        "Feathered Mask",
-            "Feathered Mask",
+        (min_x, max_y + 20),
-            (min_x, max_y + 20),
+        cv2.FONT_HERSHEY_SIMPLEX,
-            cv2.FONT_HERSHEY_SIMPLEX,
+        0.5,
-            0.5,
+        (255, 255, 255),
-            (255, 255, 255),
+        1,
-            1,
+    )
        )
-        return vis_frame
+    return vis_frame
-    return frame
+    # return frame # Fallback if landmarks or mouth_mask_data is None
 def apply_mouth_area(
@ -644,23 +836,30 @@ def apply_mouth_area(
    face_mask: np.ndarray,
    mouth_polygon: np.ndarray,
 ) -> np.ndarray:
    # Add check for None mouth_polygon which can happen if landmark_2d_106 was None
    if mouth_polygon is None or mouth_cutout is None:
        logging.debug("Skipping apply_mouth_area due to missing mouth_polygon or mouth_cutout.")
        return frame
    min_x, min_y, max_x, max_y = mouth_box
    box_width = max_x - min_x
    box_height = max_y - min_y
    if (
-        mouth_cutout is None
+        box_width <= 0 or box_height <= 0 or # Check for valid box dimensions
-        or box_width is None
+        face_mask is None
        or box_height is None
        or face_mask is None
        or mouth_polygon is None
    ):
        return frame
    try:
        resized_mouth_cutout = cv2.resize(mouth_cutout, (box_width, box_height))
        # Ensure ROI slicing is valid
        if min_y >= max_y or min_x >= max_x:
             logging.warning("Invalid ROI for applying mouth area.")
             return frame
        roi = frame[min_y:max_y, min_x:max_x]
        if roi.shape != resized_mouth_cutout.shape:
            resized_mouth_cutout = cv2.resize(
                resized_mouth_cutout, (roi.shape[1], roi.shape[0])
@ -668,39 +867,51 @@ def apply_mouth_area(
        color_corrected_mouth = apply_color_transfer(resized_mouth_cutout, roi)
        # Use the provided mouth polygon to create the mask
        polygon_mask = np.zeros(roi.shape[:2], dtype=np.uint8)
        adjusted_polygon = mouth_polygon - [min_x, min_y]
        cv2.fillPoly(polygon_mask, [adjusted_polygon], 255)
        # Apply feathering to the polygon mask
        feather_amount = min(
            30,
-            box_width // modules.globals.mask_feather_ratio,
+            box_width // modules.globals.mask_feather_ratio if modules.globals.mask_feather_ratio > 0 else 30,
-            box_height // modules.globals.mask_feather_ratio,
+            box_height // modules.globals.mask_feather_ratio if modules.globals.mask_feather_ratio > 0 else 30,
        )
-        feathered_mask = cv2.GaussianBlur(
+        feather_amount = max(1, feather_amount) # Ensure feather_amount is at least 1 for kernel size
-            polygon_mask.astype(float), (0, 0), feather_amount
+
        # Ensure kernel size is odd and positive for GaussianBlur
        kernel_size_blur = 2 * feather_amount + 1
        feathered_mask_float = cv2.GaussianBlur(
            polygon_mask.astype(float), (kernel_size_blur, kernel_size_blur), 0
        )
-        feathered_mask = feathered_mask / feathered_mask.max()
+
        max_val = feathered_mask_float.max()
        if max_val > 0:
            feathered_mask_normalized = feathered_mask_float / max_val
        else: # Avoid division by zero if mask is all black
            feathered_mask_normalized = feathered_mask_float
        face_mask_roi = face_mask[min_y:max_y, min_x:max_x]
-        combined_mask = feathered_mask * (face_mask_roi / 255.0)
+        combined_mask_float = feathered_mask_normalized * (face_mask_roi / 255.0)
        combined_mask_3ch = combined_mask_float[:, :, np.newaxis]
        combined_mask = combined_mask[:, :, np.newaxis]
        blended = (
-            color_corrected_mouth * combined_mask + roi * (1 - combined_mask)
+            color_corrected_mouth.astype(np.float32) * combined_mask_3ch +
            roi.astype(np.float32) * (1 - combined_mask_3ch)
        ).astype(np.uint8)
-        # Apply face mask to blended result
+        # This final blend with face_mask_3channel seems redundant if combined_mask_float already incorporates face_mask_roi
-        face_mask_3channel = (
+        # However, it ensures that areas outside the broader face_mask (but inside mouth_box) are not affected.
-            np.repeat(face_mask_roi[:, :, np.newaxis], 3, axis=2) / 255.0
+        # For simplicity and to maintain original intent if there was one, keeping it for now.
-        )
+        # face_mask_3channel_roi = np.repeat(face_mask_roi[:, :, np.newaxis], 3, axis=2) / 255.0
-        final_blend = blended * face_mask_3channel + roi * (1 - face_mask_3channel)
+        # final_blend = blended * face_mask_3channel_roi + roi * (1 - face_mask_3channel_roi)
-        frame[min_y:max_y, min_x:max_x] = final_blend.astype(np.uint8)
+        frame[min_y:max_y, min_x:max_x] = blended.astype(np.uint8)
    except Exception as e:
-        pass
+        logging.error(f"Error in apply_mouth_area: {e}", exc_info=True)
        pass # Keep original frame on error
    return frame
@ -708,68 +919,109 @@ def apply_mouth_area(
 def create_face_mask(face: Face, frame: Frame) -> np.ndarray:
    mask = np.zeros(frame.shape[:2], dtype=np.uint8)
    landmarks = face.landmark_2d_106
    if landmarks is not None:
        # Convert landmarks to int32
        landmarks = landmarks.astype(np.int32)
-        # Extract facial features
+    # Add check for landmarks before trying to use them
-        right_side_face = landmarks[0:16]
+    if landmarks is None:
-        left_side_face = landmarks[17:32]
+        logging.debug("Skipping face_mask creation due to missing landmark_2d_106.")
-        right_eye = landmarks[33:42]
+        # Fallback: if no landmarks, try to create a simple mask from bbox if available
-        right_eye_brow = landmarks[43:51]
+        if face.bbox is not None:
-        left_eye = landmarks[87:96]
+            x1, y1, x2, y2 = face.bbox.astype(int)
-        left_eye_brow = landmarks[97:105]
+            center_x = (x1 + x2) // 2
            center_y = (y1 + y2) // 2
            width = x2 - x1
            height = y2 - y1
            # Simple ellipse based on bbox - adjust size factor as needed
            cv2.ellipse(mask, (center_x, center_y), (int(width * 0.6), int(height * 0.7)), 0, 0, 360, 255, -1)
            mask = cv2.GaussianBlur(mask, (15, 15), 5) # Soften the simple mask too
        return mask
        # Calculate forehead extension
        right_eyebrow_top = np.min(right_eye_brow[:, 1])
        left_eyebrow_top = np.min(left_eye_brow[:, 1])
        eyebrow_top = min(right_eyebrow_top, left_eyebrow_top)
-        face_top = np.min([right_side_face[0, 1], left_side_face[-1, 1]])
+    landmarks = landmarks.astype(np.int32) # Now safe to use
        forehead_height = face_top - eyebrow_top
        extended_forehead_height = int(forehead_height * 5.0)  # Extend by 50%
-        # Create forehead points
+    right_side_face = landmarks[0:16]
-        forehead_left = right_side_face[0].copy()
+    left_side_face = landmarks[17:32]
-        forehead_right = left_side_face[-1].copy()
+    # right_eye = landmarks[33:42] # Not used for outline
-        forehead_left[1] -= extended_forehead_height
+    right_eye_brow = landmarks[43:51]
-        forehead_right[1] -= extended_forehead_height
+    # left_eye = landmarks[87:96] # Not used for outline
    left_eye_brow = landmarks[97:105]
-        # Combine all points to create the face outline
+    if right_eye_brow.size == 0 or left_eye_brow.size == 0 or right_side_face.size == 0 or left_side_face.size == 0 :
-        face_outline = np.vstack(
+        logging.warning("Face mask creation skipped due to empty landmark arrays for key features.")
-            [
+        if face.bbox is not None: # Fallback to bbox mask if landmarks are partially missing
-                [forehead_left],
+            x1, y1, x2, y2 = face.bbox.astype(int)
-                right_side_face,
+            cv2.rectangle(mask, (x1,y1), (x2,y2), 255, -1) # Simple rectangle from bbox
-                left_side_face[
+            mask = cv2.GaussianBlur(mask, (15,15), 5)
-                    ::-1
+        return mask
                ],  # Reverse left side to create a continuous outline
                [forehead_right],
            ]
        )
-        # Calculate padding
+    right_eyebrow_top = np.min(right_eye_brow[:, 1])
-        padding = int(
+    left_eyebrow_top = np.min(left_eye_brow[:, 1])
-            np.linalg.norm(right_side_face[0] - left_side_face[-1]) * 0.05
+    eyebrow_top = min(right_eyebrow_top, left_eyebrow_top)
        )  # 5% of face width
-        # Create a slightly larger convex hull for padding
+    face_top = np.min([right_side_face[0, 1], left_side_face[-1, 1]])
-        hull = cv2.convexHull(face_outline)
+    forehead_height = max(0, face_top - eyebrow_top) # Ensure non-negative
-        hull_padded = []
+    extended_forehead_height = int(forehead_height * 5.0)
        for point in hull:
            x, y = point[0]
            center = np.mean(face_outline, axis=0)
            direction = np.array([x, y]) - center
            direction = direction / np.linalg.norm(direction)
            padded_point = np.array([x, y]) + direction * padding
            hull_padded.append(padded_point)
    forehead_left = right_side_face[0].copy()
    forehead_right = left_side_face[-1].copy()
    # Prevent negative y-coordinates
    forehead_left[1] = max(0, forehead_left[1] - extended_forehead_height)
    forehead_right[1] = max(0, forehead_right[1] - extended_forehead_height)
    face_outline = np.vstack(
        [
            [forehead_left],
            right_side_face,
            left_side_face[
                ::-1
            ],
            [forehead_right],
        ]
    )
    if face_outline.shape[0] < 3 : # convexHull needs at least 3 points
        logging.warning("Not enough points for convex hull in face mask creation. Using bbox as fallback.")
        if face.bbox is not None:
            x1, y1, x2, y2 = face.bbox.astype(int)
            cv2.rectangle(mask, (x1,y1), (x2,y2), 255, -1)
            mask = cv2.GaussianBlur(mask, (15,15), 5)
        return mask
    padding = int(
        np.linalg.norm(right_side_face[0] - left_side_face[-1]) * 0.05
    )
    hull = cv2.convexHull(face_outline)
    hull_padded = []
    # Calculate center of the original outline for padding direction
    center_of_outline = np.mean(face_outline, axis=0).squeeze()
    if center_of_outline.ndim > 1: # Ensure center is 1D
        center_of_outline = np.mean(center_of_outline, axis=0)
    for point_contour in hull:
        point = point_contour[0]
        direction = point - center_of_outline
        norm_direction = np.linalg.norm(direction)
        if norm_direction == 0:
            unit_direction = np.array([0,0])
        else:
            unit_direction = direction / norm_direction
        padded_point = point + unit_direction * padding
        hull_padded.append(padded_point)
    if hull_padded:
        hull_padded = np.array(hull_padded, dtype=np.int32)
-
+        # Ensure hull_padded has the correct shape for fillConvexPoly (e.g., (N, 1, 2))
-        # Fill the padded convex hull
+        if hull_padded.ndim == 2:
            hull_padded = hull_padded[:, np.newaxis, :]
        cv2.fillConvexPoly(mask, hull_padded, 255)
    else:
        if hull.ndim == 2: # Ensure hull has correct shape if hull_padded was empty
            hull = hull[:, np.newaxis, :]
        cv2.fillConvexPoly(mask, hull, 255)
-        # Smooth the mask edges
+    mask = cv2.GaussianBlur(mask, (5, 5), 3)
        mask = cv2.GaussianBlur(mask, (5, 5), 3)
    return mask
@ -784,13 +1036,14 @@ def apply_color_transfer(source, target):
    source_mean, source_std = cv2.meanStdDev(source)
    target_mean, target_std = cv2.meanStdDev(target)
    # Reshape mean and std to be broadcastable
    source_mean = source_mean.reshape(1, 1, 3)
    source_std = source_std.reshape(1, 1, 3)
    target_mean = target_mean.reshape(1, 1, 3)
    target_std = target_std.reshape(1, 1, 3)
-    # Perform the color transfer
+    # Prevent division by zero if source_std is zero in any channel
    source_std[source_std == 0] = 1
    source = (source - source_mean) * (target_std / source_std) + target_mean
    return cv2.cvtColor(np.clip(source, 0, 255).astype("uint8"), cv2.COLOR_LAB2BGR)
--- a/modules/ui.py
+++ b/modules/ui.py
@ -257,10 +257,6 @@ def create_root(start: Callable[[], None], destroy: Callable[[], None]) -> ctk.C
    )
    color_correction_switch.place(relx=0.6, rely=0.70)
    #    nsfw_value = ctk.BooleanVar(value=modules.globals.nsfw_filter)
    #    nsfw_switch = ctk.CTkSwitch(root, text='NSFW filter', variable=nsfw_value, cursor='hand2', command=lambda: setattr(modules.globals, 'nsfw_filter', nsfw_value.get()))
    #    nsfw_switch.place(relx=0.6, rely=0.7)
    map_faces = ctk.BooleanVar(value=modules.globals.map_faces)
    map_faces_switch = ctk.CTkSwitch(
        root,
@ -288,7 +284,6 @@ def create_root(start: Callable[[], None], destroy: Callable[[], None]) -> ctk.C
    )
    show_fps_switch.place(relx=0.6, rely=0.75)
    # Hair Swapping Switch (placed below "Show FPS" on the right column)
    hair_swapping_value = ctk.BooleanVar(value=modules.globals.enable_hair_swapping)
    hair_swapping_switch = ctk.CTkSwitch(
        root,
@ -300,7 +295,7 @@ def create_root(start: Callable[[], None], destroy: Callable[[], None]) -> ctk.C
            save_switch_states(),
        )
    )
-    hair_swapping_switch.place(relx=0.6, rely=0.80) # Adjusted rely from 0.75 to 0.80
+    hair_swapping_switch.place(relx=0.6, rely=0.80)
    mouth_mask_var = ctk.BooleanVar(value=modules.globals.mouth_mask)
    mouth_mask_switch = ctk.CTkSwitch(
@ -324,26 +319,23 @@ def create_root(start: Callable[[], None], destroy: Callable[[], None]) -> ctk.C
    )
    show_mouth_mask_box_switch.place(relx=0.6, rely=0.55)
    # Adjusting placement of Start, Stop, Preview buttons due to new switch
    start_button = ctk.CTkButton(
        root, text=_("Start"), cursor="hand2", command=lambda: analyze_target(start, root)
    )
-    start_button.place(relx=0.15, rely=0.85, relwidth=0.2, relheight=0.05) # rely from 0.80 to 0.85
+    start_button.place(relx=0.15, rely=0.85, relwidth=0.2, relheight=0.05)
    stop_button = ctk.CTkButton(
        root, text=_("Destroy"), cursor="hand2", command=lambda: destroy()
    )
-    stop_button.place(relx=0.4, rely=0.85, relwidth=0.2, relheight=0.05) # rely from 0.80 to 0.85
+    stop_button.place(relx=0.4, rely=0.85, relwidth=0.2, relheight=0.05)
    preview_button = ctk.CTkButton(
        root, text=_("Preview"), cursor="hand2", command=lambda: toggle_preview()
    )
-    preview_button.place(relx=0.65, rely=0.85, relwidth=0.2, relheight=0.05) # rely from 0.80 to 0.85
+    preview_button.place(relx=0.65, rely=0.85, relwidth=0.2, relheight=0.05)
    # --- Camera Selection ---
    # Adjusting placement of Camera selection due to new switch
    camera_label = ctk.CTkLabel(root, text=_("Select Camera:"))
-    camera_label.place(relx=0.1, rely=0.91, relwidth=0.2, relheight=0.05) # rely from 0.86 to 0.91
+    camera_label.place(relx=0.1, rely=0.91, relwidth=0.2, relheight=0.05)
    available_cameras = get_available_cameras()
    camera_indices, camera_names = available_cameras
@ -362,7 +354,7 @@ def create_root(start: Callable[[], None], destroy: Callable[[], None]) -> ctk.C
            root, variable=camera_variable, values=camera_names
        )
-    camera_optionmenu.place(relx=0.35, rely=0.91, relwidth=0.25, relheight=0.05) # rely from 0.86 to 0.91
+    camera_optionmenu.place(relx=0.35, rely=0.91, relwidth=0.25, relheight=0.05)
    live_button = ctk.CTkButton(
        root,
@ -382,16 +374,15 @@ def create_root(start: Callable[[], None], destroy: Callable[[], None]) -> ctk.C
            else "disabled"
        ),
    )
-    live_button.place(relx=0.65, rely=0.91, relwidth=0.2, relheight=0.05) # rely from 0.86 to 0.91
+    live_button.place(relx=0.65, rely=0.91, relwidth=0.2, relheight=0.05)
    # --- End Camera Selection ---
    status_label = ctk.CTkLabel(root, text=None, justify="center")
-    status_label.place(relx=0.1, rely=0.96, relwidth=0.8) # rely from 0.9 to 0.96
+    status_label.place(relx=0.1, rely=0.96, relwidth=0.8)
    donate_label = ctk.CTkLabel(
        root, text="Deep Live Cam", justify="center", cursor="hand2"
    )
-    donate_label.place(relx=0.1, rely=0.99, relwidth=0.8) # rely from 0.95 to 0.99
+    donate_label.place(relx=0.1, rely=0.99, relwidth=0.8)
    donate_label.configure(
        text_color=ctk.ThemeManager.theme.get("URL").get("text_color")
    )
@ -940,9 +931,6 @@ def create_webcam_preview(camera_index: int):
        source_face_obj_for_cam = get_one_face(source_frame_full_for_cam)
        if source_face_obj_for_cam is None:
            update_status(f"Error: No face detected in source image {modules.globals.source_path}")
            # This error is less critical for stopping immediately, but we'll make it persistent too.
            # The loop below will run, but processing for frames will effectively be skipped.
            # For consistency in error handling, make it persistent.
            cap.release()
            PREVIEW.withdraw()
            while PREVIEW.state() != "withdrawn" and ROOT.winfo_exists():
@ -983,8 +971,6 @@ def create_webcam_preview(camera_index: int):
        if not modules.globals.source_target_map and not modules.globals.simple_map:
            update_status("Warning: No face map defined for map_faces mode. Swapper may not work as expected.")
            # This is a warning, not a fatal error for the preview window itself. Processing will continue.
            # No persistent loop here, as it's a warning about functionality, not a critical load error.
    # --- End Source Image Loading ---
@ -1007,39 +993,32 @@ def create_webcam_preview(camera_index: int):
            temp_frame = fit_image_to_size(
                temp_frame, PREVIEW.winfo_width(), PREVIEW.winfo_height()
            )
        else:
            temp_frame = fit_image_to_size(
                temp_frame, PREVIEW.winfo_width(), PREVIEW.winfo_height()
            )
        # REMOVED: detection_frame_counter += 1
        # REMOVED: if detection_frame_counter % DETECTION_INTERVAL == 0:
        # The following block is now unindented to run every frame
        if not modules.globals.map_faces:
-            # Case 1: map_faces is False - source_face_obj_for_cam and source_frame_full_for_cam are pre-loaded
+            if source_face_obj_for_cam is not None and source_frame_full_for_cam is not None:
            if source_face_obj_for_cam is not None and source_frame_full_for_cam is not None: # Check if valid after pre-loading
                for frame_processor in frame_processors:
                    if frame_processor.NAME == "DLC.FACE-ENHANCER":
                        if modules.globals.fp_ui["face_enhancer"]:
                            temp_frame = frame_processor.process_frame(None, temp_frame)
                    else:
                        temp_frame = frame_processor.process_frame(source_face_obj_for_cam, source_frame_full_for_cam, temp_frame)
            # If source image was invalid (e.g. no face), source_face_obj_for_cam might be None.
            # In this case, the frame processors that need it will be skipped, effectively just showing the raw webcam frame.
            # The error message is already persistent due to the pre-loop check.
        else:
-            # Case 2: map_faces is True - source_frame_full_for_cam_map_faces is pre-loaded
+            if source_frame_full_for_cam_map_faces is not None:
-            if source_frame_full_for_cam_map_faces is not None: # Check if valid after pre-loading
+                modules.globals.target_path = None
                modules.globals.target_path = None # Standard for live mode
                for frame_processor in frame_processors:
                    if frame_processor.NAME == "DLC.FACE-ENHANCER":
                        if modules.globals.fp_ui["face_enhancer"]:
                            # Corrected: face_enhancer.process_frame_v2 is expected to take only temp_frame
                            temp_frame = frame_processor.process_frame_v2(temp_frame)
                    else:
                        # This is for other processors when map_faces is True
                        temp_frame = frame_processor.process_frame_v2(source_frame_full_for_cam_map_faces, temp_frame)
            # If source_frame_full_for_cam_map_faces was invalid, error is persistent from pre-loop check.
        # Calculate and display FPS
        current_time = time.time()
        frame_count += 1
        if current_time - prev_time >= fps_update_interval:
Author	SHA1	Message	Date
google-labs-jules[bot]	a01314b52c	feat: Implement Nth-frame detection with tracking for performance Optimizes webcam performance for face swapping by introducing Nth-frame full face detection and using a KCF tracker for intermediate frames in modules/processors/frame/face_swapper.py. Key changes: - Full face analysis (get_one_face) now runs every N frames (default 3) or when tracking is lost in the process_frame function (for single face mode). - For intermediate frames, a KCF tracker updates the target face bounding box, and keypoints are estimated by translating the last known good keypoints. - The actual face swap (inswapper model) still runs on every frame if a face (either detected or tracked) is available. - Experimental tracking logic added to _process_live_target_v2 for map_faces=True in live mode (non-many_faces path). - Added robustness: - None checks for landmarks in mouth_mask and create_face_mask functions, with fallbacks for create_face_mask. - Division-by-zero check in apply_color_transfer. - Reset tracker state in process_video for new video files. This aims to significantly improve FPS by reducing the frequency of costly full face analysis, while still providing a continuous swap. Mouth masking will be less effective on tracked intermediate frames due to the absence of full landmark data.	2025-06-18 14:25:56 +00:00
google-labs-jules[bot]	0fc481db47	fix: Revert Nth frame logic in ui.py to fix UnboundLocalError I've completely removed the Nth frame processing logic (frame counter, interval, and conditional execution) from the `create_webcam_preview` function in `modules/ui.py`. The frame processing block has been unindented to ensure it runs on every frame. This resolves an `UnboundLocalError` for 'detection_frame_counter' that occurred because the variable was being used after its initialization was removed in a previous attempt to revert this logic. The webcam preview will now process every frame as it did before the Nth frame optimization was introduced.	2025-06-18 11:20:32 +00:00
google-labs-jules[bot]	984048b39a	fix: Remove orphaned Nth frame counter line in ui.py Removes the leftover `detection_frame_counter += 1` line from the `create_webcam_preview` function in modules/ui.py. This line was erroneously kept after the rest of the Nth frame processing logic was reverted, causing an UnboundLocalError as the counter was no longer initialized. This fix ensures the webcam preview can start correctly without this error.	2025-06-18 10:35:13 +00:00
google-labs-jules[bot]	9fd870cfd2	refactor: Revert Nth frame processing in webcam mode Reverts the Nth frame processing logic previously introduced in modules/ui.py (create_webcam_preview function). Webcam frames will now be processed by the full pipeline on every frame, instead of skipping frames. This change is based on your feedback requesting to focus on optimizing the per-frame performance rather than using frame skipping techniques at this stage.	2025-06-18 09:54:10 +00:00
google-labs-jules[bot]	c5c08b652f	perf: Implement Nth frame processing for webcam mode Optimizes webcam performance by running full face detection and frame processing (face swap, enhancers) only every N frames (currently N=3) in modules/ui.py (create_webcam_preview function). For intermediate frames, the raw (but mirrored/resized) webcam feed is displayed. This aims to improve UI responsiveness and reduce overall CPU/GPU load during live webcam sessions, particularly when resource-intensive operations like hair swapping or face enhancement are active. The actual swap/effect will appear at a reduced frame rate (FPS/N), but the UI should remain smoother.	2025-06-18 09:03:07 +00:00