linux

mirror of https://github.com/torvalds/linux.git synced 2025-12-01 07:26:02 +07:00

Author	SHA1	Message	Date
Linus Torvalds	fc033cf25e	Linux 6.13-rc5	2024-12-29 13:15:45 -08:00
Linus Torvalds	4099a71718	Merge tag 'sched-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip Pull scheduler fix from Ingo Molnar: "Fix a procfs task state reporting regression when freezing sleeping tasks" * tag 'sched-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: freezer, sched: Report frozen tasks as 'D' instead of 'R'	2024-12-29 10:19:54 -08:00
Linus Torvalds	6cbc4b29eb	Merge tag 'x86-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip Pull x86 fixes from Ingo Molnar: - Fix a hang in the "kernel IBT no ENDBR" self-test that may trigger on FRED systems, caused by incomplete FRED state cleanup in the #CP fault handler - Improve TDX (Coco VM) guest unrecoverable error handling to not potentially leak decrypted memory * tag 'x86-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: virt: tdx-guest: Just leak decrypted memory on unrecoverable errors x86/fred: Clear WFE in missing-ENDBRANCH #CPs	2024-12-29 10:16:41 -08:00
Linus Torvalds	f65832a32f	Merge tag 'perf-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip Pull x86 perf fixes from Ingo Molnar: - Fix Intel Lunar Lake built-in event definitions - Fall back to (compatible) legacy features on new Intel PEBS format v6 hardware - Enable uncore support on Intel Clearwater Forest CPUs, which is the same as the existing Sierra Forest uncore driver * tag 'perf-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: perf/x86/intel: Fix bitmask of OCR and FRONTEND events for LNC perf/x86/intel/ds: Add PEBS format 6 perf/x86/intel/uncore: Add Clearwater Forest support	2024-12-29 10:14:08 -08:00
Linus Torvalds	bcfac5530a	Merge tag 'objtool-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip Pull objtool fix from Ingo Molnar: "Fix false positive objtool build warning related to a noreturn function in the bcachefs code" * tag 'objtool-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: objtool: Add bch2_trans_unlocked_error() to bcachefs noreturns	2024-12-29 10:07:40 -08:00
Linus Torvalds	bf7a281b80	Merge tag 'locking-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip Pull locking fix from Ingo Molnar: "Fix missed rtmutex wakeups causing sporadic boot hangs and other misbehavior" * tag 'locking-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: locking/rtmutex: Make sure we wake anything on the wake_q when we release the lock->wait_lock	2024-12-29 10:04:47 -08:00
Linus Torvalds	feffd35a03	Merge tag 'irq-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip Pull irq fix from Ingo Molnar: "Fix bogus MSI IRQ setup warning on RISC-V" * tag 'irq-urgent-2024-12-29' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: PCI/MSI: Handle lack of irqdomain gracefully	2024-12-29 10:03:01 -08:00
Linus Torvalds	c059361673	Merge tag 'for-6.13-rc4-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux Pull btrfs fixes from David Sterba: "A few more fixes that accumulated over the last two weeks, fixing some user reported problems: - swapfile fixes: - conditional reschedule in the activation loop - fix race with memory mapped file when activating - make activation loop interruptible - rework and fix extent sharing checks - folio fixes: - in send, recheck folio mapping after unlock - in relocation, recheck folio mapping after unlock - fix waiting for encoded read io_uring requests - fix transaction atomicity when enabling simple quotas - move COW block trace point before the block gets freed - print various sizes in sysfs with correct endianity" * tag 'for-6.13-rc4-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux: btrfs: sysfs: fix direct super block member reads btrfs: fix transaction atomicity bug when enabling simple quotas btrfs: avoid monopolizing a core when activating a swap file btrfs: allow swap activation to be interruptible btrfs: fix swap file activation failure due to extents that used to be shared btrfs: fix race with memory mapped writes when activating swap file btrfs: check folio mapping after unlock in put_file_data() btrfs: check folio mapping after unlock in relocate_one_folio() btrfs: fix use-after-free when COWing tree bock and tracing is enabled btrfs: fix use-after-free waiting for encoded read endios	2024-12-29 09:34:34 -08:00
Linus Torvalds	e1d9326608	Merge tag 'i2c-for-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux Pull i2c fixes from Wolfram Sang: - IMX: fix stop condition in single master mode and add compatible string for errata adherence - Microchip: Add support for proper repeated sends and fix unnecessary NAKs on empty messages, which caused false bus detection * tag 'i2c-for-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/wsa/linux: i2c: microchip-core: fix "ghost" detections i2c: microchip-core: actually use repeated sends i2c: imx: add imx7d compatible string for applying erratum ERR007805 i2c: imx: fix missing stop condition in single-master mode	2024-12-29 09:31:55 -08:00
Li RongQing	27834971f6	virt: tdx-guest: Just leak decrypted memory on unrecoverable errors In CoCo VMs it is possible for the untrusted host to cause set_memory_decrypted() to fail such that an error is returned and the resulting memory is shared. Callers need to take care to handle these errors to avoid returning decrypted (shared) memory to the page allocator, which could lead to functional or security issues. Leak the decrypted memory when set_memory_decrypted() fails. There is no need to print an error, since set_memory_decrypted() will call WARN_ONCE(). Fixes: `f4738f56d1` ("virt: tdx-guest: Add Quote generation support using TSM_REPORTS") Signed-off-by: Li RongQing <lirongqing@baidu.com> Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com> Signed-off-by: Ingo Molnar <mingo@kernel.org> Reviewed-by: Rick Edgecombe <rick.p.edgecombe@intel.com> Reviewed-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com> Cc: stable@vger.kernel.org Link: https://lore.kernel.org/all/20240619111801.25630-1-lirongqing%40baidu.com	2024-12-29 10:18:44 +01:00
Xin Li (Intel)	dc81e556f2	x86/fred: Clear WFE in missing-ENDBRANCH #CPs An indirect branch instruction sets the CPU indirect branch tracker (IBT) into WAIT_FOR_ENDBRANCH (WFE) state and WFE stays asserted across the instruction boundary. When the decoder finds an instruction other than ENDBR while WFE is set, the CPU raises a #CP fault. For the "kernel IBT no ENDBR" selftest where #CPs are deliberately triggered, the WFE state of the interrupted context needs to be cleared to let execution continue. Otherwise when the CPU resumes from the instruction that just caused the previous #CP, another missing-ENDBRANCH #CP is raised and the CPU enters a dead loop. This is not a problem with IDT because it doesn't preserve WFE and IRET doesn't set WFE. But FRED provides space on the entry stack (in an expanded CS area) to save and restore the WFE state, thus the WFE state is no longer clobbered, so software must clear it. To avoid dead looping, clear WFE in ibt_clear_fred_wfe() and in the !ibt_fatal code path, where execution is allowed to continue. Clobbering WFE in any other circumstance is a security-relevant bug. [ dhansen: changelog rewording ] Fixes: `a5f6c2ace9` ("x86/shstk: Add user control-protection fault handler") Signed-off-by: Xin Li (Intel) <xin@zytor.com> Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com> Signed-off-by: Ingo Molnar <mingo@kernel.org> Acked-by: Dave Hansen <dave.hansen@linux.intel.com> Cc: stable@vger.kernel.org Link: https://lore.kernel.org/all/20241113175934.3897541-1-xin%40zytor.com	2024-12-29 10:18:10 +01:00
Chen Ridong	f718faf394	freezer, sched: Report frozen tasks as 'D' instead of 'R' Before commit: `f5d39b0208` ("freezer,sched: Rewrite core freezer logic") the frozen task stat was reported as 'D' in cgroup v1. However, after rewriting the core freezer logic, the frozen task stat is reported as 'R'. This is confusing, especially when a task with stat of 'S' is frozen. This bug can be reproduced with these steps: $ cd /sys/fs/cgroup/freezer/ $ mkdir test $ sleep 1000 & [1] 739 // task whose stat is 'S' $ echo 739 > test/cgroup.procs $ echo FROZEN > test/freezer.state $ ps -aux | grep 739 root 739 0.1 0.0 8376 1812 pts/0 R 10:56 0:00 sleep 1000 As shown above, a task whose stat is 'S' was changed to 'R' when it was frozen. To solve this regression, simply maintain the same reported state as before the rewrite. [ mingo: Enhanced the changelog and comments ] Fixes: `f5d39b0208` ("freezer,sched: Rewrite core freezer logic") Signed-off-by: Chen Ridong <chenridong@huawei.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Ingo Molnar <mingo@kernel.org> Acked-by: Tejun Heo <tj@kernel.org> Acked-by: Michal Koutný <mkoutny@suse.com> Link: https://lore.kernel.org/r/20241217004818.3200515-1-chenridong@huaweicloud.com	2024-12-29 10:14:20 +01:00
chenchangcheng	31ad36a271	objtool: Add bch2_trans_unlocked_error() to bcachefs noreturns Fix the following objtool warning during build time: fs/bcachefs/btree_trans_commit.o: warning: objtool: bch2_trans_commit_write_locked.isra.0() falls through to next function do_bch2_trans_commit.isra.0() fs/bcachefs/btree_trans_commit.o: warning: objtool: .text: unexpected end of section ...... fs/bcachefs/btree_update.o: warning: objtool: bch2_trans_update_get_key_cache() falls through to next function flush_new_cached_update() fs/bcachefs/btree_update.o: warning: objtool: flush_new_cached_update() falls through to next function bch2_trans_update_by_path() bch2_trans_unlocked_error() is an Obviously Correct (tm) panic() wrapper, add it to the list of known noreturns. [ mingo: Improved the changelog ] Fixes: `fd104e2967` ("bcachefs: bch2_trans_verify_not_unlocked()") Signed-off-by: chenchangcheng <chenchangcheng@kylinos.cn> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Ingo Molnar <mingo@kernel.org> Link: https://lkml.kernel.org/r/20241220074847.3418134-1-ccc194101@163.com	2024-12-29 09:52:21 +01:00
Linus Torvalds	059dd502b2	Merge tag 'block-6.13-20241228' of git://git.kernel.dk/linux Pull block fix from Jens Axboe: "Just a single fix for ublk setup error handling" * tag 'block-6.13-20241228' of git://git.kernel.dk/linux: ublk: detach gendisk from ublk device if add_disk() fails	2024-12-28 11:02:35 -08:00
Linus Torvalds	d19a3ee573	Merge tag 'io_uring-6.13-20241228' of git://git.kernel.dk/linux Pull io_uring fix from Jens Axboe: "Just a single fix for a theoretical issue with SQPOLL setup" * tag 'io_uring-6.13-20241228' of git://git.kernel.dk/linux: io_uring/sqpoll: fix sqpoll error handling races	2024-12-28 11:00:29 -08:00
Linus Torvalds	e51da4a232	Merge tag '6.13-rc4-SMB3-client-fixes' of git://git.samba.org/sfrench/cifs-2.6 Pull smb client fixes from Steve French: - fix caching of files that will be reused for write - minor cleanup * tag '6.13-rc4-SMB3-client-fixes' of git://git.samba.org/sfrench/cifs-2.6: cifs: Remove unused is_server_using_iface() smb: enable reuse of deferred file handles for write operations	2024-12-28 10:58:01 -08:00
Linus Torvalds	fd0584d220	Merge tag 'trace-tools-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace Pull tracing tool fix from Steven Rostedt: - Fix rtla divide by zero when the count is zero in histograms * tag 'trace-tools-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace: rtla/timerlat: Fix histogram ALL for zero samples	2024-12-27 15:31:52 -08:00
Wolfram Sang	f802f11b23	Merge tag 'i2c-host-fixes-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/andi.shyti/linux into i2c/for-current i2c-host-fixes for v6.13-rc5 - IMX: fixed stop condition in single master mode and added compatible string for errata adherence. - Microchip: Added support for proper repeated sends and fixed unnecessary NAKs on empty messages, which caused false bus detection.	2024-12-28 00:25:04 +01:00
Linus Torvalds	8379578b11	Merge tag 'for-v6.13-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply Pull power supply fixes from Sebastian Reichel: - fix potential array out of bounds access in gpio-charger - cros_charge-control: - fix concurrent sysfs access - allow start_threshold == end_threshold - workaround limited v2 charge threshold API - bq24296: fix vbus regulator handling * tag 'for-v6.13-rc' of git://git.kernel.org/pub/scm/linux/kernel/git/sre/linux-power-supply: power: supply: bq24190: Fix BQ24296 Vbus regulator support power: supply: cros_charge-control: hide start threshold on v2 cmd power: supply: cros_charge-control: allow start_threshold == end_threshold power: supply: cros_charge-control: add mutex for driver data power: supply: gpio-charger: Fix set charge current limits	2024-12-27 11:10:56 -08:00
Linus Torvalds	eff4f67583	Merge tag 'powerpc-6.13-3' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux Pull powerpc fix from Madhavan Srinivasan: - Add close() callback in vas_vm_ops struct for proper cleanup Thanks to Haren Myneni. * tag 'powerpc-6.13-3' of git://git.kernel.org/pub/scm/linux/kernel/git/powerpc/linux: powerpc/pseries/vas: Add close() callback in vas_vm_ops struct	2024-12-27 11:06:29 -08:00
Linus Torvalds	411a678d30	Merge tag 'probes-fixes-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace Pull probes fix from Masami Hiramatsu: "Change the priority of the module callback of kprobe events so that it is called after the jump label list on the module is updated. This ensures the kprobe can check whether it is not on the jump label address correctly" * tag 'probes-fixes-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace: tracing/kprobe: Make trace_kprobe's module callback called after jump_label update	2024-12-27 11:03:15 -08:00
Linus Torvalds	f0bc704f46	Merge tag 'hardening-v6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux Pull hardening fix from Kees Cook: - stddef: make __struct_group() UAPI C++-friendly (Alexander Lobakin) * tag 'hardening-v6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux: stddef: make __struct_group() UAPI C++-friendly	2024-12-27 10:39:05 -08:00
Linus Torvalds	2c2b3d906c	Merge tag 'trace-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace Pull tracing fixes from Steven Rostedt: "Two minor tracing fixes: - Add "const" to "char *" in event structure field that gets assigned literals. - Check size of input passed into the tracing cpumask file. If too large an input gets passed into the cpumask file, it could trigger a warning in the bitmask parsing code" * tag 'trace-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace: tracing: Prevent bad count for tracing_cpumask_write tracing: Constify string literal data member in struct trace_event_call	2024-12-27 10:33:21 -08:00
Tomas Glozar	6cc45f8c1f	rtla/timerlat: Fix histogram ALL for zero samples rtla timerlat hist currently computes the minimum, maximum and average latency even in cases when there are zero samples. This leads to nonsensical values being calculated for maximum and minimum, and to divide by zero for average. A similar bug is fixed by `01b05fc0e5` ("rtla/timerlat: Fix histogram report when a cpu count is 0") but the bug still remains for printing the sum over all CPUs in timerlat_print_stats_all. The issue can be reproduced with this command: $ rtla timerlat hist -U -d 1s Index over: count: min: avg: max: Floating point exception (core dumped) (With -U there are never any samples unless a user workload is created.) Fix the bug by omitting max/min/avg when sample count is zero, displaying a dash instead, just like we already do for the individual CPUs. The logic is moved into a new function called format_summary_value, which is used for both the individual CPUs and for the overall summary. Cc: stable@vger.kernel.org Link: https://lore.kernel.org/20241127134130.51171-1-tglozar@redhat.com Fixes: `1462501c7a` ("rtla/timerlat: Add a summary for hist mode") Signed-off-by: Tomas Glozar <tglozar@redhat.com> Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-27 11:21:46 -05:00
Linus Torvalds	d6ef8b40d0	Merge tag 'sound-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound Pull sound fixes from Takashi Iwai: "A collection of small fixes. Nothing really stands out, fortunately. - Follow-up fixes for the new compress offload API extension - A few ASoC SOF, AMD and Mediatek quirks and fixes - A regression fix in legacy SH driver cleanup - Fix DMA mapping error handling in the helper code - Fix kselftest dependency" * tag 'sound-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/tiwai/sound: ALSA: sh: Fix wrong argument order for copy_from_iter() selftests/alsa: Fix circular dependency involving global-timer ALSA: memalloc: prefer dma_mapping_error() over explicit address checking ALSA: compress_offload: improve file descriptors installation for dma-buf ALSA: compress_offload: use safe list iteration in snd_compr_task_seq() ALSA: compress_offload: avoid 64-bit get_user() ALSA: compress_offload: import DMA_BUF namespace ASoC: mediatek: disable buffer pre-allocation ASoC: rt722: add delay time to wait for the calibration procedure ASoC: SOF: Intel: hda-dai: Do not release the link DMA on STOP ASoC: dt-bindings: realtek,rt5645: Fix CPVDD voltage comment ASoC: Intel: sof_sdw: Fix DMI match for Lenovo 21QA and 21QB ASoC: Intel: sof_sdw: Fix DMI match for Lenovo 21Q6 and 21Q7 ASoC: amd: ps: Fix for enabling DMIC on acp63 platform via _DSD entry	2024-12-26 10:49:02 -08:00
Linus Torvalds	23db0ed34f	Merge tag 'dmaengine-fix-6.13' of git://git.kernel.org/pub/scm/linux/kernel/git/vkoul/dmaengine Pull dmaengine fixes from Vinod Koul: "Bunch of minor driver fixes for drivers in this cycle: - Kernel doc warning documentation fixes - apple driver fix for register access - amd driver dropping private dma_ops - freescale cleanup path fix - refcount fix for mv_xor driver - null pointer deref fix for at_xdmac driver - GENMASK to GENMASK_ULL fix for loongson2 apb driver - Tegra driver fix for correcting dma status" * tag 'dmaengine-fix-6.13' of git://git.kernel.org/pub/scm/linux/kernel/git/vkoul/dmaengine: dmaengine: tegra: Return correct DMA status when paused dmaengine: mv_xor: fix child node refcount handling in early exit dmaengine: fsl-edma: implement the cleanup path of fsl_edma3_attach_pd() dmaengine: amd: qdma: Remove using the private get and set dma_ops APIs dmaengine: apple-admac: Avoid accessing registers in probe linux/dmaengine.h: fix a few kernel-doc warnings dmaengine: loongson2-apb: Change GENMASK to GENMASK_ULL dmaengine: dw: Select only supported masters for ACPI devices dmaengine: at_xdmac: avoid null_prt_deref in at_xdmac_prep_dma_memset	2024-12-26 10:43:25 -08:00
Linus Torvalds	6fcb22ef50	Merge tag 'phy-fixes-6.13' of git://git.kernel.org/pub/scm/linux/kernel/git/phy/linux-phy Pull phy fixes from Vinod Koul: "A few core API fixes for devm calls and bunch of driver fixes as usual: - devm_phy_xxx fixes for few APIs in the phy core - qmp driver register name config - init sequence fix for usb driver - rockchip driver setting drvdata correctly in samsung hdptx and reset fix for naneng combophy - regulator dependency fix for mediatek hdmi driver - overflow assertion fix for stm32 driver" * tag 'phy-fixes-6.13' of git://git.kernel.org/pub/scm/linux/kernel/git/phy/linux-phy: phy: mediatek: phy-mtk-hdmi: add regulator dependency phy: freescale: fsl-samsung-hdmi: Fix 64-by-32 division cocci warnings phy: core: Fix an OF node refcount leakage in of_phy_provider_lookup() phy: core: Fix an OF node refcount leakage in _of_phy_get() phy: core: Fix that API devm_phy_destroy() fails to destroy the phy phy: core: Fix that API devm_of_phy_provider_unregister() fails to unregister the phy provider phy: core: Fix that API devm_phy_put() fails to release the phy phy: rockchip: samsung-hdptx: Set drvdata before enabling runtime PM phy: stm32: work around constant-value overflow assertion phy: qcom-qmp: Fix register name in RX Lane config of SC8280XP phy: rockchip: naneng-combphy: fix phy reset phy: usb: Toggle the PHY power during init	2024-12-26 10:39:57 -08:00
Linus Torvalds	ab8beb2047	Merge tag 'chrome-platform-for-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/chrome-platform/linux Pull chrome platform fix from Tzung-Bi Shih: - Fix wrong product names for early Framework Laptops * tag 'chrome-platform-for-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/chrome-platform/linux: platform/chrome: cros_ec_lpc: fix product identity for early Framework Laptops	2024-12-26 10:35:13 -08:00
Pavel Begunkov	e33ac68e5e	io_uring/sqpoll: fix sqpoll error handling races BUG: KASAN: slab-use-after-free in __lock_acquire+0x370b/0x4a10 kernel/locking/lockdep.c:5089 Call Trace: <TASK> ... _raw_spin_lock_irqsave+0x3d/0x60 kernel/locking/spinlock.c:162 class_raw_spinlock_irqsave_constructor include/linux/spinlock.h:551 [inline] try_to_wake_up+0xb5/0x23c0 kernel/sched/core.c:4205 io_sq_thread_park+0xac/0xe0 io_uring/sqpoll.c:55 io_sq_thread_finish+0x6b/0x310 io_uring/sqpoll.c:96 io_sq_offload_create+0x162/0x11d0 io_uring/sqpoll.c:497 io_uring_create io_uring/io_uring.c:3724 [inline] io_uring_setup+0x1728/0x3230 io_uring/io_uring.c:3806 ... Kun Hu reports that the SQPOLL creating error path has UAF, which happens if io_uring_alloc_task_context() fails and then io_sq_thread() manages to run and complete before the rest of error handling code, which means io_sq_thread_finish() is looking at already killed task. Note that this is mostly theoretical, requiring fault injection on the allocation side to trigger in practice. Cc: stable@vger.kernel.org Reported-by: Kun Hu <huk23@m.fudan.edu.cn> Signed-off-by: Pavel Begunkov <asml.silence@gmail.com> Link: https://lore.kernel.org/r/0f2f1aa5729332612bd01fe0f2f385fd1f06ce7c.1735231717.git.asml.silence@gmail.com Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-26 10:02:40 -07:00
Ming Lei	75cd4005da	ublk: detach gendisk from ublk device if add_disk() fails Inside ublk_abort_requests(), gendisk is grabbed for aborting all inflight requests. And ublk_abort_requests() is called when exiting the uring context or handling timeout. If add_disk() fails, the gendisk may have been freed when calling ublk_abort_requests(), so a use-after-free can occur when taking the disk's reference in ublk_abort_requests(). Fix the bug by detaching gendisk from ublk device if add_disk() fails. Fixes: `bd23f6c2c2` ("ublk: quiesce request queue when aborting queue") Signed-off-by: Ming Lei <ming.lei@redhat.com> Link: https://lore.kernel.org/r/20241225110640.351531-1-ming.lei@redhat.com Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-26 06:42:55 -07:00
Conor Dooley	49e1f0fd0d	i2c: microchip-core: fix "ghost" detections Running i2c-detect currently produces an output akin to: 0 1 2 3 4 5 6 7 8 9 a b c d e f 00: 08 -- 0a -- 0c -- 0e -- 10: 10 -- 12 -- 14 -- 16 -- UU 19 -- 1b -- 1d -- 1f 20: -- 21 -- 23 -- 25 -- 27 -- 29 -- 2b -- 2d -- 2f 30: -- -- -- -- -- -- -- -- 38 -- 3a -- 3c -- 3e -- 40: 40 -- 42 -- 44 -- 46 -- 48 -- 4a -- 4c -- 4e -- 50: -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 60: 60 -- 62 -- 64 -- 66 -- 68 -- 6a -- 6c -- 6e -- 70: 70 -- 72 -- 74 -- 76 -- This happens because for an i2c_msg with a len of 0 the driver will mark the transmission of the message as a success once the START has been sent, without waiting for the devices on the bus to respond with an ACK/NAK. Since i2cdetect seems to run in a tight loop over all addresses the NAK is treated as part of the next test for the next address. Delete the fast path that marks a message as complete when idev->msg_len is zero after sending a START/RESTART since this isn't a valid scenario. CC: stable@vger.kernel.org Fixes: `64a6f1c498` ("i2c: add support for microchip fpga i2c controllers") Signed-off-by: Conor Dooley <conor.dooley@microchip.com> Reviewed-by: Andi Shyti <andi.shyti@kernel.org> Link: https://lore.kernel.org/r/20241218-outbid-encounter-b2e78b1cc707@spud Signed-off-by: Andi Shyti <andi.shyti@kernel.org>	2024-12-26 01:54:47 +01:00
Conor Dooley	9a8f9320d6	i2c: microchip-core: actually use repeated sends At present, where repeated sends are intended to be used, the i2c-microchip-core driver sends a stop followed by a start. Lots of i2c devices must not malfunction in the face of this behaviour, because the driver has operated like this for years! Try to keep track of whether or not a repeated send is required, and suppress sending a stop in these cases. CC: stable@vger.kernel.org Fixes: `64a6f1c498` ("i2c: add support for microchip fpga i2c controllers") Signed-off-by: Conor Dooley <conor.dooley@microchip.com> Reviewed-by: Andi Shyti <andi.shyti@kernel.org> Link: https://lore.kernel.org/r/20241218-football-composure-e56df2461461@spud Signed-off-by: Andi Shyti <andi.shyti@kernel.org>	2024-12-26 01:54:47 +01:00
Carlos Song	e0cec36319	i2c: imx: add imx7d compatible string for applying erratum ERR007805 The compatible string "fsl,imx7d-i2c" does not exist in the i2c-imx driver's compatible string table. As a result, "fsl,imx21-i2c" is matched instead, and erratum ERR007805 is not actually applied. Add the "fsl,imx7d-i2c" compatible string to the i2c-imx driver so that erratum ERR007805 (https://www.nxp.com/docs/en/errata/IMX7DS_3N09P.pdf) is applied. " ERR007805 I2C: When the I2C clock speed is configured for 400 kHz, the SCL low period violates the I2C spec of 1.3 uS min Description: When the I2C module is programmed to operate at the maximum clock speed of 400 kHz (as defined by the I2C spec), the SCL clock low period violates the I2C spec of 1.3 uS min. The user must reduce the clock speed to obtain the SCL low time to meet the 1.3us I2C minimum required. This behavior means the SoC is not compliant to the I2C spec at 400kHz. Workaround: To meet the clock low period requirement in fast speed mode, SCL must be configured to 384KHz or less. " "fsl,imx7d-i2c" is already documented in the binding doc. This erratum fix has been included in imx6_i2c_hwdata and it is the same in all i.MX6/7/8, so just reuse it. Fixes: `39c025721d` ("i2c: imx: Implement errata ERR007805 or e7805 bus frequency limit") Cc: stable@vger.kernel.org # v5.18+ Signed-off-by: Carlos Song <carlos.song@nxp.com> Signed-off-by: Haibo Chen <haibo.chen@nxp.com> Reviewed-by: Frank Li <Frank.Li@nxp.com> Acked-by: Oleksij Rempel <o.rempel@pengutronix.de> Link: https://lore.kernel.org/r/20241218044238.143414-1-carlos.song@nxp.com Signed-off-by: Andi Shyti <andi.shyti@kernel.org>	2024-12-25 23:45:05 +01:00
Stefan Eichenberger	768776dd4e	i2c: imx: fix missing stop condition in single-master mode A regression was introduced with the implementation of single-master mode, preventing proper stop conditions from being generated. Devices that require a valid stop condition, such as EEPROMs, fail to function correctly as a result. The issue only affects devices with the single-master property enabled. This commit resolves the issue by re-enabling I2C bus busy bit (IBB) polling for single-master mode when generating a stop condition. The fix further ensures that the i2c_imx->stopped flag is cleared at the start of each transfer, allowing the stop condition to be correctly generated in i2c_imx_stop(). According to the reference manual (IMX8MMRM, Rev. 2, 09/2019, page 5270), polling the IBB bit to determine if the bus is free is only necessary in multi-master mode. Consequently, the IBB bit is not polled for the start condition in single-master mode. Fixes: `6692694aca` ("i2c: imx: do not poll for bus busy in single master mode") Signed-off-by: Stefan Eichenberger <stefan.eichenberger@toradex.com> Reviewed-by: Frank Li <Frank.Li@nxp.com> Reviewed-by: Francesco Dolcini <francesco.dolcini@toradex.com> Link: https://lore.kernel.org/r/20241216151829.74056-1-eichest@gmail.com Signed-off-by: Andi Shyti <andi.shyti@kernel.org>	2024-12-25 23:45:04 +01:00
Dustin L. Howett	dcd59d0d7d	platform/chrome: cros_ec_lpc: fix product identity for early Framework Laptops The product names for the Framework Laptop (12th and 13th Generation Intel Core) are incorrect as of `62be134abf`. Fixes: `62be134abf` ("platform/chrome: cros_ec_lpc: switch primary DMI data for Framework Laptop") Cc: stable@vger.kernel.org # 6.12.x Signed-off-by: Dustin L. Howett <dustin@howett.net> Reviewed-by: Thomas Weißschuh <linux@weissschuh.net> Link: https://lore.kernel.org/r/20241224-platform-chrome-cros_ec_lpc-fix-product-identity-for-early-framework-laptops-v1-1-0d31d6e1d22c@howett.net Signed-off-by: Tzung-Bi Shih <tzungbi@kernel.org>	2024-12-25 01:47:35 +00:00
Linus Torvalds	9b2ffa6148	Merge tag 'mtd/fixes-for-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/mtd/linux Pull mtd fixes from Miquel Raynal: "Four minor fixes for NAND controller drivers (cleanup path, double actions, and W=1 warning) as well as a cast to avoid overflows in an mtd device driver" * tag 'mtd/fixes-for-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/mtd/linux: mtd: rawnand: omap2: Fix build warnings with W=1 mtd: rawnand: arasan: Fix missing de-registration of NAND mtd: rawnand: arasan: Fix double assertion of chip-select mtd: diskonchip: Cast an operand to prevent potential overflow mtd: rawnand: fix double free in atmel_pmecc_create_user()	2024-12-24 09:08:45 -08:00
Arnd Bergmann	17194c2998	phy: mediatek: phy-mtk-hdmi: add regulator dependency The driver no longer builds when regulator support is unavailable: arm-linux-gnueabi-ld: drivers/phy/mediatek/phy-mtk-hdmi.o: in function `mtk_hdmi_phy_register_regulators': phy-mtk-hdmi.c:(.text.unlikely+0x3e): undefined reference to `devm_regulator_register' arm-linux-gnueabi-ld: drivers/phy/mediatek/phy-mtk-hdmi-mt8195.o: in function `mtk_hdmi_phy_pwr5v_is_enabled': phy-mtk-hdmi-mt8195.c:(.text+0x326): undefined reference to `rdev_get_drvdata' arm-linux-gnueabi-ld: drivers/phy/mediatek/phy-mtk-hdmi-mt8195.o: in function `mtk_hdmi_phy_pwr5v_disable': phy-mtk-hdmi-mt8195.c:(.text+0x346): undefined reference to `rdev_get_drvdata' arm-linux-gnueabi-ld: drivers/phy/mediatek/phy-mtk-hdmi-mt8195.o: in function `mtk_hdmi_phy_pwr5v_enable': Fixes: `49393b2da1` ("phy: mediatek: phy-mtk-hdmi: Register PHY provided regulator") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Reviewed-by: AngeloGioacchino Del Regno <angelogioacchino.delregno@collabora.com> Link: https://lore.kernel.org/r/20241213083056.2596499-1-arnd@kernel.org Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 20:38:53 +05:30
Adam Ford	739214dd1c	phy: freescale: fsl-samsung-hdmi: Fix 64-by-32 division cocci warnings The kernel test robot reports the following warning: do_div() does a 64-by-32 division, please consider using div64_ul instead. To prevent the 64-by-32 division, consolidate both the multiplication and the do_div into one line which explicitly uses u64 sizes. Fixes: `1951dbb41d` ("phy: freescale: fsl-samsung-hdmi: Support dynamic integer") Signed-off-by: Adam Ford <aford173@gmail.com> Reported-by: kernel test robot <lkp@intel.com> Closes: https://lore.kernel.org/oe-kbuild-all/202412091243.fSObwwPi-lkp@intel.com/ Link: https://lore.kernel.org/r/20241215220555.99113-1-aford173@gmail.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 20:37:56 +05:30
Zijun Hu	a2d633cb14	phy: core: Fix an OF node refcount leakage in of_phy_provider_lookup() The for_each_child_of_node(parent, child) macro increases the refcount of @child before entering its loop body, so code that returns from inside the loop body normally needs to call of_node_put(@child) first to avoid a refcount leak. of_phy_provider_lookup() returns from inside such a loop without calling of_node_put(), so it leaks the OF node refcount. Fix by simply calling of_node_put() before returning from the loop body. The APIs affected by this issue are shown below since they indirectly invoke problematic of_phy_provider_lookup(). phy_get() of_phy_get() devm_phy_get() devm_of_phy_get() devm_of_phy_get_by_index() Fixes: `2a4c37016c` ("phy: core: Fix of_phy_provider_lookup to return PHY provider for sub node") Cc: stable@vger.kernel.org Reviewed-by: Johan Hovold <johan+linaro@kernel.org> Signed-off-by: Zijun Hu <quic_zijuhu@quicinc.com> Link: https://lore.kernel.org/r/20241213-phy_core_fix-v6-5-40ae28f5015a@quicinc.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 19:55:37 +05:30
Zijun Hu	5ebdc6be16	phy: core: Fix an OF node refcount leakage in _of_phy_get() _of_phy_get() returns directly on the of_device_is_compatible() error path, but it forgets to decrease the refcount of OF node @args.np before the error return. The refcount was increased by the preceding of_parse_phandle_with_args(), so the OF node's refcount leaks. Fix by decreasing the refcount via of_node_put() before the error return. Fixes: `b7563e2796` ("phy: work around 'phys' references to usb-nop-xceiv devices") Cc: stable@vger.kernel.org Reviewed-by: Johan Hovold <johan+linaro@kernel.org> Signed-off-by: Zijun Hu <quic_zijuhu@quicinc.com> Link: https://lore.kernel.org/r/20241213-phy_core_fix-v6-4-40ae28f5015a@quicinc.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 19:55:37 +05:30
Zijun Hu	4dc48c88fc	phy: core: Fix that API devm_phy_destroy() fails to destroy the phy The comment on devm_phy_destroy() says it invokes phy_destroy() to destroy the phy, but it never actually does so because devres_destroy() does not call devm_phy_consume(). As a result, the phy is not destroyed. Fortunately, the faulty API is not used in the current kernel tree. Fix by using devres_release() instead of devres_destroy() within the API. Fixes: `ff76496347` ("drivers: phy: add generic PHY framework") Reviewed-by: Johan Hovold <johan+linaro@kernel.org> Signed-off-by: Zijun Hu <quic_zijuhu@quicinc.com> Link: https://lore.kernel.org/r/20241213-phy_core_fix-v6-3-40ae28f5015a@quicinc.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 19:55:37 +05:30
Zijun Hu	c0b82ab95b	phy: core: Fix that API devm_of_phy_provider_unregister() fails to unregister the phy provider The comment on devm_of_phy_provider_unregister() says it invokes of_phy_provider_unregister() to unregister the phy provider, but it never actually does so because devres_destroy() does not call devm_phy_provider_release(). The missing of_phy_provider_unregister() call means that: - The phy provider is not unregistered. - Both memory and the OF node refcount are leaked. Fortunately, the faulty API is not used in the current kernel tree. Fix by using devres_release() instead of devres_destroy() within the API. Fixes: `ff76496347` ("drivers: phy: add generic PHY framework") Reviewed-by: Johan Hovold <johan+linaro@kernel.org> Signed-off-by: Zijun Hu <quic_zijuhu@quicinc.com> Link: https://lore.kernel.org/stable/20241213-phy_core_fix-v6-2-40ae28f5015a%40quicinc.com Link: https://lore.kernel.org/r/20241213-phy_core_fix-v6-2-40ae28f5015a@quicinc.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 19:55:37 +05:30
Zijun Hu	fe4bfa9b6d	phy: core: Fix that API devm_phy_put() fails to release the phy For devm_phy_put(), its comment says it needs to invoke phy_put() to release the phy, but it will not actually invoke the function since devres_destroy() does not call devm_phy_release(), and the missing phy_put() call will cause: - The phy fails to be released. - devm_phy_put() can not fully undo what API devm_phy_get() does. - Leak refcount of both the module and device for below typical usage: devm_phy_get(); // or its variant ... err = do_something(); if (err) goto err_out; ... err_out: devm_phy_put(); // leak refcount here The file(s) affected by this issue are shown below since they have such typical usage. drivers/pci/controller/cadence/pcie-cadence.c drivers/net/ethernet/ti/am65-cpsw-nuss.c Fix by using devres_release() instead of devres_destroy() within the API. Fixes: `ff76496347` ("drivers: phy: add generic PHY framework") Cc: stable@vger.kernel.org Cc: Lorenzo Pieralisi <lpieralisi@kernel.org> Cc: Krzysztof Wilczyński <kw@linux.com> Cc: Bjorn Helgaas <bhelgaas@google.com> Cc: David S. Miller <davem@davemloft.net> Cc: Eric Dumazet <edumazet@google.com> Cc: Jakub Kicinski <kuba@kernel.org> Cc: Paolo Abeni <pabeni@redhat.com> Reviewed-by: Johan Hovold <johan+linaro@kernel.org> Signed-off-by: Zijun Hu <quic_zijuhu@quicinc.com> Link: https://lore.kernel.org/r/20241213-phy_core_fix-v6-1-40ae28f5015a@quicinc.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 19:55:37 +05:30
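The three devm fixes above rest on one distinction: devres_destroy() removes the tracked resource without running its release callback, while devres_release() removes it and runs the callback. The following is a simplified userspace model of that distinction only; `struct devres_model`, `struct phy_model` and the `model_*` functions are hypothetical and do not reproduce the kernel's devres implementation.

```c
struct phy_model {
	int refcount;
};

struct devres_model {
	void (*release)(void *res);	/* callback that undoes the acquisition */
	void *res;
	int registered;
};

/* Stand-in for the put that the release callback is expected to perform. */
static void phy_model_put(void *res)
{
	((struct phy_model *)res)->refcount--;
}

/* Models devres_destroy(): the entry is dropped, the callback is NOT run. */
static int model_devres_destroy(struct devres_model *dr)
{
	if (!dr->registered)
		return -1;
	dr->registered = 0;
	return 0;
}

/* Models devres_release(): the entry is dropped AND the callback is run. */
static int model_devres_release(struct devres_model *dr)
{
	if (!dr->registered)
		return -1;
	dr->registered = 0;
	dr->release(dr->res);
	return 0;
}
```

In this model, calling `model_devres_destroy()` on an entry whose callback is `phy_model_put` leaves the reference count untouched, which corresponds to the leak the commits describe.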
Akhil R	ebc008699f	dmaengine: tegra: Return correct DMA status when paused Currently, the driver does not return the correct DMA status when a DMA pause is issued by the client drivers. This causes GPCDMA users to assume that DMA is still running, while in reality, the DMA is paused. Return DMA_PAUSED for tx_status() if the channel is paused in the middle of a transfer. Fixes: `ee17028009` ("dmaengine: tegra: Add tegra gpcdma driver") Cc: stable@vger.kernel.org Signed-off-by: Akhil R <akhilrajeev@nvidia.com> Signed-off-by: Kartik Rajput <kkartik@nvidia.com> Link: https://lore.kernel.org/r/20241212124412.5650-1-kkartik@nvidia.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 15:49:30 +05:30
Javier Carrasco	362f1bf98a	dmaengine: mv_xor: fix child node refcount handling in early exit The for_each_child_of_node() loop requires explicit calls to of_node_put() to decrement the child's refcount upon early exits (break, goto, return). Add the missing calls in the two early exits before the goto instructions. Cc: stable@vger.kernel.org Fixes: `f7d12ef53d` ("dma: mv_xor: add Device Tree binding") Signed-off-by: Javier Carrasco <javier.carrasco.cruz@gmail.com> Link: https://lore.kernel.org/r/20241011-dma_mv_xor_of_node_put-v1-1-3c2de819f463@gmail.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 15:45:01 +05:30
Joe Hattori	ccfa3131d4	dmaengine: fsl-edma: implement the cleanup path of fsl_edma3_attach_pd() Current implementation of fsl_edma3_attach_pd() does not provide a cleanup path, resulting in a memory leak. For example, dev_pm_domain_detach() is not called after dev_pm_domain_attach_by_id(), and the device link created with the DL_FLAG_STATELESS is not released explicitly. Therefore, provide a cleanup function fsl_edma3_detach_pd() and call it upon failure. Also add a devm_add_action_or_reset() call with this function after a successful fsl_edma3_attach_pd(). Fixes: `72f5801a4e` ("dmaengine: fsl-edma: integrate v3 support") Signed-off-by: Joe Hattori <joe@pf.is.s.u-tokyo.ac.jp> Link: https://lore.kernel.org/r/20241221075712.3297200-1-joe@pf.is.s.u-tokyo.ac.jp Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-24 14:55:57 +05:30
Lizhi Xu	98feccbf32	tracing: Prevent bad count for tracing_cpumask_write If a large count is provided, it will trigger a warning in bitmap_parse_user(). Also reject a count of zero. Cc: stable@vger.kernel.org Fixes: `9e01c1b74c` ("cpumask: convert kernel trace functions") Link: https://lore.kernel.org/20241216073238.2573704-1-lizhi.xu@windriver.com Reported-by: syzbot+0aecfd34fb878546f3fd@syzkaller.appspotmail.com Closes: https://syzkaller.appspot.com/bug?extid=0aecfd34fb878546f3fd Tested-by: syzbot+0aecfd34fb878546f3fd@syzkaller.appspotmail.com Signed-off-by: Lizhi Xu <lizhi.xu@windriver.com> Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-23 21:59:15 -05:00
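The general shape of such a guard, validating a user-supplied length before it reaches a parser, might look like the sketch below. The limit `MODEL_MAX_COUNT` and the function `checked_write()` are hypothetical; the actual bound used by the kernel fix is not stated in the commit message above.

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical upper bound chosen for illustration only. */
#define MODEL_MAX_COUNT 4096

/* Reject empty and oversized writes before any parsing happens. */
static long checked_write(const char *buf, size_t count)
{
	(void)buf;	/* parsing is out of scope for this sketch */

	if (count == 0 || count > MODEL_MAX_COUNT)
		return -EINVAL;

	return (long)count;
}
```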
Christian Göttsche	452f4b31e3	tracing: Constify string literal data member in struct trace_event_call The name member of the struct trace_event_call is assigned with generated string literals; declare them pointer to read-only. Reported by clang: security/landlock/syscalls.c:179:1: warning: initializing 'char *' with an expression of type 'const char[34]' discards qualifiers [-Wincompatible-pointer-types-discards-qualifiers] 179 \| SYSCALL_DEFINE3(landlock_create_ruleset, \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 \| const struct landlock_ruleset_attr __user *const, attr, \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 181 \| const size_t, size, const __u32, flags) \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ./include/linux/syscalls.h:226:36: note: expanded from macro 'SYSCALL_DEFINE3' 226 \| #define SYSCALL_DEFINE3(name, ...) SYSCALL_DEFINEx(3, _##name, __VA_ARGS__) \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ./include/linux/syscalls.h:234:2: note: expanded from macro 'SYSCALL_DEFINEx' 234 \| SYSCALL_METADATA(sname, x, __VA_ARGS__) \ \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ./include/linux/syscalls.h:184:2: note: expanded from macro 'SYSCALL_METADATA' 184 \| SYSCALL_TRACE_ENTER_EVENT(sname); \ \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ./include/linux/syscalls.h:151:30: note: expanded from macro 'SYSCALL_TRACE_ENTER_EVENT' 151 \| .name = "sys_enter"#sname, \ \| ^~~~~~~~~~~~~~~~~ Cc: stable@vger.kernel.org Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Mickaël Salaün <mic@digikod.net> Cc: Günther Noack <gnoack@google.com> Cc: Nathan Chancellor <nathan@kernel.org> Cc: Nick Desaulniers <ndesaulniers@google.com> Cc: Bill Wendling <morbo@google.com> Cc: Justin Stitt <justinstitt@google.com> Link: https://lore.kernel.org/20241125105028.42807-1-cgoettsche@seltendoof.de Fixes: `b77e38aa24` ("tracing: add event trace infrastructure") Signed-off-by: Christian Göttsche <cgzones@googlemail.com> Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-23 21:53:43 -05:00
Linus Torvalds	ef49c460ab	Merge tag 'modules-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/modules/linux Pull modules fix from Petr Pavlu: "A single fix is present to correct the module vermagic for PREEMPT_RT" * tag 'modules-6.13-rc5' of git://git.kernel.org/pub/scm/linux/kernel/git/modules/linux: preempt: Move PREEMPT_RT before PREEMPT in vermagic.	2024-12-23 13:06:48 -08:00
Qu Wenruo	fca432e73d	btrfs: sysfs: fix direct super block member reads The following sysfs entries are reading super block member directly, which can have a different endian and cause wrong values: - sys/fs/btrfs/<uuid>/nodesize - sys/fs/btrfs/<uuid>/sectorsize - sys/fs/btrfs/<uuid>/clone_alignment Thankfully those values (nodesize and sectorsize) are always aligned inside the btrfs_super_block, so it won't trigger unaligned read errors, just endian problems. Fix them by using the native cached members instead. Fixes: `df93589a17` ("btrfs: export more from FS_INFO to sysfs") CC: stable@vger.kernel.org Reviewed-by: Naohiro Aota <naohiro.aota@wdc.com> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: David Sterba <dsterba@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 22:06:44 +01:00
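The endianness problem can be shown with a small userspace sketch. It assumes, as the commit implies, that the on-disk field is stored little-endian; `le32_decode()` and `raw_load_on_big_endian()` are hypothetical helpers, not btrfs functions.

```c
#include <stdint.h>

/* Decode a little-endian 32-bit field from its on-disk bytes.
 * Gives the same answer on any host byte order. */
static uint32_t le32_decode(const unsigned char b[4])
{
	return (uint32_t)b[0] |
	       ((uint32_t)b[1] << 8) |
	       ((uint32_t)b[2] << 16) |
	       ((uint32_t)b[3] << 24);
}

/* What a raw, unconverted load of the same bytes would yield
 * on a big-endian CPU. */
static uint32_t raw_load_on_big_endian(const unsigned char b[4])
{
	return ((uint32_t)b[0] << 24) |
	       ((uint32_t)b[1] << 16) |
	       ((uint32_t)b[2] << 8) |
	       (uint32_t)b[3];
}
```

For the bytes `00 40 00 00`, the converted value is 16384 while the raw big-endian load is 4194304, which is the kind of wrong value a direct super block read could expose through sysfs.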
Julian Sun	f2363e6fcc	btrfs: fix transaction atomicity bug when enabling simple quotas Set squota incompat bit before committing the transaction that enables the feature. With the config CONFIG_BTRFS_ASSERT enabled, an assertion failure occurs regarding the simple quota feature. [5.596534] assertion failed: btrfs_fs_incompat(fs_info, SIMPLE_QUOTA), in fs/btrfs/qgroup.c:365 [5.597098] ------------[ cut here ]------------ [5.597371] kernel BUG at fs/btrfs/qgroup.c:365! [5.597946] CPU: 1 UID: 0 PID: 268 Comm: mount Not tainted 6.13.0-rc2-00031-gf92f4749861b #146 [5.598450] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.2-debian-1.16.2-1 04/01/2014 [5.599008] RIP: 0010:btrfs_read_qgroup_config+0x74d/0x7a0 [5.604303] <TASK> [5.605230] ? btrfs_read_qgroup_config+0x74d/0x7a0 [5.605538] ? exc_invalid_op+0x56/0x70 [5.605775] ? btrfs_read_qgroup_config+0x74d/0x7a0 [5.606066] ? asm_exc_invalid_op+0x1f/0x30 [5.606441] ? btrfs_read_qgroup_config+0x74d/0x7a0 [5.606741] ? btrfs_read_qgroup_config+0x74d/0x7a0 [5.607038] ? try_to_wake_up+0x317/0x760 [5.607286] open_ctree+0xd9c/0x1710 [5.607509] btrfs_get_tree+0x58a/0x7e0 [5.608002] vfs_get_tree+0x2e/0x100 [5.608224] fc_mount+0x16/0x60 [5.608420] btrfs_get_tree+0x2f8/0x7e0 [5.608897] vfs_get_tree+0x2e/0x100 [5.609121] path_mount+0x4c8/0xbc0 [5.609538] __x64_sys_mount+0x10d/0x150 The issue can be easily reproduced using the following reproducer: root@q:linux# cat repro.sh set -e mkfs.btrfs -q -f /dev/sdb mount /dev/sdb /mnt/btrfs btrfs quota enable -s /mnt/btrfs umount /mnt/btrfs mount /dev/sdb /mnt/btrfs The issue is that when enabling quotas, at btrfs_quota_enable(), we set BTRFS_QGROUP_STATUS_FLAG_SIMPLE_MODE at fs_info->qgroup_flags and persist it in the quota root in the item with the key BTRFS_QGROUP_STATUS_KEY, but we only set the incompat bit BTRFS_FEATURE_INCOMPAT_SIMPLE_QUOTA after we commit the transaction used to enable simple quotas. 
This means that if after that transaction commit we unmount the filesystem without starting and committing any other transaction, or we have a power failure, the next time we mount the filesystem we will find the flag BTRFS_QGROUP_STATUS_FLAG_SIMPLE_MODE set in the item with the key BTRFS_QGROUP_STATUS_KEY but we will not find the incompat bit BTRFS_FEATURE_INCOMPAT_SIMPLE_QUOTA set in the superblock, triggering an assertion failure at: btrfs_read_qgroup_config() -> qgroup_read_enable_gen() To fix this issue, set the BTRFS_FEATURE_INCOMPAT_SIMPLE_QUOTA flag immediately after setting the BTRFS_QGROUP_STATUS_FLAG_SIMPLE_MODE. This ensures that both flags are flushed to disk within the same transaction. Fixes: `182940f4f4` ("btrfs: qgroup: add new quota mode for simple quotas") CC: stable@vger.kernel.org # 6.6+ Reviewed-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: Julian Sun <sunjunchao2870@gmail.com> Signed-off-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 22:05:05 +01:00
Filipe Manana	2c8507c63f	btrfs: avoid monopolizing a core when activating a swap file During swap activation we iterate over the extents of a file and we can have many thousands of them, so we can end up in a busy loop monopolizing a core. Avoid this by doing a voluntary reschedule after processing each extent. CC: stable@vger.kernel.org # 5.4+ Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 22:04:48 +01:00
Filipe Manana	9a45022a0e	btrfs: allow swap activation to be interruptible During swap activation we iterate over the extents of a file, then do several checks for each extent, some of which may take some significant time such as checking if an extent is shared. Since a file can have many thousands of extents, this can be a very slow operation and it's currently not interruptible. I had a bug during development of a previous patch that resulted in an infinite loop when iterating the extents, so a core was busy looping and I couldn't cancel the operation, which is very annoying and requires a reboot. So make the loop interruptible by checking for fatal signals at the end of each iteration and stopping immediately if there is one. CC: stable@vger.kernel.org # 5.4+ Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 22:04:38 +01:00
Filipe Manana	03018e5d85	btrfs: fix swap file activation failure due to extents that used to be shared When activating a swap file, to determine if an extent is shared we use can_nocow_extent(), which ends up at btrfs_cross_ref_exist(). That helper is meant to be quick because it's used in the NOCOW write path, when flushing delalloc and when doing a direct IO write, however it does return some false positives, meaning it may indicate that an extent is shared even if it's no longer the case. For the write path this is fine, we just do a unnecessary COW operation instead of doing a more rigorous check which would be too heavy (calling btrfs_is_data_extent_shared()). However when activating a swap file, the false positives simply result in a failure, which is confusing for users/applications. One particular case where this happens is when a data extent only has 1 reference but that reference is not inlined in the extent item located in the extent tree - this happens when we create more than 33 references for an extent and then delete those 33 references plus every other non-inline reference except one. The function check_committed_ref() assumes that if the size of an extent item doesn't match the size of struct btrfs_extent_item plus the size of an inline reference (plus an owner reference in case simple quotas are enabled), then the extent is shared - that is not the case however, we can have a single reference but it's not inlined - the reason we do this is to be fast and avoid inspecting non-inline references which may be located in another leaf of the extent tree, slowing down write paths. The following test script reproduces the bug: $ cat test.sh #!/bin/bash DEV=/dev/sdi MNT=/mnt/sdi NUM_CLONES=50 umount $DEV &> /dev/null run_test() { local sync_after_add_reflinks=$1 local sync_after_remove_reflinks=$2 mkfs.btrfs -f $DEV > /dev/null #mkfs.xfs -f $DEV > /dev/null mount $DEV $MNT touch $MNT/foo chmod 0600 $MNT/foo # On btrfs the file must be NOCOW. 
chattr +C $MNT/foo &> /dev/null xfs_io -s -c "pwrite -b 1M 0 1M" $MNT/foo mkswap $MNT/foo for ((i = 1; i <= $NUM_CLONES; i++)); do touch $MNT/foo_clone_$i chmod 0600 $MNT/foo_clone_$i # On btrfs the file must be NOCOW. chattr +C $MNT/foo_clone_$i &> /dev/null cp --reflink=always $MNT/foo $MNT/foo_clone_$i done if [ $sync_after_add_reflinks -ne 0 ]; then # Flush delayed refs and commit current transaction. sync -f $MNT fi # Remove the original file and all clones except the last. rm -f $MNT/foo for ((i = 1; i < $NUM_CLONES; i++)); do rm -f $MNT/foo_clone_$i done if [ $sync_after_remove_reflinks -ne 0 ]; then # Flush delayed refs and commit current transaction. sync -f $MNT fi # Now use the last clone as a swap file. It should work since # its extent are not shared anymore. swapon $MNT/foo_clone_${NUM_CLONES} swapoff $MNT/foo_clone_${NUM_CLONES} umount $MNT } echo -e "\nTest without sync after creating and removing clones" run_test 0 0 echo -e "\nTest with sync after creating clones" run_test 1 0 echo -e "\nTest with sync after removing clones" run_test 0 1 echo -e "\nTest with sync after creating and removing clones" run_test 1 1 Running the test: $ ./test.sh Test without sync after creating and removing clones wrote 1048576/1048576 bytes at offset 0 1 MiB, 1 ops; 0.0017 sec (556.793 MiB/sec and 556.7929 ops/sec) Setting up swapspace version 1, size = 1020 KiB (1044480 bytes) no label, UUID=a6b9c29e-5ef4-4689-a8ac-bc199c750f02 swapon: /mnt/sdi/foo_clone_50: swapon failed: Invalid argument swapoff: /mnt/sdi/foo_clone_50: swapoff failed: Invalid argument Test with sync after creating clones wrote 1048576/1048576 bytes at offset 0 1 MiB, 1 ops; 0.0036 sec (271.739 MiB/sec and 271.7391 ops/sec) Setting up swapspace version 1, size = 1020 KiB (1044480 bytes) no label, UUID=5e9008d6-1f7a-4948-a1b4-3f30aba20a33 swapon: /mnt/sdi/foo_clone_50: swapon failed: Invalid argument swapoff: /mnt/sdi/foo_clone_50: swapoff failed: Invalid argument Test with sync after removing clones 
wrote 1048576/1048576 bytes at offset 0 1 MiB, 1 ops; 0.0103 sec (96.665 MiB/sec and 96.6651 ops/sec) Setting up swapspace version 1, size = 1020 KiB (1044480 bytes) no label, UUID=916c2740-fa9f-4385-9f06-29c3f89e4764 Test with sync after creating and removing clones wrote 1048576/1048576 bytes at offset 0 1 MiB, 1 ops; 0.0031 sec (314.268 MiB/sec and 314.2678 ops/sec) Setting up swapspace version 1, size = 1020 KiB (1044480 bytes) no label, UUID=06aab1dd-4d90-49c0-bd9f-3a8db4e2f912 swapon: /mnt/sdi/foo_clone_50: swapon failed: Invalid argument swapoff: /mnt/sdi/foo_clone_50: swapoff failed: Invalid argument Fix this by reworking btrfs_swap_activate() to instead of using extent maps and checking for shared extents with can_nocow_extent(), iterate over the inode's file extent items and use the accurate btrfs_is_data_extent_shared(). CC: stable@vger.kernel.org # 5.4+ Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 22:04:17 +01:00
Filipe Manana	0525064bb8	btrfs: fix race with memory mapped writes when activating swap file When activating the swap file we flush all delalloc and wait for ordered extent completion, so that we don't miss any delalloc and extents before we check that the file's extent layout is usable for a swap file and activate the swap file. We are called with the inode's VFS lock acquired, so we won't race with buffered and direct IO writes, however we can still race with memory mapped writes since they don't acquire the inode's VFS lock. The race window is between flushing all delalloc and locking the whole file's extent range, since memory mapped writes lock an extent range with the length of a page. Fix this by acquiring the inode's mmap lock before we flush delalloc. CC: stable@vger.kernel.org # 5.4+ Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 22:03:43 +01:00
Boris Burkov	0fba7be1ca	btrfs: check folio mapping after unlock in put_file_data() When we call btrfs_read_folio() we get an unlocked folio, so it is possible for a different thread to concurrently modify folio->mapping. We must check that this hasn't happened once we do have the lock. CC: stable@vger.kernel.org # 6.12+ Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Boris Burkov <boris@bur.io> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 22:00:07 +01:00
Boris Burkov	3e74859ee3	btrfs: check folio mapping after unlock in relocate_one_folio() When we call btrfs_read_folio() to bring a folio uptodate, we unlock the folio. The result of that is that a different thread can modify the mapping (like remove it with invalidate) before we call folio_lock(). This results in an invalid page and we need to try again. In particular, if we are relocating concurrently with aborting a transaction, this can result in a crash like the following: BUG: kernel NULL pointer dereference, address: 0000000000000000 PGD 0 P4D 0 Oops: 0000 [#1] SMP CPU: 76 PID: 1411631 Comm: kworker/u322:5 Workqueue: events_unbound btrfs_reclaim_bgs_work RIP: 0010:set_page_extent_mapped+0x20/0xb0 RSP: 0018:ffffc900516a7be8 EFLAGS: 00010246 RAX: ffffea009e851d08 RBX: ffffea009e0b1880 RCX: 0000000000000000 RDX: 0000000000000000 RSI: ffffc900516a7b90 RDI: ffffea009e0b1880 RBP: 0000000003573000 R08: 0000000000000001 R09: ffff88c07fd2f3f0 R10: 0000000000000000 R11: 0000194754b575be R12: 0000000003572000 R13: 0000000003572fff R14: 0000000000100cca R15: 0000000005582fff FS: 0000000000000000(0000) GS:ffff88c07fd00000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 0000000000000000 CR3: 000000407d00f002 CR4: 00000000007706f0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 PKRU: 55555554 Call Trace: <TASK> ? __die+0x78/0xc0 ? page_fault_oops+0x2a8/0x3a0 ? __switch_to+0x133/0x530 ? wq_worker_running+0xa/0x40 ? exc_page_fault+0x63/0x130 ? asm_exc_page_fault+0x22/0x30 ? set_page_extent_mapped+0x20/0xb0 relocate_file_extent_cluster+0x1a7/0x940 relocate_data_extent+0xaf/0x120 relocate_block_group+0x20f/0x480 btrfs_relocate_block_group+0x152/0x320 btrfs_relocate_chunk+0x3d/0x120 btrfs_reclaim_bgs_work+0x2ae/0x4e0 process_scheduled_works+0x184/0x370 worker_thread+0xc6/0x3e0 ? blk_add_timer+0xb0/0xb0 kthread+0xae/0xe0 ? 
flush_tlb_kernel_range+0x90/0x90 ret_from_fork+0x2f/0x40 ? flush_tlb_kernel_range+0x90/0x90 ret_from_fork_asm+0x11/0x20 </TASK> This occurs because cleanup_one_transaction() calls destroy_delalloc_inodes() which calls invalidate_inode_pages2() which takes the folio_lock before setting mapping to NULL. We fail to check this, and subsequently call set_extent_mapping(), which assumes that mapping != NULL (in fact it asserts that in debug mode) Note that the "fixes" patch here is not the one that introduced the race (the very first iteration of this code from 2009) but a more recent change that made this particular crash happen in practice. Fixes: `e7f1326cc2` ("btrfs: set page extent mapped after read_folio in relocate_one_page") CC: stable@vger.kernel.org # 6.1+ Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Boris Burkov <boris@bur.io> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 22:00:07 +01:00
Filipe Manana	44f52bbe96	btrfs: fix use-after-free when COWing tree bock and tracing is enabled When COWing a tree block, at btrfs_cow_block(), and we have the tracepoint trace_btrfs_cow_block() enabled and preemption is also enabled (CONFIG_PREEMPT=y), we can trigger a use-after-free in the COWed extent buffer while inside the tracepoint code. This is because in some paths that call btrfs_cow_block(), such as btrfs_search_slot(), we are holding the last reference on the extent buffer @buf so btrfs_force_cow_block() drops the last reference on the @buf extent buffer when it calls free_extent_buffer_stale(buf), which schedules the release of the extent buffer with RCU. This means that if we are on a kernel with preemption, the current task may be preempted before calling trace_btrfs_cow_block() and the extent buffer already released by the time trace_btrfs_cow_block() is called, resulting in a use-after-free. Fix this by moving the trace_btrfs_cow_block() from btrfs_cow_block() to btrfs_force_cow_block() before the COWed extent buffer is freed. This also has a side effect of invoking the tracepoint in the tree defrag code, at defrag.c:btrfs_realloc_node(), since btrfs_force_cow_block() is called there, but this is fine and it was actually missing there. Reported-by: syzbot+8517da8635307182c8a5@syzkaller.appspotmail.com Link: https://lore.kernel.org/linux-btrfs/6759a9b9.050a0220.1ac542.000d.GAE@google.com/ CC: stable@vger.kernel.org # 5.4+ Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 21:59:32 +01:00
Johannes Thumshirn	d29662695e	btrfs: fix use-after-free waiting for encoded read endios Fix a use-after-free in the I/O completion path for encoded reads by using a completion instead of a wait_queue for synchronizing the destruction of 'struct btrfs_encoded_read_private'. Fixes: `1881fba89b` ("btrfs: add BTRFS_IOC_ENCODED_READ ioctl") CC: stable@vger.kernel.org # 6.1+ Reviewed-by: Filipe Manana <fdmanana@suse.com> Reviewed-by: Qu Wenruo <wqu@suse.com> Signed-off-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Reviewed-by: David Sterba <dsterba@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-23 21:55:06 +01:00
Linus Torvalds	f07044dd0d	Merge tag 'nfsd-6.13-1' of git://git.kernel.org/pub/scm/linux/kernel/git/cel/linux Pull nfsd fixes from Chuck Lever: - Revert one v6.13 fix at the author's request (to be done differently) - Fix a minor problem with recent NFSv4.2 COPY enhancements - Fix an NFSv4.0 callback bug introduced in the v6.13 merge window * tag 'nfsd-6.13-1' of git://git.kernel.org/pub/scm/linux/kernel/git/cel/linux: nfsd: restore callback functionality for NFSv4.0 NFSD: fix management of pending async copies nfsd: Revert "nfsd: release svc_expkey/svc_export with rcu_work"	2024-12-23 12:16:15 -08:00
Masami Hiramatsu (Google)	d685d55dfc	tracing/kprobe: Make trace_kprobe's module callback called after jump_label update Make sure the trace_kprobe's module notifier callback function is called after jump_label's callback is called. Since the trace_kprobe's callback eventually checks the jump_label address when registering a new kprobe on the loading module, jump_label must be updated before this registration happens. Link: https://lore.kernel.org/all/173387585556.995044.3157941002975446119.stgit@devnote2/ Fixes: `6142431810` ("tracing/kprobes: Support module init function probing") Signed-off-by: Masami Hiramatsu (Google) <mhiramat@kernel.org>	2024-12-24 00:08:13 +09:00
Dr. David Alan Gilbert	f17224c2a7	cifs: Remove unused is_server_using_iface() The last use of is_server_using_iface() was removed in 2022 by commit `aa45dadd34` ("cifs: change iface_list from array to sorted linked list") Remove it. Signed-off-by: Dr. David Alan Gilbert <linux@treblig.org> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-23 08:06:05 -06:00
Bharath SM	b8ea3b1ff5	smb: enable reuse of deferred file handles for write operations Previously, deferred file handles were reused only for read operations; this commit extends reuse of deferred handles to write operations. By reusing these handles we can reduce the need for open/close operations over the wire. Signed-off-by: Bharath SM <bharathsm@microsoft.com> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-23 08:05:39 -06:00
Sebastian Andrzej Siewior	0b7a66a2c8	preempt: Move PREEMPT_RT before PREEMPT in vermagic. Since dynamic preemption has been enabled for PREEMPT_RT, CONFIG_PREEMPT and CONFIG_PREEMPT_RT are now set simultaneously. This affects the vermagic string, which now reports PREEMPT even when PREEMPT_RT is enabled. A PREEMPT_RT module usually cannot be loaded on a PREEMPT kernel because some symbols are missing. However, if the symbols are fine, loading continues and the kernel crashes later. The problem is that struct module has a different layout, and the num_exentries or init members are at a different position, leading to a crash later on. This is not necessarily caught by the size check in elf_validity_cache_index_mod() because the mem member has an alignment requirement of __module_memory_align, which is big enough to keep the total size unchanged. Therefore we should keep the string accurate instead of removing it. Move the PREEMPT_RT check before the PREEMPT so that it takes precedence if both symbols are enabled. Fixes: `35772d627b` ("sched: Enable PREEMPT_DYNAMIC for PREEMPT_RT") Signed-off-by: Sebastian Andrzej Siewior <bigeasy@linutronix.de> Reviewed-by: Petr Pavlu <petr.pavlu@suse.com> Link: https://lore.kernel.org/r/20241205160602.3lIAsJRT@linutronix.de Signed-off-by: Petr Pavlu <petr.pavlu@suse.com>	2024-12-23 10:46:38 +01:00
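Why the order of the checks matters can be sketched with a preprocessor chain in which both symbols are defined. This is an illustration only: the `MODEL_*` macro names, the string values, and `model_vermagic()` are assumptions and are not copied from the kernel's vermagic header.

```c
/* Both options are set at once, as described in the commit message. */
#define MODEL_CONFIG_PREEMPT 1
#define MODEL_CONFIG_PREEMPT_RT 1

/* The first matching branch wins, so testing the more specific
 * option first lets it take precedence when both are defined. */
#if defined(MODEL_CONFIG_PREEMPT_RT)
#define MODEL_VERMAGIC_PREEMPT "preempt_rt "
#elif defined(MODEL_CONFIG_PREEMPT)
#define MODEL_VERMAGIC_PREEMPT "preempt "
#else
#define MODEL_VERMAGIC_PREEMPT ""
#endif

static const char *model_vermagic(void)
{
	return MODEL_VERMAGIC_PREEMPT;
}
```

Swapping the first two branches would make the chain report the generic string whenever both symbols are set, which is the mismatch the fix avoids.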
Linus Torvalds	4bbf9020be	Linux 6.13-rc4	2024-12-22 13:22:21 -08:00
Linus Torvalds	b1fdbe77be	Merge tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm Pull KVM x86 fixes from Paolo Bonzini: - Disable AVIC on SNP-enabled systems that don't allow writes to the virtual APIC page, as such hosts will hit unexpected RMP #PFs in the host when running VMs of any flavor. - Fix a WARN in the hypercall completion path due to KVM trying to determine if a guest with protected register state is in 64-bit mode (KVM's ABI is to assume such guests only make hypercalls in 64-bit mode). - Allow the guest to write to supported bits in MSR_AMD64_DE_CFG to fix a regression with Windows guests, and because KVM's read-only behavior appears to be entirely made up. - Treat TDP MMU faults as spurious if the faulting access is allowed given the existing SPTE. This fixes a benign WARN (other than the WARN itself) due to unexpectedly replacing a writable SPTE with a read-only SPTE. - Emit a warning when KVM is configured with ignore_msrs=1 and also to hide the MSRs that the guest is looking for from the kernel logs. ignore_msrs can trick guests into assuming that certain processor features are present, and this in turn leads to bogus bug reports. * tag 'for-linus' of git://git.kernel.org/pub/scm/virt/kvm/kvm: KVM: x86: let it be known that ignore_msrs is a bad idea KVM: VMX: don't include '<linux/find.h>' directly KVM: x86/mmu: Treat TDP MMU faults as spurious if access is already allowed KVM: SVM: Allow guest writes to set MSR_AMD64_DE_CFG bits KVM: x86: Play nice with protected guests in complete_hypercall_exit() KVM: SVM: Disable AVIC on SNP-enabled system without HvInUseWrAllowed feature	2024-12-22 12:16:41 -08:00
Paolo Bonzini	8afa5b10af	Merge tag 'kvm-x86-fixes-6.13-rcN' of https://github.com/kvm-x86/linux into HEAD KVM x86 fixes for 6.13: - Disable AVIC on SNP-enabled systems that don't allow writes to the virtual APIC page, as such hosts will hit unexpected RMP #PFs in the host when running VMs of any flavor. - Fix a WARN in the hypercall completion path due to KVM trying to determine if a guest with protected register state is in 64-bit mode (KVM's ABI is to assume such guests only make hypercalls in 64-bit mode). - Allow the guest to write to supported bits in MSR_AMD64_DE_CFG to fix a regression with Windows guests, and because KVM's read-only behavior appears to be entirely made up. - Treat TDP MMU faults as spurious if the faulting access is allowed given the existing SPTE. This fixes a benign WARN (other than the WARN itself) due to unexpectedly replacing a writable SPTE with a read-only SPTE.	2024-12-22 12:07:16 -05:00
Paolo Bonzini	398b7b6cb9	KVM: x86: let it be known that ignore_msrs is a bad idea When running KVM with ignore_msrs=1 and report_ignored_msrs=0, the user has no clue that the guest is being lied to. This may cause bug reports such as https://gitlab.com/qemu-project/qemu/-/issues/2571, where enabling a CPUID bit in QEMU caused Linux guests to try reading MSR_CU_DEF_ERR; and being lied to about the existence of MSR_CU_DEF_ERR caused the guest to assume other things about the local APIC which were not true: Sep 14 12:02:53 kernel: mce: [Firmware Bug]: Your BIOS is not setting up LVT offset 0x2 for deferred error IRQs correctly. Sep 14 12:02:53 kernel: unchecked MSR access error: RDMSR from 0x852 at rIP: 0xffffffffb548ffa7 (native_read_msr+0x7/0x40) Sep 14 12:02:53 kernel: Call Trace: ... Sep 14 12:02:53 kernel: native_apic_msr_read+0x20/0x30 Sep 14 12:02:53 kernel: setup_APIC_eilvt+0x47/0x110 Sep 14 12:02:53 kernel: mce_amd_feature_init+0x485/0x4e0 ... Sep 14 12:02:53 kernel: [Firmware Bug]: cpu 0, try to use APIC520 (LVT offset 2) for vector 0xf4, but the register is already in use for vector 0x0 on this cpu Without report_ignored_msrs=0 at least the host kernel log will contain enough information to avoid going on a wild goose chase. But if reports about individual MSR accesses are being silenced too, at least complain loudly the first time a VM is started. Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2024-12-22 12:06:01 -05:00
Wolfram Sang	37d1d99b88	KVM: VMX: don't include '<linux/find.h>' directly The header clearly states that it does not want to be included directly, only via '<linux/bitmap.h>'. Replace the include accordingly. Signed-off-by: Wolfram Sang <wsa+renesas@sang-engineering.com> Message-ID: <20241217070539.2433-2-wsa+renesas@sang-engineering.com> Signed-off-by: Paolo Bonzini <pbonzini@redhat.com>	2024-12-22 12:04:57 -05:00
Linus Torvalds	bcde95ce32	Merge tag 'devicetree-fixes-for-6.13-1' of git://git.kernel.org/pub/scm/linux/kernel/git/robh/linux Pull devicetree fixes from Rob Herring: - Disable #address-cells/#size-cells warning on coreboot (Chromebooks) platforms - Add missing root #address-cells/#size-cells in default empty DT - Fix uninitialized variable in of_irq_parse_one() - Fix interrupt-map cell length check in of_irq_parse_imap_parent() - Fix refcount handling in __of_get_dma_parent() - Fix error path in of_parse_phandle_with_args_map() - Fix dma-ranges handling with flags cells - Drop explicit fw_devlink handling of 'interrupt-parent' - Fix "compression" typo in fixed-partitions binding - Unify "fsl,liodn" property type definitions * tag 'devicetree-fixes-for-6.13-1' of git://git.kernel.org/pub/scm/linux/kernel/git/robh/linux: of: Add coreboot firmware to excluded default cells list of/irq: Fix using uninitialized variable @addr_len in API of_irq_parse_one() of/irq: Fix interrupt-map cell length check in of_irq_parse_imap_parent() of: Fix refcount leakage for OF node returned by __of_get_dma_parent() of: Fix error path in of_parse_phandle_with_args_map() dt-bindings: mtd: fixed-partitions: Fix "compression" typo of: Add #address-cells/#size-cells in the device-tree root empty node dt-bindings: Unify "fsl,liodn" type definitions of: address: Preserve the flags portion on 1:1 dma-ranges mapping of/unittest: Add empty dma-ranges address translation tests of: property: fw_devlink: Do not use interrupt-parent directly	2024-12-22 08:40:23 -08:00
Linus Torvalds	48f506ad0b	Merge tag 'soc-fixes-6.13-2' of git://git.kernel.org/pub/scm/linux/kernel/git/soc/soc Pull SoC fixes from Arnd Bergmann: "Two more small fixes, correcting the cacheline size on Raspberry Pi 5 and fixing a logic mistake in the microchip mpfs firmware driver" * tag 'soc-fixes-6.13-2' of git://git.kernel.org/pub/scm/linux/kernel/git/soc/soc: arm64: dts: broadcom: Fix L2 linesize for Raspberry Pi 5 firmware: microchip: fix UL_IAP lock check in mpfs_auto_update_state()	2024-12-21 15:45:06 -08:00
Linus Torvalds	4aa748dd1a	Merge tag 'mm-hotfixes-stable-2024-12-21-12-09' of git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm Pull misc fixes from Andrew Morton: "25 hotfixes. 16 are cc:stable. 19 are MM and 6 are non-MM. The usual bunch of singletons and doubletons - please see the relevant changelogs for details" * tag 'mm-hotfixes-stable-2024-12-21-12-09' of git://git.kernel.org/pub/scm/linux/kernel/git/akpm/mm: (25 commits) mm: huge_memory: handle strsep not finding delimiter alloc_tag: fix set_codetag_empty() when !CONFIG_MEM_ALLOC_PROFILING_DEBUG alloc_tag: fix module allocation tags populated area calculation mm/codetag: clear tags before swap mm/vmstat: fix a W=1 clang compiler warning mm: convert partially_mapped set/clear operations to be atomic nilfs2: fix buffer head leaks in calls to truncate_inode_pages() vmalloc: fix accounting with i915 mm/page_alloc: don't call pfn_to_page() on possibly non-existent PFN in split_large_buddy() fork: avoid inappropriate uprobe access to invalid mm nilfs2: prevent use of deleted inode zram: fix uninitialized ZRAM not releasing backing device zram: refuse to use zero sized block device as backing device mm: use clear_user_(high)page() for arch with special user folio handling mm: introduce cpu_icache_is_aliasing() across all architectures mm: add RCU annotation to pte_offset_map(_lock) mm: correctly reference merged VMA mm: use aligned address in copy_user_gigantic_page() mm: use aligned address in clear_gigantic_page() mm: shmem: fix ShmemHugePages at swapout ...	2024-12-21 15:31:56 -08:00
Steven Rostedt	e84a3bf7f4	staging: gpib: Fix allyesconfig build failures My tests run an allyesconfig build and it failed with the following errors: LD [M] samples/kfifo/dma-example.ko ld.lld: error: undefined symbol: nec7210_board_reset ld.lld: error: undefined symbol: nec7210_read ld.lld: error: undefined symbol: nec7210_write It appears that some modules call the function nec7210_board_reset() that is defined in nec7210.c. In an allyesconfig build, these other modules are built in. But the file that holds nec7210_board_reset() has: obj-m += nec7210.o Where that "-m" means it only gets built as a module. With the other modules built in, they have no access to nec7210_board_reset() and the build fails. This isn't the only function. After fixing that one, I hit another: ld.lld: error: undefined symbol: push_gpib_event ld.lld: error: undefined symbol: gpib_match_device_path Where push_gpib_event() was also used outside of the file it was defined in, and that file too only was built as a module. Since the directory that nec7210.c is only traversed when CONFIG_GPIB_NEC7210 is set, and the directory with gpib_common.c is only traversed when CONFIG_GPIB_COMMON is set, use those configs as the option to build those modules. When it is an allyesconfig, then they will both be built in and their functions will be available to the other modules that are also built in. Fixes: `3ba84ac69b` ("staging: gpib: Add nec7210 GPIB chip driver") Fixes: `9dde4559e9` ("staging: gpib: Add GPIB common core driver") Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org> Reviewed-by: Palmer Dabbelt <palmer@rivosinc.com> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>	2024-12-21 11:30:13 -08:00
Linus Torvalds	a016546ba6	Merge tag 'kbuild-fixes-v6.13-2' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild Pull Kbuild fixes from Masahiro Yamada: - Remove stale code in usr/include/headers_check.pl - Fix issues in the user-mode-linux Debian package - Fix false-positive "export twice" errors in modpost * tag 'kbuild-fixes-v6.13-2' of git://git.kernel.org/pub/scm/linux/kernel/git/masahiroy/linux-kbuild: modpost: distinguish same module paths from different dump files kbuild: deb-pkg: Do not install maint scripts for arch 'um' kbuild: deb-pkg: add debarch for ARCH=um kbuild: Drop support for include/asm-<arch> in headers_check.pl	2024-12-21 11:24:32 -08:00
Linus Torvalds	9c707ba99f	Merge tag 'bpf-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/bpf/bpf Pull BPF fixes from Daniel Borkmann: - Fix inlining of bpf_get_smp_processor_id helper for !CONFIG_SMP systems (Andrea Righi) - Fix BPF USDT selftests helper code to use asm constraint "m" for LoongArch (Tiezhu Yang) - Fix BPF selftest compilation error in get_uprobe_offset when PROCMAP_QUERY is not defined (Jerome Marchand) - Fix BPF bpf_skb_change_tail helper when used in context of BPF sockmap to handle negative skb header offsets (Cong Wang) - Several fixes to BPF sockmap code, among others, in the area of socket buffer accounting (Levi Zim, Zijian Zhang, Cong Wang) * tag 'bpf-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/bpf/bpf: selftests/bpf: Test bpf_skb_change_tail() in TC ingress selftests/bpf: Introduce socket_helpers.h for TC tests selftests/bpf: Add a BPF selftest for bpf_skb_change_tail() bpf: Check negative offsets in __bpf_skb_min_len() tcp_bpf: Fix copied value in tcp_bpf_sendmsg skmsg: Return copied bytes in sk_msg_memcopy_from_iter tcp_bpf: Add sk_rmem_alloc related logic for tcp_bpf ingress redirection tcp_bpf: Charge receive socket buffer in bpf_tcp_ingress() selftests/bpf: Fix compilation error in get_uprobe_offset() selftests/bpf: Use asm constraint "m" for LoongArch bpf: Fix bpf_get_smp_processor_id() on !CONFIG_SMP	2024-12-21 11:07:19 -08:00
Linus Torvalds	876685ce5e	Merge tag 'media/v6.13-3' of git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-media Pull media fixes from Mauro Carvalho Chehab: - fix a clang build issue with mediatec vcodec - add missing variable initialization to dib3000mb write function * tag 'media/v6.13-3' of git://git.kernel.org/pub/scm/linux/kernel/git/mchehab/linux-media: media: mediatek: vcodec: mark vdec_vp9_slice_map_counts_eob_coef noinline media: dvb-frontends: dib3000mb: fix uninit-value in dib3000_write_reg	2024-12-21 10:56:34 -08:00
Linus Torvalds	a99b4a369a	Merge tag 'pci-v6.13-fixes-2' of git://git.kernel.org/pub/scm/linux/kernel/git/pci/pci Pull PCI fixes from Krzysztof Wilczyński: "Two small patches that are important for fixing a boot time hang on Intel JHL7540 'Titan Ridge' platforms equipped with a Thunderbolt controller. The boot time issue manifests itself when PCI Express bandwidth control is unnecessarily enabled on the Thunderbolt controller downstream ports, which only support a link speed of 2.5 GT/s in accordance with USB4 v2 specification (p. 671, sec. 11.2.1, "PCIe Physical Layer Logical Sub-block"). As such, there is no need to enable bandwidth control on such downstream port links, which also works around the issue. Both patches were tested by the original reporter on the hardware on which the failure originally manifested itself. Both fixes were proven to resolve the reported boot hang issue, and both patches have been in linux-next this week with no reported problems" * tag 'pci-v6.13-fixes-2' of git://git.kernel.org/pub/scm/linux/kernel/git/pci/pci: PCI/bwctrl: Enable only if more than one speed is supported PCI: Honor Max Link Speed when determining supported speeds	2024-12-21 10:51:04 -08:00
Linus Torvalds	78b1346123	Merge tag 'pm-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm Pull power management fixes from Rafael Wysocki: "These fix some amd-pstate driver issues: - Detect preferred core support in amd-pstate before driver registration to avoid initialization ordering issues (K Prateek Nayak) - Fix issues with boost numerator handling in amd-pstate leading to inconsistently programmed CPPC max performance values (Mario Limonciello)" * tag 'pm-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm: cpufreq/amd-pstate: Use boost numerator for upper bound of frequencies cpufreq/amd-pstate: Store the boost numerator as highest perf again cpufreq/amd-pstate: Detect preferred core support before driver registration	2024-12-21 10:47:47 -08:00
Linus Torvalds	be6bb3619e	Merge tag 'thermal-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm Pull thermal control fixes from Rafael Wysocki: "Fix two issues with the user thermal thresholds feature introduced in this development cycle (Daniel Lezcano)" * tag 'thermal-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm: thermal/thresholds: Fix boundaries and detection routine thermal/thresholds: Fix uapi header macros leading to a compilation error	2024-12-21 10:44:44 -08:00
Linus Torvalds	5100b6f9e7	Merge tag 'acpi-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm Pull ACPI fix from Rafael Wysocki: "Unbreak ACPI EC support on LoongArch that has been broken earlier in this development cycle (Huacai Chen)" * tag 'acpi-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/rafael/linux-pm: ACPI: EC: Enable EC support on LoongArch by default	2024-12-21 10:42:35 -08:00
Linus Torvalds	baa172c77a	Merge tag '6.13-rc3-SMB3-client-fixes' of git://git.samba.org/sfrench/cifs-2.6 Pull smb client fixes from Steve French: - fix regression in display of write stats - fix rmmod failure with network namespaces - two minor cleanups * tag '6.13-rc3-SMB3-client-fixes' of git://git.samba.org/sfrench/cifs-2.6: smb: fix bytes written value in /proc/fs/cifs/Stats smb: client: fix TCP timers deadlock after rmmod smb: client: Deduplicate "select NETFS_SUPPORT" in Kconfig smb: use macros instead of constants for leasekey size and default cifsattrs value	2024-12-21 09:35:18 -08:00
Linus Torvalds	4a5da3f5d3	Merge tag 'nfs-for-6.13-2' of git://git.linux-nfs.org/projects/trondmy/linux-nfs Pull NFS client fixes from Trond Myklebust: - NFS/pnfs: Fix a live lock between recalled layouts and layoutget - Fix a build warning about an undeclared symbol 'nfs_idmap_cache_timeout' * tag 'nfs-for-6.13-2' of git://git.linux-nfs.org/projects/trondmy/linux-nfs: fs/nfs: fix missing declaration of nfs_idmap_cache_timeout NFS/pnfs: Fix a live lock between recalled layouts and layoutget	2024-12-21 09:32:24 -08:00
Linus Torvalds	7684392f17	Merge tag 'ceph-for-6.13-rc4' of https://github.com/ceph/ceph-client Pull ceph fixes from Ilya Dryomov: "A handful of important CephFS fixes from Max, Alex and myself: memory corruption due to a buffer overrun, potential infinite loop and several memory leaks on the error paths. All but one marked for stable" * tag 'ceph-for-6.13-rc4' of https://github.com/ceph/ceph-client: ceph: allocate sparse_ext map only for sparse reads ceph: fix memory leak in ceph_direct_read_write() ceph: improve error handling and short/overflow-read logic in __ceph_sync_read() ceph: validate snapdirname option length when mounting ceph: give up on paths longer than PATH_MAX ceph: fix memory leaks in __ceph_sync_read()	2024-12-21 09:29:46 -08:00
Masahiro Yamada	9435dc77a3	modpost: distinguish same module paths from different dump files Since commit `13b25489b6` ("kbuild: change working directory to external module directory with M="), module paths are always relative to the top of the external module tree. The module paths recorded in Module.symvers are no longer globally unique when they are passed via KBUILD_EXTRA_SYMBOLS for building other external modules, which may result in false-positive "exported twice" errors. Such errors should not occur because external modules should be able to override in-tree modules. To address this, record the dump file path in struct module and check it when searching for a module. Fixes: `13b25489b6` ("kbuild: change working directory to external module directory with M=") Reported-by: Jon Hunter <jonathanh@nvidia.com> Closes: https://lore.kernel.org/all/eb21a546-a19c-40df-b821-bbba80f19a3d@nvidia.com/ Signed-off-by: Masahiro Yamada <masahiroy@kernel.org> Tested-by: Jon Hunter <jonathanh@nvidia.com>	2024-12-21 12:42:10 +09:00
Nicolas Schier	54956567a0	kbuild: deb-pkg: Do not install maint scripts for arch 'um' Stop installing Debian maintainer scripts when building a user-mode-linux Debian package. Debian maintainer scripts are used for e.g. requesting rebuilds of initrd, rebuilding DKMS modules and updating of grub configuration. As none of this is relevant for UML and it may also lead to failures while processing the kernel hooks, no longer install maintainer scripts for the UML package. Suggested-by: Masahiro Yamada <masahiroy@kernel.org> Signed-off-by: Nicolas Schier <nicolas@fjasle.eu> Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>	2024-12-21 12:42:10 +09:00
Masahiro Yamada	a34e92d2e8	kbuild: deb-pkg: add debarch for ARCH=um 'make ARCH=um bindeb-pkg' shows the following warning. $ make ARCH=um bindeb-pkg [snip] GEN debian WARNING Your architecture doesn't have its equivalent Debian userspace architecture defined! Falling back to the current host architecture (amd64). Please add support for um to ./scripts/package/mkdebian ... This commit hard-codes i386/amd64 because UML is only supported for x86. Signed-off-by: Masahiro Yamada <masahiroy@kernel.org> Reviewed-by: Nicolas Schier <nicolas@fjasle.eu>	2024-12-21 12:42:04 +09:00
Geert Uytterhoeven	d67393f4d2	kbuild: Drop support for include/asm-<arch> in headers_check.pl "include/asm-<arch>" was replaced by "arch/<arch>/include/asm" a long time ago. All assembler header files are now included using "#include <asm/*>", so there is no longer a need to rewrite paths. Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be> Signed-off-by: Masahiro Yamada <masahiroy@kernel.org>	2024-12-21 11:43:17 +09:00
Cong Wang	4a58963d10	selftests/bpf: Test bpf_skb_change_tail() in TC ingress Similarly to the previous test, we also need a test case to cover positive offsets as well, TC is an excellent hook for this. Signed-off-by: Cong Wang <cong.wang@bytedance.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Tested-by: Zijian Zhang <zijianzhang@bytedance.com> Acked-by: John Fastabend <john.fastabend@gmail.com> Link: https://lore.kernel.org/bpf/20241213034057.246437-5-xiyou.wangcong@gmail.com	2024-12-20 23:13:31 +01:00
Cong Wang	472759c9f5	selftests/bpf: Introduce socket_helpers.h for TC tests Pull socket helpers out of sockmap_helpers.h so that they can be reused for TC tests as well. This prepares for the next patch. Signed-off-by: Cong Wang <cong.wang@bytedance.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: John Fastabend <john.fastabend@gmail.com> Link: https://lore.kernel.org/bpf/20241213034057.246437-4-xiyou.wangcong@gmail.com	2024-12-20 23:13:31 +01:00
Cong Wang	9ee0c7b865	selftests/bpf: Add a BPF selftest for bpf_skb_change_tail() As requested by Daniel, we need to add a selftest to cover bpf_skb_change_tail() cases in skb_verdict. Here we test trimming, growing and error cases, and validate its expected return values and the expected sizes of the payload. Signed-off-by: Cong Wang <cong.wang@bytedance.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: John Fastabend <john.fastabend@gmail.com> Link: https://lore.kernel.org/bpf/20241213034057.246437-3-xiyou.wangcong@gmail.com	2024-12-20 23:13:31 +01:00
Cong Wang	9ecc4d858b	bpf: Check negative offsets in __bpf_skb_min_len() skb_network_offset() and skb_transport_offset() can be negative when they are called after we pull the transport header, for example, when we use eBPF sockmap at the point of ->sk_data_ready(). __bpf_skb_min_len() uses an unsigned int to get these offsets, which leads to a very large number that then causes bpf_skb_change_tail() to fail unexpectedly. Fix this by using a signed int to get these offsets and ensure the minimum is at least zero. Fixes: `5293efe62d` ("bpf: add bpf_skb_change_tail helper") Signed-off-by: Cong Wang <cong.wang@bytedance.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: John Fastabend <john.fastabend@gmail.com> Link: https://lore.kernel.org/bpf/20241213034057.246437-2-xiyou.wangcong@gmail.com	2024-12-20 23:13:31 +01:00
Linus Torvalds	499551201b	Merge tag 'arm64-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/arm64/linux Pull arm64 fix from Catalin Marinas: "Fix a sparse warning in the arm64 signal code dealing with the user shadow stack register, GCSPR_EL0" * tag 'arm64-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/arm64/linux: arm64/signal: Silence sparse warning storing GCSPR_EL0	2024-12-20 14:10:01 -08:00
Levi Zim	5153a75ef3	tcp_bpf: Fix copied value in tcp_bpf_sendmsg bpf kselftest sockhash::test_txmsg_cork_hangs in test_sockmap.c triggers a kernel NULL pointer dereference: BUG: kernel NULL pointer dereference, address: 0000000000000008 ? __die_body+0x6e/0xb0 ? __die+0x8b/0xa0 ? page_fault_oops+0x358/0x3c0 ? local_clock+0x19/0x30 ? lock_release+0x11b/0x440 ? kernelmode_fixup_or_oops+0x54/0x60 ? __bad_area_nosemaphore+0x4f/0x210 ? mmap_read_unlock+0x13/0x30 ? bad_area_nosemaphore+0x16/0x20 ? do_user_addr_fault+0x6fd/0x740 ? prb_read_valid+0x1d/0x30 ? exc_page_fault+0x55/0xd0 ? asm_exc_page_fault+0x2b/0x30 ? splice_to_socket+0x52e/0x630 ? shmem_file_splice_read+0x2b1/0x310 direct_splice_actor+0x47/0x70 splice_direct_to_actor+0x133/0x300 ? do_splice_direct+0x90/0x90 do_splice_direct+0x64/0x90 ? __ia32_sys_tee+0x30/0x30 do_sendfile+0x214/0x300 __se_sys_sendfile64+0x8e/0xb0 __x64_sys_sendfile64+0x25/0x30 x64_sys_call+0xb82/0x2840 do_syscall_64+0x75/0x110 entry_SYSCALL_64_after_hwframe+0x4b/0x53 This is caused by tcp_bpf_sendmsg() returning a larger value(12289) than size (8192), which causes the while loop in splice_to_socket() to release an uninitialized pipe buf. The underlying cause is that this code assumes sk_msg_memcopy_from_iter() will copy all bytes upon success but it actually might only copy part of it. This commit changes it to use the real copied bytes. Signed-off-by: Levi Zim <rsworktech@outlook.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Tested-by: Björn Töpel <bjorn@kernel.org> Reviewed-by: John Fastabend <john.fastabend@gmail.com> Link: https://lore.kernel.org/bpf/20241130-tcp-bpf-sendmsg-v1-2-bae583d014f3@outlook.com	2024-12-20 22:53:36 +01:00
Levi Zim	fdf478d236	skmsg: Return copied bytes in sk_msg_memcopy_from_iter Previously sk_msg_memcopy_from_iter returns the copied bytes from the last copy_from_iter{,_nocache} call upon success. This commit changes it to return the total number of copied bytes on success. Signed-off-by: Levi Zim <rsworktech@outlook.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Tested-by: Björn Töpel <bjorn@kernel.org> Reviewed-by: John Fastabend <john.fastabend@gmail.com> Link: https://lore.kernel.org/bpf/20241130-tcp-bpf-sendmsg-v1-1-bae583d014f3@outlook.com	2024-12-20 22:53:36 +01:00
Linus Torvalds	d74276290c	Merge tag 'hwmon-for-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging Pull hwmon fixes from Guenter Roeck: - Fix reporting of negative temperature, current, and voltage values in the tmp513 driver * tag 'hwmon-for-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging: hwmon: (tmp513) Fix interpretation of values of Temperature Result and Limit Registers hwmon: (tmp513) Fix Current Register value interpretation hwmon: (tmp513) Fix interpretation of values of Shunt Voltage and Limit Registers	2024-12-20 13:48:41 -08:00
Rob Herring (Arm)	8600058ba2	of: Add coreboot firmware to excluded default cells list Google Juniper and other Chromebook platforms have a very old bootloader which populates /firmware node without proper address/size-cells leading to warnings: Missing '#address-cells' in /firmware WARNING: CPU: 0 PID: 1 at drivers/of/base.c:106 of_bus_n_addr_cells+0x90/0xf0 Modules linked in: CPU: 0 UID: 0 PID: 1 Comm: swapper/0 Not tainted 6.12.0 #1 933ab9971ff4d5dc58cb378a96f64c7f72e3454d Hardware name: Google juniper sku16 board (DT) ... Missing '#size-cells' in /firmware WARNING: CPU: 0 PID: 1 at drivers/of/base.c:133 of_bus_n_size_cells+0x90/0xf0 Modules linked in: CPU: 0 UID: 0 PID: 1 Comm: swapper/0 Tainted: G W 6.12.0 #1 933ab9971ff4d5dc58cb378a96f64c7f72e3454d Tainted: [W]=WARN Hardware name: Google juniper sku16 board (DT) These platforms won't receive updated bootloader/firmware, so add an exclusion for platforms with a "coreboot" compatible node. While this is wider than necessary, that's the easiest fix and it doesn't matter if we miss checking other platforms using coreboot. We may revisit this later and address with a fixup to the DT itself. Reported-by: Sasha Levin <sashal@kernel.org> Closes: https://lore.kernel.org/all/Z0NUdoG17EwuCigT@sashalap/ Cc: AngeloGioacchino Del Regno <angelogioacchino.delregno@collabora.com> Cc: Matthias Brugger <matthias.bgg@gmail.com> Cc: Chen-Yu Tsai <wenst@chromium.org> Cc: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org> Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-12-20 15:39:22 -06:00
Linus Torvalds	11167b29e5	Merge tag 'block-6.13-20241220' of git://git.kernel.dk/linux Pull block fixes from Jens Axboe: - Minor cleanups for bdev/nvme using the helpers introduced - Revert of a deadlock fix that still needs more work - Fix a UAF of hctx in the cpu hotplug code * tag 'block-6.13-20241220' of git://git.kernel.dk/linux: block: avoid to reuse `hctx` not removed from cpuhp callback list block: Revert "block: Fix potential deadlock while freezing queue and acquiring sysfs_lock" nvme: use blk_validate_block_size() for max LBA check block/bdev: use helper for max block size check	2024-12-20 13:37:58 -08:00
Linus Torvalds	7c05bd9230	Merge tag 'io_uring-6.13-20241220' of git://git.kernel.dk/linux Pull io_uring fixes from Jens Axboe: - Fix for a file ref leak for registered ring fds - Turn the ->timeout_lock into a raw spinlock, as it nests under the io-wq lock which is a raw spinlock as it's called from the scheduler side - Limit ring resizing to DEFER_TASKRUN for now. We will broaden this in the future, but for now, ensure that it's only feasible on rings with a single user - Add sanity check for io-wq enqueuing * tag 'io_uring-6.13-20241220' of git://git.kernel.dk/linux: io_uring: check if iowq is killed before queuing io_uring/register: limit ring resizing to DEFER_TASKRUN io_uring: Fix registered ring file refcount leak io_uring: make ctx->timeout_lock a raw spinlock	2024-12-20 13:32:43 -08:00
Linus Torvalds	e9b8ffafd2	Merge tag 'usb-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb Pull USB / Thunderbolt fixes from Greg KH: "Here are some important, and small, fixes for USB and Thunderbolt issues that have come up in the -rc releases. And some new device ids for good measure. Included in here are: - Much reported xhci bugfix for usb-storage devices (and other devices as well, tripped me up on a video camera) - thunderbolt fixes for some small reported issues - new usb-serial device ids All of these have been in linux-next this week with no reported issues" * tag 'usb-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb: usb: xhci: fix ring expansion regression in 6.13-rc1 xhci: Turn NEC specific quirk for handling Stop Endpoint errors generic thunderbolt: Improve redrive mode handling USB: serial: option: add Telit FE910C04 rmnet compositions USB: serial: option: add MediaTek T7XX compositions USB: serial: option: add Netprisma LCUK54 modules for WWAN Ready USB: serial: option: add MeiG Smart SLM770A USB: serial: option: add TCL IK512 MBIM & ECM thunderbolt: Don't display nvm_version unless upgrade supported thunderbolt: Add support for Intel Panther Lake-M/P	2024-12-20 11:09:40 -08:00
Linus Torvalds	5127e1495b	Merge tag 'spi-fix-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/spi Pull spi fix from Mark Brown: "A fix for the remove path of the Rockchip driver, the code was just clearly and obviously wrong" * tag 'spi-fix-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/spi: spi: rockchip-sfc: Fix error in remove progress	2024-12-20 11:06:25 -08:00
Linus Torvalds	b648264cd4	Merge tag 'regulator-fix-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regulator Pull regulator fix from Mark Brown: "The recently added regulator-uv-survival-time-ms property was renamed during the review of the series that added it, but unfortunately only in the DT binding and not in the code that parses the binding. This brings the code in line with the binding, if someone started using the original name we can add compat support for it but there's nothing upstream yet and it's a very niche feature so hopefully not" * tag 'regulator-fix-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/broonie/regulator: regulator: rename regulator-uv-survival-time-ms according to DT binding	2024-12-20 11:04:02 -08:00
Linus Torvalds	af215c980c	Merge tag 'drm-fixes-2024-12-20' of https://gitlab.freedesktop.org/drm/kernel Pull drm fixes from Dave Airlie: "Probably the last pull before Christmas holidays, I'll still be around for most of the time anyways, nothing too major in here, bunch of amdgpu and i915 along with a smattering of fixes across the board. core: - fix FB dependency - avoid div by 0 more in vrefresh - maintainers update display: - fix DP tunnel error path dma-buf: - fix !DEBUG_FS sched: - docs warning fix panel: - collection of misc panel fixes i915: - Reset engine utilization buffer before registration - Ensure busyness counter increases motonically - Accumulate active runtime on gt reset amdgpu: - Disable BOCO when CONFIG_HOTPLUG_PCI_PCIE is not enabled - scheduler job fixes - IP version check fixes - devcoredump fix - GPUVM update fix - NBIO 2.5 fix udmabuf: - fix memory leak on last export - sealing fixes ivpu: - fix NULL pointer - fix memory leak - fix WARN" * tag 'drm-fixes-2024-12-20' of https://gitlab.freedesktop.org/drm/kernel: (33 commits) drm/sched: Fix drm_sched_fini() docu generation accel/ivpu: Fix WARN in ivpu_ipc_send_receive_internal() accel/ivpu: Fix memory leak in ivpu_mmu_reserved_context_init() accel/ivpu: Fix general protection fault in ivpu_bo_list() drm/amdgpu/nbio7.0: fix IP version check drm/amd: Update strapping for NBIO 2.5.0 drm/amdgpu: Handle NULL bo->tbo.resource (again) in amdgpu_vm_bo_update drm/amdgpu: fix amdgpu_coredump drm/amdgpu/smu14.0.2: fix IP version check drm/amdgpu/gfx12: fix IP version check drm/amdgpu/mmhub4.1: fix IP version check drm/amdgpu/nbio7.11: fix IP version check drm/amdgpu/nbio7.7: fix IP version check drm/amdgpu: don't access invalid sched drm/amd: Require CONFIG_HOTPLUG_PCI_PCIE for BOCO drm: rework FB_CORE dependency drm/fbdev: Select FB_CORE dependency for fbdev on DMA and TTM fbdev: Fix recursive dependencies wrt BACKLIGHT_CLASS_DEVICE i915/guc: Accumulate active runtime on gt reset i915/guc: Ensure busyness counter increases motonically ...	2024-12-20 10:17:53 -08:00
Linus Torvalds	5b83bcdea5	Merge tag 'trace-ringbuffer-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace Pull ring-buffer fixes from Steven Rostedt: - Fix possible overflow of mmapped ring buffer with bad offset If the mmap() to the ring buffer passes in a start address that is past the end of the mmapped file, it is not caught and a slab-out-of-bounds is triggered. Add a check to make sure the start address is within the bounds - Do not use TP_printk() to boot mapped ring buffers As a boot mapped ring buffer's data may have pointers that map to the previous boot's memory map, it is unsafe to allow the TP_printk() to be used to read the boot mapped buffer's events. If a TP_printk() points to a static string from within the kernel it will not match the current kernel mapping if KASLR is active, and it can fault. Have it simply print out the raw fields. * tag 'trace-ringbuffer-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace: trace/ring-buffer: Do not use TP_printk() formatting for boot mapped buffers ring-buffer: Fix overflow in __rb_map_vma	2024-12-20 10:13:26 -08:00
Alexander Lobakin	724c6ce38b	stddef: make __struct_group() UAPI C++-friendly For most of its history, C++ has not allowed type declarations inside anonymous unions, for various reasons. At the same time, __struct_group() relies on the latter, so when the @TAG argument is not empty, C++ code doesn't want to build (even under `extern "C"`): ../linux/include/uapi/linux/pkt_cls.h:25:24: error: 'struct tc_u32_sel::<unnamed union>::tc_u32_sel_hdr,' invalid; an anonymous union may only have public non-static data members [-fpermissive] The safest way to fix this without trying to switch standards (which is impossible in UAPI anyway) etc., is to disable tag declaration for that language. This won't break anything since for now it's not buildable at all. Use a separate definition for __struct_group() when __cplusplus is defined to mitigate the error, including the version from tools/. Fixes: `50d7bd38c3` ("stddef: Introduce struct_group() helper macro") Reported-by: Christopher Ferris <cferris@google.com> Closes: https://lore.kernel.org/linux-hardening/Z1HZpe3WE5As8UAz@google.com Suggested-by: Kees Cook <kees@kernel.org> # __struct_group_tag() Signed-off-by: Alexander Lobakin <aleksander.lobakin@intel.com> Reviewed-by: Gustavo A. R. Silva <gustavoars@kernel.org> Link: https://lore.kernel.org/r/20241219135734.2130002-1-aleksander.lobakin@intel.com Signed-off-by: Kees Cook <kees@kernel.org>	2024-12-20 09:05:53 -08:00
Arnd Bergmann	a31ffd6ed5	Merge tag 'arm-soc/for-6.13/devicetree-arm64-fixes' of https://github.com/Broadcom/stblinux into arm/fixes This pull request contains Broadcom ARM64-based SoCs Device Tree fixes for 6.13, please pull the following: - Willow corrects the L2 cache line size on the Raspberry Pi 5 (2712) to the correct value of 64 bytes * tag 'arm-soc/for-6.13/devicetree-arm64-fixes' of https://github.com/Broadcom/stblinux: arm64: dts: broadcom: Fix L2 linesize for Raspberry Pi 5 Link: https://lore.kernel.org/r/20241217190547.868744-1-florian.fainelli@broadcom.com Signed-off-by: Arnd Bergmann <arnd@arndb.de>	2024-12-20 18:02:27 +01:00
Arnd Bergmann	a61dae1101	Merge tag 'riscv-soc-fixes-for-v6.13-rc4' of https://git.kernel.org/pub/scm/linux/kernel/git/conor/linux into arm/fixes RISC-V soc driver fixes for v6.13-rc4 A single fix for the Auto Update driver, where a mistake in array indexing (accessing as a u32 rather than a u8) caused the driver to read the wrong feature disable bits. Signed-off-by: Conor Dooley <conor.dooley@microchip.com> * tag 'riscv-soc-fixes-for-v6.13-rc4' of https://git.kernel.org/pub/scm/linux/kernel/git/conor/linux: firmware: microchip: fix UL_IAP lock check in mpfs_auto_update_state() Link: https://lore.kernel.org/r/20241218-suffrage-unfazed-fa0113072a42@spud Signed-off-by: Arnd Bergmann <arnd@arndb.de>	2024-12-20 18:00:30 +01:00
Zijian Zhang	d888b7af7c	tcp_bpf: Add sk_rmem_alloc related logic for tcp_bpf ingress redirection When we do sk_psock_verdict_apply->sk_psock_skb_ingress, an sk_msg will be created out of the skb, and the rmem accounting of the sk_msg will be handled by the skb. For skmsgs in __SK_REDIRECT case of tcp_bpf_send_verdict, when redirecting to the ingress of a socket, although we sk_rmem_schedule and add sk_msg to the ingress_msg of sk_redir, we do not update sk_rmem_alloc. As a result, except for the global memory limit, the rmem of sk_redir is nearly unlimited. Thus, add sk_rmem_alloc related logic to limit the recv buffer. Since the function sk_msg_recvmsg and __sk_psock_purge_ingress_msg are used in these two paths. We use "msg->skb" to test whether the sk_msg is skb backed up. If it's not, we shall do the memory accounting explicitly. Fixes: `604326b41a` ("bpf, sockmap: convert to generic sk_msg interface") Signed-off-by: Zijian Zhang <zijianzhang@bytedance.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Reviewed-by: John Fastabend <john.fastabend@gmail.com> Link: https://lore.kernel.org/bpf/20241210012039.1669389-3-zijianzhang@bytedance.com	2024-12-20 17:59:47 +01:00
Cong Wang	54f89b3178	tcp_bpf: Charge receive socket buffer in bpf_tcp_ingress() When bpf_tcp_ingress() is called, the skmsg is being redirected to the ingress of the destination socket. Therefore, we should charge its receive socket buffer, instead of sending socket buffer. Because sk_rmem_schedule() tests pfmemalloc of skb, we need to introduce a wrapper and call it for skmsg. Fixes: `604326b41a` ("bpf, sockmap: convert to generic sk_msg interface") Signed-off-by: Cong Wang <cong.wang@bytedance.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Reviewed-by: John Fastabend <john.fastabend@gmail.com> Link: https://lore.kernel.org/bpf/20241210012039.1669389-2-zijianzhang@bytedance.com	2024-12-20 17:59:47 +01:00
Kan Liang	aa5d2ca7c1	perf/x86/intel: Fix bitmask of OCR and FRONTEND events for LNC The released OCR and FRONTEND events utilized more bits on Lunar Lake p-core. The corresponding mask in the extra_regs has to be extended to unblock the extra bits. Add a dedicated intel_lnc_extra_regs. Fixes: `a932aa0e86` ("perf/x86: Add Lunar Lake and Arrow Lake support") Reported-by: Andi Kleen <ak@linux.intel.com> Signed-off-by: Kan Liang <kan.liang@linux.intel.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: stable@vger.kernel.org Link: https://lkml.kernel.org/r/20241216160252.430858-1-kan.liang@linux.intel.com	2024-12-20 15:31:14 +01:00
NeilBrown	7917f01a28	nfsd: restore callback functionality for NFSv4.0 A recent patch inadvertently broke callbacks for NFSv4.0. In the 4.0 case we do not expect a session to be found but still need to call setup_callback_client() which will not try to dereference it. This patch moves the check for failure to find a session into the 4.1+ branch of setup_callback_client() Fixes: `1e02c641c3` ("NFSD: Prevent NULL dereference in nfsd4_process_cb_update()") Signed-off-by: NeilBrown <neilb@suse.de> Reviewed-by: Jeff Layton <jlayton@kernel.org> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>	2024-12-20 09:17:12 -05:00
Mark Brown	926e862058	arm64/signal: Silence sparse warning storing GCSPR_EL0 We are seeing a sparse warning in gcs_restore_signal(): arch/arm64/kernel/signal.c:1054:9: sparse: sparse: cast removes address space '__user' of expression when storing the final GCSPR_EL0 value back into the register, caused by the fact that write_sysreg_s() casts the value it writes to a u64 which sparse sees as discarding the __userness of the pointer. Avoid this by treating the address as an integer, casting to a pointer only when using it to write to userspace. While we're at it also inline gcs_signal_cap_valid() into its one user and make equivalent updates to gcs_signal_entry(). Reported-by: kernel test robot <lkp@intel.com> Closes: https://lore.kernel.org/oe-kbuild-all/202412082005.OBJ0BbWs-lkp@intel.com/ Signed-off-by: Mark Brown <broonie@kernel.org> Link: https://lore.kernel.org/r/20241214-arm64-gcs-signal-sparse-v3-1-5e8d18fffc0c@kernel.org Signed-off-by: Catalin Marinas <catalin.marinas@arm.com>	2024-12-20 14:12:04 +00:00
Takashi Iwai	8cbd01ba9c	Merge tag 'asoc-fix-v6.13-rc3' of https://git.kernel.org/pub/scm/linux/kernel/git/broonie/sound into for-linus ASoC: Fixes for v6.13 A mix of quirks and small fixes, nothing too major anywhere.	2024-12-20 14:09:45 +01:00
Takashi Iwai	66a0a2b047	ALSA: sh: Fix wrong argument order for copy_from_iter() Fix a brown paper bag bug I introduced at converting to the standard iter helper; the arguments were wrongly passed and have to be swapped. Fixes: `9b5f8ee43e` ("ALSA: sh: Use standard helper for buffer accesses") Reported-by: kernel test robot <lkp@intel.com> Closes: https://lore.kernel.org/oe-kbuild-all/202412140019.jat5Dofr-lkp@intel.com/ Link: https://patch.msgid.link/20241220114417.5898-1-tiwai@suse.de Signed-off-by: Takashi Iwai <tiwai@suse.de>	2024-12-20 12:45:38 +01:00
Li Zhijian	55853cb829	selftests/alsa: Fix circular dependency involving global-timer The pattern rule `$(OUTPUT)/%: %.c` inadvertently included a circular dependency on the global-timer target due to its inclusion in $(TEST_GEN_PROGS_EXTENDED). This resulted in a circular dependency warning during the build process. To resolve this, the dependency on $(TEST_GEN_PROGS_EXTENDED) has been replaced with an explicit dependency on $(OUTPUT)/libatest.so. This change ensures that libatest.so is built before any other targets that require it, without creating a circular dependency. This fix addresses the following warning: make[4]: Entering directory 'tools/testing/selftests/alsa' make[4]: Circular default_modconfig/kselftest/alsa/global-timer <- default_modconfig/kselftest/alsa/global-timer dependency dropped. make[4]: Nothing to be done for 'all'. make[4]: Leaving directory 'tools/testing/selftests/alsa' Cc: Mark Brown <broonie@kernel.org> Cc: Jaroslav Kysela <perex@perex.cz> Cc: Takashi Iwai <tiwai@suse.com> Cc: Shuah Khan <shuah@kernel.org> Signed-off-by: Li Zhijian <lizhijian@fujitsu.com> Link: https://patch.msgid.link/20241218025931.914164-1-lizhijian@fujitsu.com Signed-off-by: Takashi Iwai <tiwai@suse.de>	2024-12-20 10:00:41 +01:00
Fedor Pchelkin	fa0308134d	ALSA: memalloc: prefer dma_mapping_error() over explicit address checking With CONFIG_DMA_API_DEBUG enabled, the following warning is observed: DMA-API: snd_hda_intel 0000:03:00.1: device driver failed to check map error[device address=0x00000000ffff0000] [size=20480 bytes] [mapped as single] WARNING: CPU: 28 PID: 2255 at kernel/dma/debug.c:1036 check_unmap+0x1408/0x2430 CPU: 28 UID: 42 PID: 2255 Comm: wireplumber Tainted: G W L 6.12.0-10-133577cad6bf48e5a7848c4338124081393bfe8a+ #759 debug_dma_unmap_page+0xe9/0xf0 snd_dma_wc_free+0x85/0x130 [snd_pcm] snd_pcm_lib_free_pages+0x1e3/0x440 [snd_pcm] snd_pcm_common_ioctl+0x1c9a/0x2960 [snd_pcm] snd_pcm_ioctl+0x6a/0xc0 [snd_pcm] ... Check for returned DMA addresses using specialized dma_mapping_error() helper which is generally recommended for this purpose by Documentation/core-api/dma-api.rst. Fixes: `c880a51466` ("ALSA: memalloc: Use proper DMA mapping API for x86 WC buffer allocations") Reported-by: Mikhail Gavrilov <mikhail.v.gavrilov@gmail.com> Closes: https://lore.kernel.org/r/CABXGCsNB3RsMGvCucOy3byTEOxoc-Ys+zB_HQ=Opb_GhX1ioDA@mail.gmail.com/ Tested-by: Mikhail Gavrilov <mikhail.v.gavrilov@gmail.com> Signed-off-by: Fedor Pchelkin <pchelkin@ispras.ru> Link: https://patch.msgid.link/20241219203345.195898-1-pchelkin@ispras.ru Signed-off-by: Takashi Iwai <tiwai@suse.de>	2024-12-20 09:54:12 +01:00
Jaroslav Kysela	3d3f43fab4	ALSA: compress_offload: improve file descriptors installation for dma-buf Avoid using a single dma_buf_fd() call for both directions. This code ensures that both file descriptors are allocated before fd_install(). Link: https://lore.kernel.org/linux-sound/6a923647-4495-4cff-a253-b73f48cfd0ea@stanley.mountain/ Fixes: `04177158cf` ("ALSA: compress_offload: introduce accel operation mode") Reported-by: Dan Carpenter <dan.carpenter@linaro.org> Cc: Vinod Koul <vkoul@kernel.org> Signed-off-by: Jaroslav Kysela <perex@perex.cz> Signed-off-by: Takashi Iwai <tiwai@suse.de> Link: https://patch.msgid.link/20241217100726.732863-1-perex@perex.cz	2024-12-20 09:52:15 +01:00
Jaroslav Kysela	f25a51b47c	ALSA: compress_offload: use safe list iteration in snd_compr_task_seq() The sequence function can call snd_compr_task_free_one(). Use list_for_each_entry_safe_reverse() to make sure that the used pointers are safe. Link: https://lore.kernel.org/linux-sound/f2769cff-6c7a-4092-a2d1-c33a5411a182@stanley.mountain/ Fixes: `04177158cf` ("ALSA: compress_offload: introduce accel operation mode") Reported-by: Dan Carpenter <dan.carpenter@linaro.org> Cc: Vinod Koul <vkoul@kernel.org> Signed-off-by: Jaroslav Kysela <perex@perex.cz> Signed-off-by: Takashi Iwai <tiwai@suse.de> Link: https://patch.msgid.link/20241217100707.732766-1-perex@perex.cz	2024-12-20 09:51:36 +01:00
Arnd Bergmann	6018f2fe10	ALSA: compress_offload: avoid 64-bit get_user() On some architectures, get_user() cannot read a 64-bit user variable: arm-linux-gnueabi-ld: sound/core/compress_offload.o: in function `snd_compr_ioctl': compress_offload.c:(.text.snd_compr_ioctl+0x538): undefined reference to `__get_user_bad' Use an equivalent copy_from_user() instead. Fixes: `04177158cf` ("ALSA: compress_offload: introduce accel operation mode") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Acked-by: Shengjiu Wang <shengjiu.wang@gmail.com> Acked-by: Vinod Koul <vkoul@kernel.org> Signed-off-by: Takashi Iwai <tiwai@suse.de> Link: https://patch.msgid.link/20241216093410.377112-2-arnd@kernel.org	2024-12-20 09:49:15 +01:00
Arnd Bergmann	1ae40d5231	ALSA: compress_offload: import DMA_BUF namespace The compression offload code cannot be in a loadable module unless it imports that namespace: ERROR: modpost: module snd-compress uses symbol dma_buf_get from namespace DMA_BUF, but does not import it. ERROR: modpost: module snd-compress uses symbol dma_buf_put from namespace DMA_BUF, but does not import it. ERROR: modpost: module snd-compress uses symbol dma_buf_fd from namespace DMA_BUF, but does not import it. Fixes: `04177158cf` ("ALSA: compress_offload: introduce accel operation mode") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Acked-by: Shengjiu Wang <shengjiu.wang@gmail.com> Acked-by: Vinod Koul <vkoul@kernel.org> Signed-off-by: Takashi Iwai <tiwai@suse.de> Link: https://patch.msgid.link/20241216093410.377112-1-arnd@kernel.org	2024-12-20 09:49:15 +01:00
Dave Airlie	e639fb046b	Merge tag 'amd-drm-fixes-6.13-2024-12-18' of https://gitlab.freedesktop.org/agd5f/linux into drm-fixes amd-drm-fixes-6.13-2024-12-18: amdgpu: - Disable BOCO when CONFIG_HOTPLUG_PCI_PCIE is not enabled - scheduler job fixes - IP version check fixes - devcoredump fix - GPUVM update fix - NBIO 2.5 fix Signed-off-by: Dave Airlie <airlied@redhat.com> From: Alex Deucher <alexander.deucher@amd.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241218204637.2966198-1-alexander.deucher@amd.com	2024-12-20 16:21:44 +10:00
Sean Christopherson	386d69f9f2	KVM: x86/mmu: Treat TDP MMU faults as spurious if access is already allowed Treat slow-path TDP MMU faults as spurious if the access is allowed given the existing SPTE to fix a benign warning (other than the WARN itself) due to replacing a writable SPTE with a read-only SPTE, and to avoid the unnecessary LOCK CMPXCHG and subsequent TLB flush. If a read fault races with a write fault, fast GUP fails for any reason when trying to "promote" the read fault to a writable mapping, and KVM resolves the write fault first, then KVM will end up trying to install a read-only SPTE (for a !map_writable fault) overtop a writable SPTE. Note, it's not entirely clear why fast GUP fails, or if that's even how KVM ends up with a !map_writable fault with a writable SPTE. If something else is going awry, e.g. due to a bug in mmu_notifiers, then treating read faults as spurious in this scenario could effectively mask the underlying problem. However, retrying the faulting access instead of overwriting an existing SPTE is functionally correct and desirable irrespective of the WARN, and fast GUP _can_ legitimately fail with a writable VMA, e.g. if the Accessed bit in primary MMU's PTE is toggled and causes a PTE value mismatch. The WARN was also recently added, specifically to track down scenarios where KVM unnecessarily overwrites SPTEs, i.e. treating the fault as spurious doesn't regress KVM's bug-finding capabilities in any way. In short, letting the WARN linger because there's a tiny chance it's due to a bug elsewhere would be excessively paranoid. Fixes: `1a175082b1` ("KVM: x86/mmu: WARN and flush if resolving a TDP MMU fault clears MMU-writable") Reported-by: Lei Yang <leiyang@redhat.com> Closes: https://bugzilla.kernel.org/show_bug.cgi?id=219588 Tested-by: Lei Yang <leiyang@redhat.com> Link: https://lore.kernel.org/r/20241218213611.3181643-1-seanjc@google.com Signed-off-by: Sean Christopherson <seanjc@google.com>	2024-12-19 17:47:52 -08:00
Sean Christopherson	4d5163cba4	KVM: SVM: Allow guest writes to set MSR_AMD64_DE_CFG bits Drop KVM's arbitrary behavior of making DE_CFG.LFENCE_SERIALIZE read-only for the guest, as rejecting writes can lead to guest crashes, e.g. Windows in particular doesn't gracefully handle unexpected #GPs on the WRMSR, and nothing in the AMD manuals suggests that LFENCE_SERIALIZE is read-only _if it exists_. KVM only allows LFENCE_SERIALIZE to be set, by the guest or host, if the underlying CPU has X86_FEATURE_LFENCE_RDTSC, i.e. if LFENCE is guaranteed to be serializing. So if the guest sets LFENCE_SERIALIZE, KVM will provide the desired/correct behavior without any additional action (the guest's value is never stuffed into hardware). And having LFENCE be serializing even when it's not _required_ to be is a-ok from a functional perspective. Fixes: `74a0e79df6` ("KVM: SVM: Disallow guest from changing userspace's MSR_AMD64_DE_CFG value") Fixes: `d1d93fa90f` ("KVM: SVM: Add MSR-based feature support for serializing LFENCE") Reported-by: Simon Pilkington <simonp.git@mailbox.org> Closes: https://lore.kernel.org/all/52914da7-a97b-45ad-86a0-affdf8266c61@mailbox.org Cc: Tom Lendacky <thomas.lendacky@amd.com> Cc: stable@vger.kernel.org Reviewed-by: Tom Lendacky <thomas.lendacky@amd.com> Link: https://lore.kernel.org/r/20241211172952.1477605-1-seanjc@google.com Signed-off-by: Sean Christopherson <seanjc@google.com>	2024-12-19 17:47:52 -08:00
Sean Christopherson	9b42d1e8e4	KVM: x86: Play nice with protected guests in complete_hypercall_exit() Use is_64_bit_hypercall() instead of is_64_bit_mode() to detect a 64-bit hypercall when completing said hypercall. For guests with protected state, e.g. SEV-ES and SEV-SNP, KVM must assume the hypercall was made in 64-bit mode as the vCPU state needed to detect 64-bit mode is unavailable. Hacking the sev_smoke_test selftest to generate a KVM_HC_MAP_GPA_RANGE hypercall via VMGEXIT trips the WARN: ------------[ cut here ]------------ WARNING: CPU: 273 PID: 326626 at arch/x86/kvm/x86.h:180 complete_hypercall_exit+0x44/0xe0 [kvm] Modules linked in: kvm_amd kvm ... [last unloaded: kvm] CPU: 273 UID: 0 PID: 326626 Comm: sev_smoke_test Not tainted 6.12.0-smp--392e932fa0f3-feat #470 Hardware name: Google Astoria/astoria, BIOS 0.20240617.0-0 06/17/2024 RIP: 0010:complete_hypercall_exit+0x44/0xe0 [kvm] Call Trace: <TASK> kvm_arch_vcpu_ioctl_run+0x2400/0x2720 [kvm] kvm_vcpu_ioctl+0x54f/0x630 [kvm] __se_sys_ioctl+0x6b/0xc0 do_syscall_64+0x83/0x160 entry_SYSCALL_64_after_hwframe+0x76/0x7e </TASK> ---[ end trace 0000000000000000 ]--- Fixes: `b5aead0064` ("KVM: x86: Assume a 64-bit hypercall for guests with protected state") Cc: stable@vger.kernel.org Cc: Tom Lendacky <thomas.lendacky@amd.com> Reviewed-by: Xiaoyao Li <xiaoyao.li@intel.com> Reviewed-by: Nikunj A Dadhania <nikunj@amd.com> Reviewed-by: Tom Lendacky <thomas.lendacky@amd.com> Reviewed-by: Binbin Wu <binbin.wu@linux.intel.com> Reviewed-by: Kai Huang <kai.huang@intel.com> Link: https://lore.kernel.org/r/20241128004344.4072099-2-seanjc@google.com Signed-off-by: Sean Christopherson <seanjc@google.com>	2024-12-19 17:47:51 -08:00
Suravee Suthikulpanit	d81cadbe16	KVM: SVM: Disable AVIC on SNP-enabled system without HvInUseWrAllowed feature On SNP-enabled system, VMRUN marks AVIC Backing Page as in-use while the guest is running for both secure and non-secure guest. Any hypervisor write to the in-use vCPU's AVIC backing page (e.g. to inject an interrupt) will generate unexpected #PF in the host. Currently, attempt to run AVIC guest would result in the following error: BUG: unable to handle page fault for address: ff3a442e549cc270 #PF: supervisor write access in kernel mode #PF: error_code(0x80000003) - RMP violation PGD b6ee01067 P4D b6ee02067 PUD 10096d063 PMD 11c540063 PTE 80000001149cc163 SEV-SNP: PFN 0x1149cc unassigned, dumping non-zero entries in 2M PFN region: [0x114800 - 0x114a00] ... Newer AMD system is enhanced to allow hypervisor to modify the backing page for non-secure guest on SNP-enabled system. This enhancement is available when the CPUID Fn8000_001F_EAX bit 30 is set (HvInUseWrAllowed). This table describes AVIC support matrix w.r.t. SNP enablement: \| Non-SNP system \| SNP system ----------------------------------------------------- Non-SNP guest \| AVIC Activate \| AVIC Activate iff \| \| HvInuseWrAllowed=1 ----------------------------------------------------- SNP guest \| N/A \| Secure AVIC Therefore, check and disable AVIC in kvm_amd driver when the feature is not available on SNP-enabled system. See the AMD64 Architecture Programmer’s Manual (APM) Volume 2 for detail. (https://www.amd.com/content/dam/amd/en/documents/processor-tech-docs/ programmer-references/40332.pdf) Fixes: `216d106c7f` ("x86/sev: Add SEV-SNP host initialization support") Signed-off-by: Suravee Suthikulpanit <suravee.suthikulpanit@amd.com> Link: https://lore.kernel.org/r/20241104075845.7583-1-suravee.suthikulpanit@amd.com Signed-off-by: Sean Christopherson <seanjc@google.com>	2024-12-19 17:47:51 -08:00
Dave Airlie	87fd883325	Merge tag 'drm-misc-fixes-2024-12-19' of https://gitlab.freedesktop.org/drm/misc/kernel into drm-fixes drm-misc-fixes for v6.13-rc4: - udma-buf fixes related to sealing. - dma-buf build warning fix when debugfs is not enabled. - Assorted drm/panel fixes. - Correct error return in drm_dp_tunnel_mgr_create. - Fix even more divide by zero in drm_mode_vrefresh. - Fix FBDEV dependencies in Kconfig. - Documentation fix for drm_sched_fini. - IVPU NULL pointer, memory leak and WARN fix. Signed-off-by: Dave Airlie <airlied@redhat.com> From: Maarten Lankhorst <maarten.lankhorst@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/d0763051-87b7-483e-89e0-a9f993383450@linux.intel.com	2024-12-20 07:13:45 +10:00
Pavel Begunkov	dbd2ca9367	io_uring: check if iowq is killed before queuing task work can be executed after the task has gone through io_uring termination, whether it's the final task_work run or the fallback path. In this case, task work will find ->io_wq being already killed and null'ed, which is a problem if it then tries to forward the request to io_queue_iowq(). Make io_queue_iowq() fail requests in this case. Note that it also checks PF_KTHREAD, because the user can first close a DEFER_TASKRUN ring and shortly after kill the task, in which case ->iowq check would race. Cc: stable@vger.kernel.org Fixes: `50c52250e2` ("block: implement async io_uring discard cmd") Fixes: `773af69121` ("io_uring: always reissue from task_work context") Reported-by: Will <willsroot@protonmail.com> Signed-off-by: Pavel Begunkov <asml.silence@gmail.com> Link: https://lore.kernel.org/r/63312b4a2c2bb67ad67b857d17a300e1d3b078e8.1734637909.git.asml.silence@gmail.com Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-19 13:31:53 -07:00
Bharath SM	92941c7f2c	smb: fix bytes written value in /proc/fs/cifs/Stats With recent netfs apis changes, the bytes written value was not getting updated in /proc/fs/cifs/Stats. Fix this by updating tcon->bytes in write operations. Fixes: `3ee1a1fc39` ("cifs: Cut over to using netfslib") Signed-off-by: Bharath SM <bharathsm@microsoft.com> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-19 12:14:11 -06:00
Linus Torvalds	8faabc041a	Merge tag 'net-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net Pull networking fixes from Paolo Abeni: "Including fixes from can and netfilter. Current release - regressions: - rtnetlink: try the outer netns attribute in rtnl_get_peer_net() - rust: net::phy fix module autoloading Current release - new code bugs: - phy: avoid undefined behavior in _led_polarity_set() - eth: octeontx2-pf: fix netdev memory leak in rvu_rep_create() Previous releases - regressions: - smc: check sndbuf_space again after NOSPACE flag is set in smc_poll - ipvs: fix clamp() of ip_vs_conn_tab on small memory systems - dsa: restore dsa_software_vlan_untag() ability to operate on VLAN-untagged traffic - eth: - tun: fix tun_napi_alloc_frags() - ionic: no double destroy workqueue - idpf: trigger SW interrupt when exiting wb_on_itr mode - rswitch: rework ts tags management - team: fix feature exposure when no ports are present Previous releases - always broken: - core: fix repeated netlink messages in queue dump - mdiobus: fix an OF node reference leak - smc: check iparea_offset and ipv6_prefixes_cnt when receiving proposal msg - can: fix missed interrupts with m_can_pci - eth: oa_tc6: fix infinite loop error when tx credits becomes 0" tag 'net-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net: (45 commits) net: mctp: handle skb cleanup on sock_queue failures net: mdiobus: fix an OF node reference leak octeontx2-pf: fix error handling of devlink port in rvu_rep_create() octeontx2-pf: fix netdev memory leak in rvu_rep_create() psample: adjust size if rate_as_probability is set netdev-genl: avoid empty messages in queue dump net: dsa: restore dsa_software_vlan_untag() ability to operate on VLAN-untagged traffic selftests: openvswitch: fix tcpdump execution net: usb: qmi_wwan: add Quectel RG255C net: phy: avoid undefined behavior in *_led_polarity_set() netfilter: ipset: Fix for recursive locking warning ipvs: Fix clamp() of ip_vs_conn_tab on small memory systems can: m_can: fix missed interrupts with m_can_pci can: m_can: set init flag earlier in probe rtnetlink: Try the outer netns attribute in rtnl_get_peer_net(). net: netdevsim: fix nsim_pp_hold_write() idpf: trigger SW interrupt when exiting wb_on_itr mode idpf: add support for SW triggered interrupts qed: fix possible uninit pointer read in qed_mcp_nvm_info_populate() net: ethernet: bgmac-platform: fix an OF node reference leak ...	2024-12-19 09:19:11 -08:00
Linus Torvalds	baaa2567a7	Merge tag 'mmc-v6.13-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc Pull MMC fixes from Ulf Hansson: - mtk-sd: Cleanup the wakeup configuration in error/remove-path - sdhci-tegra: Correct quirk for ADMA2 length * tag 'mmc-v6.13-rc2' of git://git.kernel.org/pub/scm/linux/kernel/git/ulfh/mmc: mmc: mtk-sd: disable wakeup in .remove() and in the error path of .probe() mmc: sdhci-tegra: Remove SDHCI_QUIRK_BROKEN_ADMA_ZEROLEN_DESC quirk	2024-12-19 08:53:51 -08:00
Linus Torvalds	a0db71c7fe	Merge tag 'pwm/for-6.13-rc4-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/ukleinek/linux Pull pwm fix from Uwe Kleine-König: "Fix regression in pwm-stm32 driver when converting to new waveform support Fabrice Gasnier found and fixed a regression I introduced with v6.13-rc1 when converting the stm32 pwm driver to support the new waveform stuff. On some hardware variants this completely broke the driver" * tag 'pwm/for-6.13-rc4-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/ukleinek/linux: pwm: stm32: Fix complementary output in round_waveform_tohw()	2024-12-19 08:50:05 -08:00
Linus Torvalds	466b2d40f6	Merge tag 'v6.13-rc3-ksmbd-server-fixes' of git://git.samba.org/ksmbd Pull smb server fixes from Steve French: - Two fixes for better handling maximum outstanding requests - Fix simultaneous negotiate protocol race * tag 'v6.13-rc3-ksmbd-server-fixes' of git://git.samba.org/ksmbd: ksmbd: conn lock to serialize smb2 negotiate ksmbd: fix broken transfers when exceeding max simultaneous operations ksmbd: count all requests in req_running counter	2024-12-19 08:45:37 -08:00
Lukas Wunner	774c71c52a	PCI/bwctrl: Enable only if more than one speed is supported If a PCIe port only supports a single speed, enabling bandwidth control is pointless: There's no need to monitor autonomous speed changes, nor can the speed be changed. Not enabling it saves a small amount of memory and compute resources, but also fixes a boot hang reported by Niklas: It occurs when enabling bandwidth control on Downstream Ports of Intel JHL7540 "Titan Ridge 2018" Thunderbolt controllers. The ports only support 2.5 GT/s in accordance with USB4 v2 sec 11.2.1, so the present commit works around the issue. PCIe r6.2 sec 8.2.1 prescribes that: "A device must support 2.5 GT/s and is not permitted to skip support for any data rates between 2.5 GT/s and the highest supported rate." Consequently, bandwidth control is currently only disabled if a port doesn't support higher speeds than 2.5 GT/s. However the Implementation Note in PCIe r6.2 sec 7.5.3.18 cautions: "It is strongly encouraged that software primarily utilize the Supported Link Speeds Vector instead of the Max Link Speed field, so that software can determine the exact set of supported speeds on current and future hardware. This can avoid software being confused if a future specification defines Links that do not require support for all slower speeds." In other words, future revisions of the PCIe Base Spec may allow gaps in the Supported Link Speeds Vector. To be future-proof, don't just check whether speeds above 2.5 GT/s are supported, but rather check whether more than one speed is supported. Fixes: `665745f274` ("PCI/bwctrl: Re-add BW notification portdrv as PCIe BW controller") Closes: https://lore.kernel.org/r/db8e457fcd155436449b035e8791a8241b0df400.camel@kernel.org Link: https://lore.kernel.org/r/3564908a9c99fc0d2a292473af7a94ebfc8f5820.1734428762.git.lukas@wunner.de Reported-by: Niklas Schnelle <niks@kernel.org> Tested-by: Niklas Schnelle <niks@kernel.org> Signed-off-by: Lukas Wunner <lukas@wunner.de> Signed-off-by: Krzysztof Wilczyński <kwilczynski@kernel.org> Reviewed-by: Jonathan Cameron <Jonthan.Cameron@huawei.com> Reviewed-by: Mario Limonciello <mario.limonciello@amd.com> Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-19 16:36:36 +00:00
Lukas Wunner	3202ca2215	PCI: Honor Max Link Speed when determining supported speeds The Supported Link Speeds Vector in the Link Capabilities 2 Register indicates the supported link speeds. The Max Link Speed field in the Link Capabilities Register indicates the maximum of those speeds. pcie_get_supported_speeds() neglects to honor the Max Link Speed field and will thus incorrectly deem higher speeds as supported. Fix it. One user-visible issue addressed here is an incorrect value in the sysfs attribute "max_link_speed". But the main motivation is a boot hang reported by Niklas: Intel JHL7540 "Titan Ridge 2018" Thunderbolt controllers support 2.5-8 GT/s speeds, but indicate 2.5 GT/s as maximum. Ilpo recalls seeing this on more devices. It can be explained by the controller's Downstream Ports supporting 8 GT/s if an Endpoint is attached, but limiting to 2.5 GT/s if the port interfaces to a PCIe Adapter, in accordance with USB4 v2 sec 11.2.1: "This section defines the functionality of an Internal PCIe Port that interfaces to a PCIe Adapter. [...] The Logical sub-block shall update the PCIe configuration registers with the following characteristics: [...] Max Link Speed field in the Link Capabilities Register set to 0001b (data rate of 2.5 GT/s only). Note: These settings do not represent actual throughput. Throughput is implementation specific and based on the USB4 Fabric performance." The present commit is not sufficient on its own to fix Niklas' boot hang, but it is a prerequisite: A subsequent commit will fix the boot hang by enabling bandwidth control only if more than one speed is supported. The GENMASK() macro used herein specifies 0 as lowest bit, even though the Supported Link Speeds Vector ends at bit 1. This is done on purpose to avoid a GENMASK(0, 1) macro if Max Link Speed is zero. That macro would be invalid as the lowest bit is greater than the highest bit. Ilpo has witnessed a zero Max Link Speed on Root Complex Integrated Endpoints in particular, so it does occur in practice. (The Link Capabilities Register is optional on RCiEPs per PCIe r6.2 sec 7.5.3.) Fixes: `d2bd39c045` ("PCI: Store all PCIe Supported Link Speeds") Closes: https://lore.kernel.org/r/70829798889c6d779ca0f6cd3260a765780d1369.camel@kernel.org Link: https://lore.kernel.org/r/fe03941e3e1cc42fb9bf4395e302bff53ee2198b.1734428762.git.lukas@wunner.de Reported-by: Niklas Schnelle <niks@kernel.org> Tested-by: Niklas Schnelle <niks@kernel.org> Signed-off-by: Lukas Wunner <lukas@wunner.de> Signed-off-by: Krzysztof Wilczyński <kwilczynski@kernel.org> Reviewed-by: Jonathan Cameron <Jonathan.Cameron@huawei.com> Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-19 16:35:59 +00:00
Jens Axboe	c261e4f1dd	io_uring/register: limit ring resizing to DEFER_TASKRUN With DEFER_TASKRUN, we know the ring can't be both waited upon and resized at the same time. This is important for CQ resizing. Allowing SQ ring resizing is more trivial, but isn't the interesting use case. Hence limit ring resizing in general to DEFER_TASKRUN only for now. This isn't a huge problem as CQ ring resizing is generally the most useful on networking type of workloads where it can be hard to size the ring appropriately upfront, and those should be using DEFER_TASKRUN for better performance. Fixes: `79cfe9e59c` ("io_uring/register: add IORING_REGISTER_RESIZE_RINGS") Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-19 09:32:26 -07:00
Enzo Matsumiya	e9f2517a3e	smb: client: fix TCP timers deadlock after rmmod Commit `ef7134c7fc` ("smb: client: Fix use-after-free of network namespace.") fixed a netns UAF by manually enabling socket refcounting (sk->sk_net_refcnt=1 and sock_inuse_add(net, 1)). The reason the patch worked for that bug was because we now hold references to the netns (get_net_track() gets a ref internally) and they're properly released (internally, on __sk_destruct()), but only because sk->sk_net_refcnt was set. Problem: (this happens regardless of CONFIG_NET_NS_REFCNT_TRACKER and regardless if init_net or other) Setting sk->sk_net_refcnt=1 manually and after socket creation is not only out of cifs scope, but also technically wrong -- it's set conditionally based on user (=1) vs kernel (=0) sockets. And net/ implementations seem to base their user vs kernel space operations on it. e.g. upon TCP socket close, the TCP timers are not cleared because sk->sk_net_refcnt=1: (cf. commit `151c9c724d` ("tcp: properly terminate timers for kernel sockets")) net/ipv4/tcp.c: void tcp_close(struct sock *sk, long timeout) { lock_sock(sk); __tcp_close(sk, timeout); release_sock(sk); if (!sk->sk_net_refcnt) inet_csk_clear_xmit_timers_sync(sk); sock_put(sk); } Which will throw a lockdep warning and then, as expected, deadlock on tcp_write_timer(). A way to reproduce this is by running the reproducer from `ef7134c7fc` and then 'rmmod cifs'. A few seconds later, the deadlock/lockdep warning shows up. Fix: We shouldn't mess with socket internals ourselves, so do not set sk_net_refcnt manually. Also change __sock_create() to sock_create_kern() for explicitness. As for non-init_net network namespaces, we deal with it the best way we can -- hold an extra netns reference for server->ssocket and drop it when it's released. This ensures that the netns still exists whenever we need to create/destroy server->ssocket, but is not directly tied to it. Fixes: `ef7134c7fc` ("smb: client: Fix use-after-free of network namespace.") Cc: stable@vger.kernel.org Signed-off-by: Enzo Matsumiya <ematsumiya@suse.de> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-19 09:25:20 -06:00
Dragan Simic	ee1c8e6b29	smb: client: Deduplicate "select NETFS_SUPPORT" in Kconfig Repeating automatically selected options in Kconfig files is redundant, so let's delete repeated "select NETFS_SUPPORT" that was added accidentally. Fixes: `69c3c023af` ("cifs: Implement netfslib hooks") Signed-off-by: Dragan Simic <dsimic@manjaro.org> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-19 09:24:35 -06:00
Bharath SM	a769bee5f9	smb: use macros instead of constants for leasekey size and default cifsattrs value Replace default hardcoded value for cifsAttrs with ATTR_ARCHIVE macro Use SMB2_LEASE_KEY_SIZE macro for leasekey size in smb2_lease_break Signed-off-by: Bharath SM <bharathsm@microsoft.com> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-19 09:24:32 -06:00
Bagas Sanjaya	1b684ca15f	drm/sched: Fix drm_sched_fini() docu generation Commit `baf4afc583` ("drm/sched: Improve teardown documentation") added a list of drm_sched_fini()'s problems. The list triggers htmldocs warning (but renders correctly in htmldocs output): Documentation/gpu/drm-mm:571: ./drivers/gpu/drm/scheduler/sched_main.c:1359: ERROR: Unexpected indentation. Separate the list from the preceding paragraph by a blank line to fix the warning. While at it, also end the aforementioned paragraph by a colon. Fixes: `baf4afc583` ("drm/sched: Improve teardown documentation") Reported-by: Stephen Rothwell <sfr@canb.auug.org.au> Closes: https://lore.kernel.org/r/20241108175655.6d3fcfb7@canb.auug.org.au/ Signed-off-by: Bagas Sanjaya <bagasdotme@gmail.com> [phasta: Adjust commit message] Signed-off-by: Philipp Stanner <phasta@kernel.org> Link: https://patchwork.freedesktop.org/patch/msgid/20241217034915.62594-1-bagasdotme@gmail.com	2024-12-19 16:03:56 +01:00
Jerome Marchand	716f2bca1c	selftests/bpf: Fix compilation error in get_uprobe_offset() In get_uprobe_offset(), the call to procmap_query() uses the constant PROCMAP_QUERY_VMA_EXECUTABLE, even if PROCMAP_QUERY is not defined. Define PROCMAP_QUERY_VMA_EXECUTABLE when PROCMAP_QUERY isn't. Fixes: `4e9e07603e` ("selftests/bpf: make use of PROCMAP_QUERY ioctl if available") Signed-off-by: Jerome Marchand <jmarchan@redhat.com> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Yonghong Song <yonghong.song@linux.dev> Link: https://lore.kernel.org/bpf/20241218175724.578884-1-jmarchan@redhat.com	2024-12-19 13:24:39 +01:00
Jacek Lawrynowicz	0f6482caa6	accel/ivpu: Fix WARN in ivpu_ipc_send_receive_internal() Move pm_runtime_set_active() to ivpu_pm_init() so when ivpu_ipc_send_receive_internal() is executed before ivpu_pm_enable() it already has correct runtime state, even if last resume was not successful. Fixes: `8ed520ff46` ("accel/ivpu: Move set autosuspend delay to HW specific code") Cc: stable@vger.kernel.org # v6.7+ Reviewed-by: Karol Wachowski <karol.wachowski@intel.com> Reviewed-by: Jeffrey Hugo <quic_jhugo@quicinc.com> Signed-off-by: Jacek Lawrynowicz <jacek.lawrynowicz@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241210130939.1575610-4-jacek.lawrynowicz@linux.intel.com	2024-12-19 13:16:59 +01:00
Jacek Lawrynowicz	6c9ba75f14	accel/ivpu: Fix memory leak in ivpu_mmu_reserved_context_init() Add appropriate error handling to ensure all allocated resources are released upon encountering an error. Fixes: `a74f4d9913` ("accel/ivpu: Defer MMU root page table allocation") Cc: Karol Wachowski <karol.wachowski@intel.com> Reviewed-by: Karol Wachowski <karol.wachowski@intel.com> Reviewed-by: Jeffrey Hugo <quic_jhugo@quicinc.com> Signed-off-by: Jacek Lawrynowicz <jacek.lawrynowicz@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241210130939.1575610-3-jacek.lawrynowicz@linux.intel.com	2024-12-19 13:16:21 +01:00
Jacek Lawrynowicz	4b2efb9db0	accel/ivpu: Fix general protection fault in ivpu_bo_list() Check if ctx is not NULL before accessing its fields. Fixes: `37dee2a2f4` ("accel/ivpu: Improve buffer object debug logs") Cc: stable@vger.kernel.org # v6.8 Reviewed-by: Karol Wachowski <karol.wachowski@intel.com> Reviewed-by: Jeffrey Hugo <quic_jhugo@quicinc.com> Signed-off-by: Jacek Lawrynowicz <jacek.lawrynowicz@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241210130939.1575610-2-jacek.lawrynowicz@linux.intel.com	2024-12-19 13:16:20 +01:00
Tiezhu Yang	29d44cce32	selftests/bpf: Use asm constraint "m" for LoongArch Currently, LoongArch LLVM does not support the constraint "o" and there is no plan to support it; it only supports the similar constraint "m". So change the constraints from "nor" in the "else" case to arch-specific "nmr" to avoid build errors such as "unexpected asm memory constraint" for LoongArch. Fixes: `630301b0d5` ("selftests/bpf: Add basic USDT selftests") Suggested-by: Weining Lu <luweining@loongson.cn> Suggested-by: Li Chen <chenli@loongson.cn> Signed-off-by: Tiezhu Yang <yangtiezhu@loongson.cn> Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Reviewed-by: Huacai Chen <chenhuacai@loongson.cn> Cc: stable@vger.kernel.org Link: https://llvm.org/docs/LangRef.html#supported-constraint-code-list Link: https://github.com/llvm/llvm-project/blob/main/llvm/lib/Target/LoongArch/LoongArchISelDAGToDAG.cpp#L172 Link: https://lore.kernel.org/bpf/20241219111506.20643-1-yangtiezhu@loongson.cn	2024-12-19 13:15:52 +01:00
Greg Kroah-Hartman	1b62f3cb74	Merge tag 'thunderbolt-for-v6.13-rc4' of ssh://gitolite.kernel.org/pub/scm/linux/kernel/git/westeri/thunderbolt into usb-linus Mika writes: thunderbolt: Fixes for v6.13-rc4 This includes following USB4/Thunderbolt fixes for v6.13-rc4: - Add Intel Panther Lake PCI IDs - Do not show nvm_version for retimers that are not supported - Fix redrive mode handling. All these have been in linux-next with no reported issues. * tag 'thunderbolt-for-v6.13-rc4' of ssh://gitolite.kernel.org/pub/scm/linux/kernel/git/westeri/thunderbolt: thunderbolt: Improve redrive mode handling thunderbolt: Don't display nvm_version unless upgrade supported thunderbolt: Add support for Intel Panther Lake-M/P	2024-12-19 12:35:02 +01:00
Ahmad Fatoum	1322149606	regulator: rename regulator-uv-survival-time-ms according to DT binding The regulator bindings don't document regulator-uv-survival-time-ms, but the more descriptive regulator-uv-less-critical-window-ms instead. Looking back at v3[1] and v4[2] of the series adding the support, the property was indeed renamed between these patch series, but unfortunately the rename only made it into the DT bindings with the driver code still using the old name. Let's therefore rename the property in the driver code to follow suit. This will break backwards compatibility, but there are no upstream device trees using the property and we never documented the old name of the property anyway. ¯\_(ツ)_/¯ [1]: https://lore.kernel.org/all/20231025084614.3092295-7-o.rempel@pengutronix.de/ [2]: https://lore.kernel.org/all/20231026144824.4065145-5-o.rempel@pengutronix.de/ Signed-off-by: Ahmad Fatoum <a.fatoum@pengutronix.de> Link: https://patch.msgid.link/20241218-regulator-uv-survival-time-ms-rename-v1-1-6cac9c3c75da@pengutronix.de Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-19 11:15:24 +00:00
Chen-Yu Tsai	32c9c06adb	ASoC: mediatek: disable buffer pre-allocation On Chromebooks based on Mediatek MT8195 or MT8188, the audio frontend (AFE) is limited to accessing a very small window (1 MiB) of memory, which is described as a reserved memory region in the device tree. On these two platforms, the maximum buffer size is given as 512 KiB. The MediaTek common code uses the same value for preallocations. This means that only the first two PCM substreams get preallocations, and then the whole space is exhausted, barring any other substreams from working. Since the substreams used are not always the first two, this means audio won't work correctly. This is observed on the MT8188 Geralt Chromebooks, on which the "mediatek,dai-link" property was dropped when it was upstreamed. That property causes the driver to only register the PCM substreams listed in the property, and in the order given. Instead of trying to compute an optimal value and figuring out which streams are used, simply disable preallocation. The PCM buffers are managed by the core and are allocated and released on the fly. There should be no impact to any of the other MediaTek platforms. Signed-off-by: Chen-Yu Tsai <wenst@chromium.org> Reviewed-by: AngeloGioacchino Del Regno <angelogioacchino.delregno@collabora.com> Link: https://patch.msgid.link/20241219105303.548437-1-wenst@chromium.org Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-19 11:15:09 +00:00
Jeremy Kerr	ce1219c3f7	net: mctp: handle skb cleanup on sock_queue failures Currently, we don't use the return value from sock_queue_rcv_skb, which means we may leak skbs if a message is not successfully queued to a socket. Instead, ensure that we're freeing the skb where the sock hasn't otherwise taken ownership of the skb by adding checks on the sock_queue_rcv_skb() to invoke a kfree on failure. In doing so, rather than using the 'rc' value to trigger the kfree_skb(), use the skb pointer itself, which is more explicit. Also, add a kunit test for the sock delivery failure cases. Fixes: `4a992bbd36` ("mctp: Implement message fragmentation & reassembly") Cc: stable@vger.kernel.org Signed-off-by: Jeremy Kerr <jk@codeconstruct.com.au> Link: https://patch.msgid.link/20241218-mctp-next-v2-1-1c1729645eaa@codeconstruct.com.au Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-19 11:52:49 +01:00
Joe Hattori	572af9f284	net: mdiobus: fix an OF node reference leak fwnode_find_mii_timestamper() calls of_parse_phandle_with_fixed_args() but does not decrement the refcount of the obtained OF node. Add an of_node_put() call before returning from the function. This bug was detected by an experimental static analysis tool that I am developing. Fixes: `bc1bee3b87` ("net: mdiobus: Introduce fwnode_mdiobus_register_phy()") Signed-off-by: Joe Hattori <joe@pf.is.s.u-tokyo.ac.jp> Reviewed-by: Andrew Lunn <andrew@lunn.ch> Link: https://patch.msgid.link/20241218035106.1436405-1-joe@pf.is.s.u-tokyo.ac.jp Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-19 11:45:42 +01:00
Dave Airlie	e9088ac19e	Merge tag 'drm-intel-fixes-2024-12-18' of https://gitlab.freedesktop.org/drm/i915/kernel into drm-fixes - Reset engine utilization buffer before registration (Umesh Nerlige Ramappa) - Ensure busyness counter increases monotonically (Umesh Nerlige Ramappa) - Accumulate active runtime on gt reset (Umesh Nerlige Ramappa) Signed-off-by: Dave Airlie <airlied@redhat.com> From: Tvrtko Ursulin <tursulin@igalia.com> Link: https://patchwork.freedesktop.org/patch/msgid/Z2LppUZudGKXwWjW@linux	2024-12-19 20:31:36 +10:00
Paolo Abeni	b4adc04954	Merge tag 'nf-24-12-19' of git://git.kernel.org/pub/scm/linux/kernel/git/netfilter/nf Pablo Neira Ayuso says: ==================== Netfilter/IPVS fixes for net The following series contains two fixes for Netfilter/IPVS: 1) Possible build failure in IPVS on systems with less than 512MB memory due to incorrect use of clamp(), from David Laight. 2) Fix bogus lockdep nesting splat with ipset list:set type, from Phil Sutter. netfilter pull request 24-12-19 * tag 'nf-24-12-19' of git://git.kernel.org/pub/scm/linux/kernel/git/netfilter/nf: netfilter: ipset: Fix for recursive locking warning ipvs: Fix clamp() of ip_vs_conn_tab on small memory systems ==================== Link: https://patch.msgid.link/20241218234137.1687288-1-pablo@netfilter.org Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-19 09:55:21 +01:00
Harshit Mogalapalli	b95c8c33ae	octeontx2-pf: fix error handling of devlink port in rvu_rep_create() Unregister the devlink port when register_netdev() fails. Fixes: `9ed0343f56` ("octeontx2-pf: Add devlink port support") Reviewed-by: Przemek Kitszel <przemyslaw.kitszel@intel.com> Signed-off-by: Harshit Mogalapalli <harshit.m.mogalapalli@oracle.com> Link: https://patch.msgid.link/20241217052326.1086191-2-harshit.m.mogalapalli@oracle.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 19:23:51 -08:00
Harshit Mogalapalli	51df947678	octeontx2-pf: fix netdev memory leak in rvu_rep_create() When rvu_rep_devlink_port_register() fails, free_netdev(ndev) for this incomplete iteration before going to "exit:" label. Fixes: `9ed0343f56` ("octeontx2-pf: Add devlink port support") Reviewed-by: Przemek Kitszel <przemyslaw.kitszel@intel.com> Signed-off-by: Harshit Mogalapalli <harshit.m.mogalapalli@oracle.com> Link: https://patch.msgid.link/20241217052326.1086191-1-harshit.m.mogalapalli@oracle.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 19:23:50 -08:00
Adrian Moreno	5eecd85c77	psample: adjust size if rate_as_probability is set If PSAMPLE_ATTR_SAMPLE_PROBABILITY flag is to be sent, the available size for the packet data has to be adjusted accordingly. Also, check the error code returned by nla_put_flag. Fixes: `7b1b2b60c6` ("net: psample: allow using rate as probability") Signed-off-by: Adrian Moreno <amorenoz@redhat.com> Reviewed-by: Aaron Conole <aconole@redhat.com> Reviewed-by: Ido Schimmel <idosch@nvidia.com> Link: https://patch.msgid.link/20241217113739.3929300-1-amorenoz@redhat.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 19:23:04 -08:00
Jakub Kicinski	5eb70dbebf	netdev-genl: avoid empty messages in queue dump Empty netlink responses from do() are not correct (as opposed to dump() where not dumping anything is perfectly fine). We should return an error if the target object does not exist, in this case if the netdev is down it has no queues. Fixes: `6b6171db7f` ("netdev-genl: Add netlink framework functions for queue") Reported-by: syzbot+0a884bc2d304ce4af70f@syzkaller.appspotmail.com Reviewed-by: Eric Dumazet <edumazet@google.com> Reviewed-by: Joe Damato <jdamato@fastly.com> Link: https://patch.msgid.link/20241218022508.815344-1-kuba@kernel.org Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 19:22:51 -08:00
Vladimir Oltean	16f027cd40	net: dsa: restore dsa_software_vlan_untag() ability to operate on VLAN-untagged traffic Robert Hodaszi reports that locally terminated traffic towards VLAN-unaware bridge ports is broken with ocelot-8021q. He is describing the same symptoms as for commit `1f9fc48fd3` ("net: dsa: sja1105: fix reception from VLAN-unaware bridges"). For context, the set merged as "VLAN fixes for Ocelot driver": https://lore.kernel.org/netdev/20240815000707.2006121-1-vladimir.oltean@nxp.com/ was developed in a slightly different form earlier this year, in January. Initially, the switch was unconditionally configured to set OCELOT_ES0_TAG when using ocelot-8021q, regardless of port operating mode. This led to the situation where VLAN-unaware bridge ports would always push their PVID - see ocelot_vlan_unaware_pvid() - a negligible value anyway - into RX packets. To strip this in software, we would have needed DSA to know what private VID the switch chose for VLAN-unaware bridge ports, and pushed into the packets. This was implemented downstream, and a remnant of it remains in the form of a comment mentioning ds->ops->get_private_vid(), as something which would maybe need to be considered in the future. However, for upstream, it was deemed inappropriate, because it would mean introducing yet another behavior for stripping VLAN tags from VLAN-unaware bridge ports, when one already existed (ds->untag_bridge_pvid). The latter has been marked as obsolete along with an explanation why it is logically broken, but still, it would have been confusing. So, for upstream, felix_update_tag_8021q_rx_rule() was developed, which essentially changed the state of affairs from "Felix with ocelot-8021q delivers all packets as VLAN-tagged towards the CPU" into "Felix with ocelot-8021q delivers all packets from VLAN-aware bridge ports towards the CPU". 
This was done on the premise that in VLAN-unaware mode, there's nothing useful in the VLAN tags, and we can avoid introducing ds->ops->get_private_vid() in the DSA receive path if we configure the switch to not push those VLAN tags into packets in the first place. Unfortunately, and this is when the trainwreck started, the selftests developed initially and posted with the series were not re-run. dsa_software_vlan_untag() was initially written given the assumption that users of this feature would send _all_ traffic as VLAN-tagged. It was only partially adapted to the new scheme, by removing ds->ops->get_private_vid(), which also used to be necessary in standalone ports mode. Where the trainwreck became even worse is that I had a second opportunity to think about this, when the dsa_software_vlan_untag() logic change initially broke sja1105, in commit `1f9fc48fd3` ("net: dsa: sja1105: fix reception from VLAN-unaware bridges"). I did not connect the dots that it also breaks ocelot-8021q, for pretty much the same reason that not all received packets will be VLAN-tagged. To be compatible with the optimized Felix control path which runs felix_update_tag_8021q_rx_rule() to only push VLAN tags when useful (in VLAN-aware mode), we need to restore the old dsa_software_vlan_untag() logic. The blamed commit introduced the assumption that dsa_software_vlan_untag() will see only VLAN-tagged packets, an assumption which is false. What corrupts RX traffic is the fact that we call skb_vlan_untag() on packets which are not VLAN-tagged in the first place.
Fixes: `93e4649efa` ("net: dsa: provide a software untagging function on RX for VLAN-aware bridges") Reported-by: Robert Hodaszi <robert.hodaszi@digi.com> Closes: https://lore.kernel.org/netdev/20241215163334.615427-1-robert.hodaszi@digi.com/ Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Link: https://patch.msgid.link/20241216135059.1258266-1-vladimir.oltean@nxp.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 19:22:36 -08:00
Jakub Kicinski	a713c017ef	Merge branch '200GbE' of git://git.kernel.org/pub/scm/linux/kernel/git/tnguy/net-queue Tony Nguyen says: ==================== idpf: trigger SW interrupt when exiting wb_on_itr mode Joshua Hay says: This patch series introduces SW triggered interrupt support for idpf, then uses said interrupt to fix a race condition between completion writebacks and re-enabling interrupts. * '200GbE' of git://git.kernel.org/pub/scm/linux/kernel/git/tnguy/net-queue: idpf: trigger SW interrupt when exiting wb_on_itr mode idpf: add support for SW triggered interrupts ==================== Link: https://patch.msgid.link/20241217225715.4005644-1-anthony.l.nguyen@intel.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 19:20:20 -08:00
Adrian Moreno	a17975992c	selftests: openvswitch: fix tcpdump execution Fix the way tcpdump is executed by: - Using the right variable for the namespace. Currently the use of the empty "ns" makes the command fail. - Waiting until it starts to capture to ensure the interesting traffic is caught on slow systems. - Using line-buffered output to ensure logs are available when the test is paused with "-p". Otherwise the last chunk of data might only be written when tcpdump is killed. Fixes: `74cc26f416` ("selftests: openvswitch: add interface support") Signed-off-by: Adrian Moreno <amorenoz@redhat.com> Acked-by: Eelco Chaudron <echaudro@redhat.com> Link: https://patch.msgid.link/20241217211652.483016-1-amorenoz@redhat.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 19:18:41 -08:00
Leo Stone	d3ac65d274	mm: huge_memory: handle strsep not finding delimiter split_huge_pages_write() does not handle the case where strsep finds no delimiter in the given string and sets the input buffer to NULL, which allows this reproducer to trigger a protection fault. Link: https://lkml.kernel.org/r/20241216042752.257090-2-leocstone@gmail.com Signed-off-by: Leo Stone <leocstone@gmail.com> Reported-by: syzbot+8a3da2f1bbf59227c289@syzkaller.appspotmail.com Closes: https://syzkaller.appspot.com/bug?extid=8a3da2f1bbf59227c289 Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:47 -08:00
Suren Baghdasaryan	60da7445a1	alloc_tag: fix set_codetag_empty() when !CONFIG_MEM_ALLOC_PROFILING_DEBUG It was recently noticed that set_codetag_empty() might be used not only to mark NULL alloctag references as empty to avoid warnings but also to reset valid tags (in clear_page_tag_ref()). Since set_codetag_empty() is defined as NOOP for CONFIG_MEM_ALLOC_PROFILING_DEBUG=n, such use of set_codetag_empty() leads to subtle bugs. Fix set_codetag_empty() for CONFIG_MEM_ALLOC_PROFILING_DEBUG=n to reset the tag reference. Link: https://lkml.kernel.org/r/20241130001423.1114965-2-surenb@google.com Fixes: `a8fc28dad6` ("alloc_tag: introduce clear_page_tag_ref() helper function") Signed-off-by: Suren Baghdasaryan <surenb@google.com> Reported-by: David Wang <00107082@163.com> Closes: https://lore.kernel.org/lkml/20241124074318.399027-1-00107082@163.com/ Cc: David Wang <00107082@163.com> Cc: Kent Overstreet <kent.overstreet@linux.dev> Cc: Mike Rapoport (Microsoft) <rppt@kernel.org> Cc: Pasha Tatashin <pasha.tatashin@soleen.com> Cc: Sourav Panda <souravpanda@google.com> Cc: Yu Zhao <yuzhao@google.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:46 -08:00
Suren Baghdasaryan	e269b5d291	alloc_tag: fix module allocation tags populated area calculation vm_module_tags_populate() calculation of the populated area assumes that area starts at a page boundary and therefore when new pages are allocated, the end of the area is page-aligned as well. If the start of the area is not page-aligned then allocating a page and incrementing the end of the area by PAGE_SIZE leads to an area at the end but within the area boundary which is not populated. Accessing this area will lead to a kernel panic. Fix the calculation by down-aligning the start of the area and using that as the location allocated pages are mapped to. [gehao@kylinos.cn: fix vm_module_tags_populate's KASAN poisoning logic] Link: https://lkml.kernel.org/r/20241205170528.81000-1-hao.ge@linux.dev [gehao@kylinos.cn: fix panic when CONFIG_KASAN enabled and CONFIG_KASAN_VMALLOC not enabled] Link: https://lkml.kernel.org/r/20241212072126.134572-1-hao.ge@linux.dev Link: https://lkml.kernel.org/r/20241130001423.1114965-1-surenb@google.com Fixes: `0f9b685626` ("alloc_tag: populate memory for module tags as needed") Signed-off-by: Suren Baghdasaryan <surenb@google.com> Reported-by: kernel test robot <oliver.sang@intel.com> Closes: https://lore.kernel.org/oe-lkp/202411132111.6a221562-lkp@intel.com Acked-by: Yu Zhao <yuzhao@google.com> Tested-by: Adrian Huang <ahuang12@lenovo.com> Cc: David Wang <00107082@163.com> Cc: Kent Overstreet <kent.overstreet@linux.dev> Cc: Mike Rapoport (Microsoft) <rppt@kernel.org> Cc: Pasha Tatashin <pasha.tatashin@soleen.com> Cc: Sourav Panda <souravpanda@google.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:46 -08:00
David Wang	640a603943	mm/codetag: clear tags before swap When CONFIG_MEM_ALLOC_PROFILING_DEBUG is set, kernel WARN would be triggered when calling __alloc_tag_ref_set() during swap: alloc_tag was not cleared (got tag for mm/filemap.c:1951) WARNING: CPU: 0 PID: 816 at ./include/linux/alloc_tag.h... Clearing code tags before swap fixes the warning. This patch also fixes a potential invalid address dereference in alloc_tag_add_check() when CONFIG_MEM_ALLOC_PROFILING_DEBUG is set and ref->ct is CODETAG_EMPTY, which is defined as ((void *)1). Link: https://lkml.kernel.org/r/20241213013332.89910-1-00107082@163.com Fixes: `51f43d5d82` ("mm/codetag: swap tags when migrate pages") Signed-off-by: David Wang <00107082@163.com> Reported-by: kernel test robot <oliver.sang@intel.com> Closes: https://lore.kernel.org/oe-lkp/202412112227.df61ebb-lkp@intel.com Acked-by: Suren Baghdasaryan <surenb@google.com> Cc: Kent Overstreet <kent.overstreet@linux.dev> Cc: Yu Zhao <yuzhao@google.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:46 -08:00
Bart Van Assche	30c2de0a26	mm/vmstat: fix a W=1 clang compiler warning Fix the following clang compiler warning that is reported if the kernel is built with W=1: ./include/linux/vmstat.h:518:36: error: arithmetic between different enumeration types ('enum node_stat_item' and 'enum lru_list') [-Werror,-Wenum-enum-conversion] 518 \| return node_stat_name(NR_LRU_BASE + lru) + 3; // skip "nr_" \| ~~~~~~~~~~~ ^ ~~~ Link: https://lkml.kernel.org/r/20241212213126.1269116-1-bvanassche@acm.org Fixes: `9d7ea9a297` ("mm/vmstat: add helpers to get vmstat item names for each enum type") Signed-off-by: Bart Van Assche <bvanassche@acm.org> Cc: Konstantin Khlebnikov <koct9i@gmail.com> Cc: Nathan Chancellor <nathan@kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:46 -08:00
Usama Arif	42b2eb6983	mm: convert partially_mapped set/clear operations to be atomic Other page flags in the 2nd page, like PG_hwpoison and PG_anon_exclusive can get modified concurrently. Changes to other page flags might be lost if they are happening at the same time as non-atomic partially_mapped operations. Hence, make partially_mapped operations atomic. Link: https://lkml.kernel.org/r/20241212183351.1345389-1-usamaarif642@gmail.com Fixes: `8422acdc97` ("mm: introduce a pageflag for partially mapped folios") Reported-by: David Hildenbrand <david@redhat.com> Link: https://lore.kernel.org/all/e53b04ad-1827-43a2-a1ab-864c7efecf6e@redhat.com/ Signed-off-by: Usama Arif <usamaarif642@gmail.com> Acked-by: David Hildenbrand <david@redhat.com> Acked-by: Johannes Weiner <hannes@cmpxchg.org> Acked-by: Roman Gushchin <roman.gushchin@linux.dev> Cc: Barry Song <baohua@kernel.org> Cc: Domenico Cerasuolo <cerasuolodomenico@gmail.com> Cc: Jonathan Corbet <corbet@lwn.net> Cc: Matthew Wilcox <willy@infradead.org> Cc: Mike Rapoport (Microsoft) <rppt@kernel.org> Cc: Nico Pache <npache@redhat.com> Cc: Rik van Riel <riel@surriel.com> Cc: Ryan Roberts <ryan.roberts@arm.com> Cc: Shakeel Butt <shakeel.butt@linux.dev> Cc: Yu Zhao <yuzhao@google.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:45 -08:00
Ryusuke Konishi	6309b8ce98	nilfs2: fix buffer head leaks in calls to truncate_inode_pages() When block_invalidatepage was converted to block_invalidate_folio, the fallback to block_invalidatepage in folio_invalidate() if the address_space_operations method invalidatepage (currently invalidate_folio) was not set, was removed. Unfortunately, some pseudo-inodes in nilfs2 use empty_aops set by inode_init_always_gfp() as is, or explicitly set it to address_space_operations. Therefore, with this change, block_invalidatepage() is no longer called from folio_invalidate(), and as a result, the buffer_head structures attached to these pages/folios are no longer freed via try_to_free_buffers(). Thus, these buffer heads are now leaked by truncate_inode_pages(), which cleans up the page cache from inode evict(), etc. Three types of caches use empty_aops: gc inode caches and the DAT shadow inode used by GC, and b-tree node caches. Of these, b-tree node caches explicitly call invalidate_mapping_pages() during cleanup, which involves calling try_to_free_buffers(), so the leak was not visible during normal operation but worsened when GC was performed. Fix this issue by using address_space_operations with invalidate_folio set to block_invalidate_folio instead of empty_aops, which will ensure the same behavior as before. Link: https://lkml.kernel.org/r/20241212164556.21338-1-konishi.ryusuke@gmail.com Fixes: `7ba13abbd3` ("fs: Turn block_invalidatepage into block_invalidate_folio") Signed-off-by: Ryusuke Konishi <konishi.ryusuke@gmail.com> Cc: <stable@vger.kernel.org> [5.18+] Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:45 -08:00
Matthew Wilcox (Oracle)	a2e740e216	vmalloc: fix accounting with i915 If the caller of vmap() specifies VM_MAP_PUT_PAGES (currently only the i915 driver), we will decrement nr_vmalloc_pages and MEMCG_VMALLOC in vfree(). These counters are incremented by vmalloc() but not by vmap() so this will cause an underflow. Check the VM_MAP_PUT_PAGES flag before decrementing either counter. Link: https://lkml.kernel.org/r/20241211202538.168311-1-willy@infradead.org Fixes: `b944afc9d6` ("mm: add a VM_MAP_PUT_PAGES flag for vmap") Signed-off-by: Matthew Wilcox (Oracle) <willy@infradead.org> Acked-by: Johannes Weiner <hannes@cmpxchg.org> Reviewed-by: Shakeel Butt <shakeel.butt@linux.dev> Reviewed-by: Balbir Singh <balbirs@nvidia.com> Acked-by: Michal Hocko <mhocko@suse.com> Cc: Christoph Hellwig <hch@lst.de> Cc: Muchun Song <muchun.song@linux.dev> Cc: Roman Gushchin <roman.gushchin@linux.dev> Cc: "Uladzislau Rezki (Sony)" <urezki@gmail.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:45 -08:00
David Hildenbrand	faeec8e23c	mm/page_alloc: don't call pfn_to_page() on possibly non-existent PFN in split_large_buddy() In split_large_buddy(), we might call pfn_to_page() on a PFN that might not exist. In corner cases, such as when freeing the highest pageblock in the last memory section, this could result with CONFIG_SPARSEMEM && !CONFIG_SPARSEMEM_EXTREME in __pfn_to_section() returning NULL and __section_mem_map_addr() dereferencing that NULL pointer. Let's fix it, and avoid doing a pfn_to_page() call for the first iteration, where we already have the page. So far this was found by code inspection, but let's just CC stable as the fix is easy. Link: https://lkml.kernel.org/r/20241210093437.174413-1-david@redhat.com Fixes: `fd919a85cd` ("mm: page_isolation: prepare for hygienic freelists") Signed-off-by: David Hildenbrand <david@redhat.com> Reported-by: Vlastimil Babka <vbabka@suse.cz> Closes: https://lkml.kernel.org/r/e1a898ba-a717-4d20-9144-29df1a6c8813@suse.cz Reviewed-by: Vlastimil Babka <vbabka@suse.cz> Reviewed-by: Zi Yan <ziy@nvidia.com> Acked-by: Johannes Weiner <hannes@cmpxchg.org> Cc: Yu Zhao <yuzhao@google.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:45 -08:00
Lorenzo Stoakes	8ac662f5da	fork: avoid inappropriate uprobe access to invalid mm If dup_mmap() encounters an issue, currently uprobe is able to access the relevant mm via the reverse mapping (in build_map_info()), and if we are very unlucky with a race window, observe invalid XA_ZERO_ENTRY state which we establish as part of the fork error path. This occurs because uprobe_write_opcode() invokes anon_vma_prepare() which in turn invokes find_mergeable_anon_vma() that uses a VMA iterator, invoking vma_iter_load() which uses the advanced maple tree API and thus is able to observe XA_ZERO_ENTRY entries added to dup_mmap() in commit `d240629148` ("fork: use __mt_dup() to duplicate maple tree in dup_mmap()"). This change was made on the assumption that only process tear-down code would actually observe (and make use of) these values. However this very unlikely but still possible edge case with uprobes exists and unfortunately does make these observable. The uprobe operation prevents races against the dup_mmap() operation via the dup_mmap_sem semaphore, which is acquired via uprobe_start_dup_mmap() and dropped via uprobe_end_dup_mmap(), and held across register_for_each_vma() prior to invoking build_map_info() which does the reverse mapping lookup. Currently these are acquired and dropped within dup_mmap(), which exposes the race window prior to error handling in the invoking dup_mm() which tears down the mm. We can avoid all this by just moving the invocation of uprobe_start_dup_mmap() and uprobe_end_dup_mmap() up a level to dup_mm() and only release this lock once the dup_mmap() operation succeeds or clean up is done. This means that the uprobe code can never observe an incompletely constructed mm and resolves the issue in this case. 
Link: https://lkml.kernel.org/r/20241210172412.52995-1-lorenzo.stoakes@oracle.com Fixes: `d240629148` ("fork: use __mt_dup() to duplicate maple tree in dup_mmap()") Signed-off-by: Lorenzo Stoakes <lorenzo.stoakes@oracle.com> Reported-by: syzbot+2d788f4f7cb660dac4b7@syzkaller.appspotmail.com Closes: https://lore.kernel.org/all/6756d273.050a0220.2477f.003d.GAE@google.com/ Cc: Adrian Hunter <adrian.hunter@intel.com> Cc: Alexander Shishkin <alexander.shishkin@linux.intel.com> Cc: Arnaldo Carvalho de Melo <acme@kernel.org> Cc: Ian Rogers <irogers@google.com> Cc: Ingo Molnar <mingo@redhat.com> Cc: Jann Horn <jannh@google.com> Cc: Jiri Olsa <jolsa@kernel.org> Cc: Kan Liang <kan.liang@linux.intel.com> Cc: Liam R. Howlett <Liam.Howlett@Oracle.com> Cc: Mark Rutland <mark.rutland@arm.com> Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Namhyung Kim <namhyung@kernel.org> Cc: Oleg Nesterov <oleg@redhat.com> Cc: Peng Zhang <zhangpeng.00@bytedance.com> Cc: Peter Zijlstra <peterz@infradead.org> Cc: Vlastimil Babka <vbabka@suse.cz> Cc: David Hildenbrand <david@redhat.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:44 -08:00
Edward Adam Davis	901ce9705f	nilfs2: prevent use of deleted inode syzbot reported a WARNING in nilfs_rmdir. [1] Because the inode bitmap is corrupted, an inode with an inode number that should exist as a ".nilfs" file was reassigned by nilfs_mkdir for "file0", causing an inode duplication during execution. And this causes an underflow of i_nlink in rmdir operations. The inode is used twice by the same task to unmount and remove directories ".nilfs" and "file0", which triggers the warning in nilfs_rmdir. To avoid this issue, check i_nlink in nilfs_iget(); if it is 0, the inode has been deleted, and iput is executed to reclaim it. [1] WARNING: CPU: 1 PID: 5824 at fs/inode.c:407 drop_nlink+0xc4/0x110 fs/inode.c:407 ... Call Trace: <TASK> nilfs_rmdir+0x1b0/0x250 fs/nilfs2/namei.c:342 vfs_rmdir+0x3a3/0x510 fs/namei.c:4394 do_rmdir+0x3b5/0x580 fs/namei.c:4453 __do_sys_rmdir fs/namei.c:4472 [inline] __se_sys_rmdir fs/namei.c:4470 [inline] __x64_sys_rmdir+0x47/0x50 fs/namei.c:4470 do_syscall_x64 arch/x86/entry/common.c:52 [inline] do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83 entry_SYSCALL_64_after_hwframe+0x77/0x7f Link: https://lkml.kernel.org/r/20241209065759.6781-1-konishi.ryusuke@gmail.com Fixes: `d25006523d` ("nilfs2: pathname operations") Signed-off-by: Ryusuke Konishi <konishi.ryusuke@gmail.com> Reported-by: syzbot+9260555647a5132edd48@syzkaller.appspotmail.com Closes: https://syzkaller.appspot.com/bug?extid=9260555647a5132edd48 Tested-by: syzbot+9260555647a5132edd48@syzkaller.appspotmail.com Signed-off-by: Edward Adam Davis <eadavis@qq.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:44 -08:00
Kairui Song	74363ec674	zram: fix uninitialized ZRAM not releasing backing device Setting backing device is done before ZRAM initialization. If we set the backing device, then remove the ZRAM module without initializing the device, the backing device reference will be leaked and the device will be held forever. Fix this by always resetting the ZRAM fully on rmmod or reset store. Link: https://lkml.kernel.org/r/20241209165717.94215-3-ryncsn@gmail.com Fixes: `013bf95a83` ("zram: add interface to specif backing device") Signed-off-by: Kairui Song <kasong@tencent.com> Reported-by: Desheng Wu <deshengwu@tencent.com> Suggested-by: Sergey Senozhatsky <senozhatsky@chromium.org> Reviewed-by: Sergey Senozhatsky <senozhatsky@chromium.org> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:44 -08:00
Kairui Song	be48c412f6	zram: refuse to use zero sized block device as backing device Patch series "zram: fix backing device setup issue", v2. This series fixes two bugs of backing device setting: - ZRAM should reject using a zero sized device (or the uninitialized ZRAM device itself) as the backing device. - Fix backing device leaking when removing an uninitialized ZRAM device. This patch (of 2): Setting a zero sized block device as backing device is pointless, and one can easily create a recursive loop by setting the uninitialized ZRAM device itself as its own backing device by (zram0 is uninitialized): echo /dev/zram0 > /sys/block/zram0/backing_dev It's definitely a wrong config, and the module will pin itself; the kernel should refuse to do so in the first place. By refusing to use a zero sized device we avoid misuse cases including the one above. Link: https://lkml.kernel.org/r/20241209165717.94215-1-ryncsn@gmail.com Link: https://lkml.kernel.org/r/20241209165717.94215-2-ryncsn@gmail.com Fixes: `013bf95a83` ("zram: add interface to specif backing device") Signed-off-by: Kairui Song <kasong@tencent.com> Reported-by: Desheng Wu <deshengwu@tencent.com> Reviewed-by: Sergey Senozhatsky <senozhatsky@chromium.org> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:44 -08:00
Zi Yan	c51a4f11e6	mm: use clear_user_(high)page() for arch with special user folio handling Some architectures have special handling after clearing user folios: architectures, which set cpu_dcache_is_aliasing() to true, require flushing dcache; arc, which sets cpu_icache_is_aliasing() to true, changes folio->flags to make icache coherent to dcache. So __GFP_ZERO using only clear_page() is not enough to zero user folios and clear_user_(high)page() must be used. Otherwise, user data will be corrupted. Fix it by always clearing user folios with clear_user_(high)page() when cpu_dcache_is_aliasing() is true or cpu_icache_is_aliasing() is true. Rename alloc_zeroed() to user_alloc_needs_zeroing() and invert the logic to clarify its intent. Link: https://lkml.kernel.org/r/20241209182326.2955963-2-ziy@nvidia.com Fixes: `5708d96da2` ("mm: avoid zeroing user movable page twice with init_on_alloc=1") Signed-off-by: Zi Yan <ziy@nvidia.com> Reported-by: Geert Uytterhoeven <geert+renesas@glider.be> Closes: https://lore.kernel.org/linux-mm/CAMuHMdV1hRp_NtR5YnJo=HsfgKQeH91J537Gh4gKk3PFZhSkbA@mail.gmail.com/ Tested-by: Geert Uytterhoeven <geert+renesas@glider.be> Acked-by: Vlastimil Babka <vbabka@suse.cz> Cc: Alexander Potapenko <glider@google.com> Cc: David Hildenbrand <david@redhat.com> Cc: John Hubbard <jhubbard@nvidia.com> Cc: Kees Cook <keescook@chromium.org> Cc: Kefeng Wang <wangkefeng.wang@huawei.com> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Matthew Wilcox <willy@infradead.org> Cc: Miaohe Lin <linmiaohe@huawei.com> Cc: Ryan Roberts <ryan.roberts@arm.com> Cc: Vineet Gupta <vgupta@kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:43 -08:00
Zi Yan	5c0541e11c	mm: introduce cpu_icache_is_aliasing() across all architectures In commit `eacd0e950d` ("ARC: [mm] Lazy D-cache flush (non aliasing VIPT)"), arc adds the need to flush dcache to make icache see the code page change. This also requires special handling for clear_user_(high)page(). Introduce cpu_icache_is_aliasing() to make MM code query special clear_user_(high)page() easier. This will be used by the following commit. Link: https://lkml.kernel.org/r/20241209182326.2955963-1-ziy@nvidia.com Fixes: `5708d96da2` ("mm: avoid zeroing user movable page twice with init_on_alloc=1") Signed-off-by: Zi Yan <ziy@nvidia.com> Suggested-by: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Reviewed-by: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Acked-by: Vlastimil Babka <vbabka@suse.cz> Cc: Alexander Potapenko <glider@google.com> Cc: David Hildenbrand <david@redhat.com> Cc: Geert Uytterhoeven <geert+renesas@glider.be> Cc: John Hubbard <jhubbard@nvidia.com> Cc: Kees Cook <keescook@chromium.org> Cc: Kefeng Wang <wangkefeng.wang@huawei.com> Cc: Matthew Wilcox <willy@infradead.org> Cc: Miaohe Lin <linmiaohe@huawei.com> Cc: Ryan Roberts <ryan.roberts@arm.com> Cc: Vineet Gupta <vgupta@kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:43 -08:00
Petr Malat	31c5629920	mm: add RCU annotation to pte_offset_map(_lock) RCU lock is taken by ___pte_offset_map() unless it returns NULL. Add this information to its inline callers to avoid sparse warning about context imbalance in pte_unmap(). Link: https://lkml.kernel.org/r/20241210000604.700710-1-oss@malat.biz Signed-off-by: Petr Malat <oss@malat.biz> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:43 -08:00
Lorenzo Stoakes	42c4e4b20d	mm: correctly reference merged VMA On second merge attempt on mmap() we incorrectly discard the possibly merged VMA, resulting in a possible use-after-free (and most certainly a reference to the wrong VMA) in this instance in the subsequent __mmap_complete() invocation. Correct this mistake by reassigning vma correctly if a merge succeeds in this case. Link: https://lkml.kernel.org/r/20241206215229.244413-1-lorenzo.stoakes@oracle.com Fixes: `5ac87a885a` ("mm: defer second attempt at merge on mmap()") Signed-off-by: Lorenzo Stoakes <lorenzo.stoakes@oracle.com> Suggested-by: Jann Horn <jannh@google.com> Reported-by: syzbot+91cf8da9401355f946c3@syzkaller.appspotmail.com Closes: https://lore.kernel.org/all/67536a25.050a0220.a30f1.0149.GAE@google.com/ Reviewed-by: Liam R. Howlett <Liam.Howlett@Oracle.com> Reviewed-by: Vlastimil Babka <vbabka@suse.cz> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:42 -08:00
Kefeng Wang	f5d09de9f1	mm: use aligned address in copy_user_gigantic_page() In the current kernel, hugetlb_wp() calls copy_user_large_folio() with the fault address, which may not be aligned with the huge page size. Then, copy_user_large_folio() may call copy_user_gigantic_page() with the address, while copy_user_gigantic_page() requires the address to be huge page size aligned. So, this may cause memory corruption or information leak. Additionally, use the more obvious naming 'addr_hint' instead of 'addr' for copy_user_gigantic_page(). Link: https://lkml.kernel.org/r/20241028145656.932941-2-wangkefeng.wang@huawei.com Fixes: `530dd9926d` ("mm: memory: improve copy_user_large_folio()") Signed-off-by: Kefeng Wang <wangkefeng.wang@huawei.com> Reviewed-by: David Hildenbrand <david@redhat.com> Cc: Huang Ying <ying.huang@intel.com> Cc: Matthew Wilcox (Oracle) <willy@infradead.org> Cc: Muchun Song <muchun.song@linux.dev> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:42 -08:00
Kefeng Wang	8aca2bc96c	mm: use aligned address in clear_gigantic_page() In the current kernel, hugetlb_no_page() calls folio_zero_user() with the fault address, which may not be aligned with the huge page size. Then, folio_zero_user() may call clear_gigantic_page() with the address, while clear_gigantic_page() requires the address to be huge page size aligned. So, this may cause memory corruption or information leak. Additionally, use the more obvious naming 'addr_hint' instead of 'addr' for clear_gigantic_page(). Link: https://lkml.kernel.org/r/20241028145656.932941-1-wangkefeng.wang@huawei.com Fixes: `78fefd04c1` ("mm: memory: convert clear_huge_page() to folio_zero_user()") Signed-off-by: Kefeng Wang <wangkefeng.wang@huawei.com> Reviewed-by: "Huang, Ying" <ying.huang@intel.com> Reviewed-by: David Hildenbrand <david@redhat.com> Cc: Matthew Wilcox (Oracle) <willy@infradead.org> Cc: Muchun Song <muchun.song@linux.dev> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:42 -08:00
Hugh Dickins	dad2dc9c92	mm: shmem: fix ShmemHugePages at swapout /proc/meminfo ShmemHugePages has been showing overlarge amounts (more than Shmem) after swapping out THPs: we forgot to update NR_SHMEM_THPS. Add shmem_update_stats(), to avoid repetition, and risk of making that mistake again: the call from shmem_delete_from_page_cache() is the bugfix; the call from shmem_replace_folio() is reassuring, but not really a bugfix (replace corrects misplaced swapin readahead, but huge swapin readahead would be a mistake). Link: https://lkml.kernel.org/r/5ba477c8-a569-70b5-923e-09ab221af45b@google.com Fixes: `809bc86517` ("mm: shmem: support large folio swap out") Signed-off-by: Hugh Dickins <hughd@google.com> Reviewed-by: Shakeel Butt <shakeel.butt@linux.dev> Reviewed-by: Yosry Ahmed <yosryahmed@google.com> Reviewed-by: Baolin Wang <baolin.wang@linux.alibaba.com> Tested-by: Baolin Wang <baolin.wang@linux.alibaba.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:42 -08:00
Heming Zhao	7782e3b3b0	ocfs2: fix the space leak in LA when releasing LA Commit `30dd3478c3` ("ocfs2: correctly use ocfs2_find_next_zero_bit()") introduced an issue: ocfs2_sync_local_to_main() ignores the last contiguous free bits, which causes an OCFS2 volume to lose the last free clusters of the LA window during the release routine. Note that because commit `dfe6c5692f` ("ocfs2: fix the la space leak when unmounting an ocfs2 volume") was reverted, this commit is a replacement fix for commit `dfe6c5692f`. Link: https://lkml.kernel.org/r/20241205104835.18223-3-heming.zhao@suse.com Fixes: `30dd3478c3` ("ocfs2: correctly use ocfs2_find_next_zero_bit()") Signed-off-by: Heming Zhao <heming.zhao@suse.com> Suggested-by: Joseph Qi <joseph.qi@linux.alibaba.com> Reviewed-by: Joseph Qi <joseph.qi@linux.alibaba.com> Cc: Mark Fasheh <mark@fasheh.com> Cc: Joel Becker <jlbec@evilplan.org> Cc: Junxiao Bi <junxiao.bi@oracle.com> Cc: Changwei Ge <gechangwei@live.cn> Cc: Jun Piao <piaojun@huawei.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:41 -08:00
Heming Zhao	1a72d2ebee	ocfs2: revert "ocfs2: fix the la space leak when unmounting an ocfs2 volume" Patch series "Revert ocfs2 commit `dfe6c5692f` and provide a new fix". SUSE QA team detected a mistake in my commit `dfe6c5692f` ("ocfs2: fix the la space leak when unmounting an ocfs2 volume"). I am very sorry for my error. (If my eyes are correct) From the mailing list mails, this patch shouldn't be applied to 4.19 5.4 5.10 5.15 6.1 6.6, and these branches should perform a revert operation. Reason for revert: In commit `dfe6c5692f`, I mistakenly wrote: "This bug has existed since the initial OCFS2 code.". The statement is wrong. The correct introduction commit is `30dd3478c3`. IOW, if the branch doesn't include `30dd3478c3`, `dfe6c5692f` should also not be included. This reverts commit `dfe6c5692f` ("ocfs2: fix the la space leak when unmounting an ocfs2 volume"). In commit `dfe6c5692f`, the commit log "This bug has existed since the initial OCFS2 code." is wrong. The correct introduction commit is `30dd3478c3` ("ocfs2: correctly use ocfs2_find_next_zero_bit()"). The influence of commit `dfe6c5692f` is that it provides a correct fix for the latest kernel. However, it shouldn't be pushed to stable branches. Let's use this commit to revert all branches that include `dfe6c5692f` and use a new fix method to fix commit `30dd3478c3`. Link: https://lkml.kernel.org/r/20241205104835.18223-1-heming.zhao@suse.com Link: https://lkml.kernel.org/r/20241205104835.18223-2-heming.zhao@suse.com Fixes: `dfe6c5692f` ("ocfs2: fix the la space leak when unmounting an ocfs2 volume") Signed-off-by: Heming Zhao <heming.zhao@suse.com> Reviewed-by: Joseph Qi <joseph.qi@linux.alibaba.com> Cc: Mark Fasheh <mark@fasheh.com> Cc: Joel Becker <jlbec@evilplan.org> Cc: Junxiao Bi <junxiao.bi@oracle.com> Cc: Changwei Ge <gechangwei@live.cn> Cc: Jun Piao <piaojun@huawei.com> Cc: <stable@vger.kernel.org> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:41 -08:00
Huang Ying	da5bd7fa78	mailmap: add entry for Ying Huang Map my old company email to my personal email. Link: https://lkml.kernel.org/r/20241205124201.529308-1-huang.ying.caritas@gmail.com Signed-off-by: "Huang, Ying" <huang.ying.caritas@gmail.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:41 -08:00
Isaac J. Manjarres	6a75f19af1	selftests/memfd: run sysctl tests when PID namespace support is enabled The sysctl tests for vm.memfd_noexec rely on the kernel to support PID namespaces (i.e. the kernel is built with CONFIG_PID_NS=y). If the kernel the test runs on does not support PID namespaces, the first sysctl test will fail when attempting to spawn a new thread in a new PID namespace, abort the test, preventing the remaining tests from being run. This is not desirable, as not all kernels need PID namespaces, but can still use the other features provided by memfd. Therefore, only run the sysctl tests if the kernel supports PID namespaces. Otherwise, skip those tests and emit an informative message to let the user know why the sysctl tests are not being run. Link: https://lkml.kernel.org/r/20241205192943.3228757-1-isaacmanjarres@google.com Fixes: `11f75a0144` ("selftests/memfd: add tests for MFD_NOEXEC_SEAL MFD_EXEC") Signed-off-by: Isaac J. Manjarres <isaacmanjarres@google.com> Reviewed-by: Jeff Xu <jeffxu@google.com> Cc: Suren Baghdasaryan <surenb@google.com> Cc: Kalesh Singh <kaleshsingh@google.com> Cc: <stable@vger.kernel.org> [6.6+] Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:41 -08:00
Lorenzo Stoakes	dbf8be8218	docs/mm: add VMA locks documentation Locking around VMAs is complicated and confusing. While we have a number of disparate comments scattered around the place, we seem to be reaching a level of complexity that justifies a serious effort at clearly documenting how locks are expected to be used when it comes to interacting with mm_struct and vm_area_struct objects. This is especially pertinent as regards the efforts to find sensible abstractions for these fundamental objects in kernel rust code whose compiler strictly requires some means of expressing these rules (and through this expression, self-document these requirements as well as enforce them). The document limits scope to mmap and VMA locks and those that are immediately adjacent and relevant to them - so additionally covers page table locking as this is so very closely tied to VMA operations (and relies upon us handling these correctly). The document tries to cover some of the nastier and more confusing edge cases and concerns especially around lock ordering and page table teardown. The document is split between generally useful information for users of mm interfaces, and separately a section intended for mm kernel developers providing a discussion around internal implementation details. 
[lorenzo.stoakes@oracle.com: v3] Link: https://lkml.kernel.org/r/20241114205402.859737-1-lorenzo.stoakes@oracle.com [lorenzo.stoakes@oracle.com: docs/mm: minor corrections] Link: https://lkml.kernel.org/r/d3de735a-25ae-4eb2-866c-a9624fe6f795@lucifer.local [jannh@google.com: docs/mm: add more warnings around page table access] Link: https://lkml.kernel.org/r/20241118-vma-docs-addition1-onv3-v2-1-c9d5395b72ee@google.com Link: https://lkml.kernel.org/r/20241108135708.48567-1-lorenzo.stoakes@oracle.com Signed-off-by: Lorenzo Stoakes <lorenzo.stoakes@oracle.com> Acked-by: Qi Zheng <zhengqi.arch@bytedance.com> Acked-by: Mike Rapoport (Microsoft) <rppt@kernel.org> Reviewed-by: Bagas Sanjaya <bagasdotme@gmail.com> Reviewed-by: Jann Horn <jannh@google.com> Cc: Alice Ryhl <aliceryhl@google.com> Cc: Boqun Feng <boqun.feng@gmail.com> Cc: Hillf Danton <hdanton@sina.com> Cc: Jonathan Corbet <corbet@lwn.net> Cc: Liam R. Howlett <Liam.Howlett@Oracle.com> Cc: Matthew Wilcox <willy@infradead.org> Cc: SeongJae Park <sj@kernel.org> Cc: Suren Baghdasaryan <surenb@google.com> Cc: Vlastimil Babka <vbabka@suse.cz> Signed-off-by: Andrew Morton <akpm@linux-foundation.org>	2024-12-18 19:04:41 -08:00
Jakub Kicinski	dbfca1641e	Merge tag 'linux-can-fixes-for-6.13-20241218' of git://git.kernel.org/pub/scm/linux/kernel/git/mkl/linux-can Marc Kleine-Budde says: ==================== pull-request: can 2024-12-18 There are 2 patches by Matthias Schiffer for the m_can_pci driver that handles the m_can cores found on the Intel Elkhart Lake processor. They fix the initialization and the interrupt handling under high CAN bus load. * tag 'linux-can-fixes-for-6.13-20241218' of git://git.kernel.org/pub/scm/linux/kernel/git/mkl/linux-can: can: m_can: fix missed interrupts with m_can_pci can: m_can: set init flag earlier in probe ==================== Link: https://patch.msgid.link/20241218121722.2311963-1-mkl@pengutronix.de Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 17:51:39 -08:00
Martin Hou	5c964c8a97	net: usb: qmi_wwan: add Quectel RG255C Add support for Quectel RG255C which is based on Qualcomm SDX35 chip. The composition is DM / NMEA / AT / QMI. T: Bus=01 Lev=01 Prnt=01 Port=04 Cnt=01 Dev#= 2 Spd=480 MxCh= 0 D: Ver= 2.01 Cls=00(>ifc ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=2c7c ProdID=0316 Rev= 5.15 S: Manufacturer=Quectel S: Product=RG255C-CN S: SerialNumber=c68192c1 C:* #Ifs= 4 Cfg#= 1 Atr=a0 MxPwr=500mA I:* If#= 0 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=ff Prot=30 Driver=option E: Ad=01(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=81(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 1 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=82(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 2 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=84(I) Atr=03(Int.) MxPS= 10 Ivl=32ms E: Ad=83(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=03(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 3 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=50 Driver=qmi_wwan E: Ad=86(I) Atr=03(Int.) MxPS= 8 Ivl=32ms E: Ad=85(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=04(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms Signed-off-by: Martin Hou <martin.hou@foxmail.com> Link: https://patch.msgid.link/tencent_17DDD787B48E8A5AB8379ED69E23A0CD9309@qq.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 17:24:03 -08:00
Jann Horn	12d908116f	io_uring: Fix registered ring file refcount leak Currently, io_uring_unreg_ringfd() (which cleans up registered rings) is only called on exit, but __io_uring_free (which frees the tctx in which the registered ring pointers are stored) is also called on execve (via begin_new_exec -> io_uring_task_cancel -> __io_uring_cancel -> io_uring_cancel_generic -> __io_uring_free). This means: A process going through execve while having registered rings will leak references to the rings' `struct file`. Fix it by zapping registered rings on execve(). This is implemented by moving the io_uring_unreg_ringfd() from io_uring_files_cancel() into its callee __io_uring_cancel(), which is called from io_uring_task_cancel() on execve. This could probably be exploited on 32-bit kernels by leaking 2^32 references to the same ring, because the file refcount is stored in a pointer-sized field and get_file() doesn't have protection against refcount overflow, just a WARN_ONCE(); but on 64-bit it should have no impact beyond a memory leak. Cc: stable@vger.kernel.org Fixes: `e7a6c00dc7` ("io_uring: add support for registering ring file descriptors") Signed-off-by: Jann Horn <jannh@google.com> Link: https://lore.kernel.org/r/20241218-uring-reg-ring-cleanup-v1-1-8f63e999045b@google.com Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-18 18:19:33 -07:00
Arnd Bergmann	cff865c700	net: phy: avoid undefined behavior in *_led_polarity_set() gcc runs into undefined behavior at the end of the three led_polarity_set() callback functions if it were called with a zero 'modes' argument and it just ends the function there without returning from it. This gets flagged by 'objtool' as a function that continues on to the next one: drivers/net/phy/aquantia/aquantia_leds.o: warning: objtool: aqr_phy_led_polarity_set+0xf: can't find jump dest instruction at .text+0x5d9 drivers/net/phy/intel-xway.o: warning: objtool: xway_gphy_led_polarity_set() falls through to next function xway_gphy_config_init() drivers/net/phy/mxl-gpy.o: warning: objtool: gpy_led_polarity_set() falls through to next function gpy_led_hw_control_get() There is no point to micro-optimize the behavior here to save a single-digit number of bytes in the kernel, so just change this to a "return -EINVAL" as we do when any unexpected bits are set. Fixes: `1758af47b9` ("net: phy: intel-xway: add support for PHY LEDs") Fixes: `9d55e68b19` ("net: phy: aquantia: correctly describe LED polarity override") Fixes: `eb89c79c1b` ("net: phy: mxl-gpy: correctly describe LED polarity") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Reviewed-by: Andrew Lunn <andrew@lunn.ch> Link: https://patch.msgid.link/20241217081056.238792-1-arnd@kernel.org Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-18 16:50:23 -08:00
Hans de Goede	b3ded6072c	power: supply: bq24190: Fix BQ24296 Vbus regulator support There are 2 issues with bq24296_set_otg_vbus(): 1. When writing the OTG_CONFIG bit it uses POC_CHG_CONFIG_SHIFT which should be POC_OTG_CONFIG_SHIFT. 2. When turning the regulator off it never turns charging back on. Note this must be done through bq24190_charger_set_charge_type(), to ensure that the charge_type property value of none/trickle/fast is honored. Resolve both issues to fix BQ24296 Vbus regulator support not working. Fixes: `b150a703b5` ("power: supply: bq24190_charger: Add support for BQ24296") Signed-off-by: Hans de Goede <hdegoede@redhat.com> Link: https://lore.kernel.org/r/20241116203648.169100-2-hdegoede@redhat.com Signed-off-by: Sebastian Reichel <sebastian.reichel@collabora.com>	2024-12-19 00:35:30 +01:00
Phil Sutter	70b6f46a4e	netfilter: ipset: Fix for recursive locking warning With CONFIG_PROVE_LOCKING, when creating a set of type bitmap:ip, adding it to a set of type list:set and populating it from iptables SET target triggers a kernel warning: | WARNING: possible recursive locking detected | 6.12.0-rc7-01692-g5e9a28f41134-dirty #594 Not tainted | -------------------------------------------- | ping/4018 is trying to acquire lock: | ffff8881094a6848 (&set->lock){+.-.}-{2:2}, at: ip_set_add+0x28c/0x360 [ip_set] | | but task is already holding lock: | ffff88811034c048 (&set->lock){+.-.}-{2:2}, at: ip_set_add+0x28c/0x360 [ip_set] This is a false alarm: ipset does not allow nested list:set type, so the loop in list_set_kadd() can never encounter the outer set itself. No other set type supports embedded sets, so this is the only case to consider. To avoid the false report, create a distinct lock class for list:set type ipset locks. Fixes: `f830837f0e` ("netfilter: ipset: list:set set type support") Signed-off-by: Phil Sutter <phil@nwl.cc> Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>	2024-12-19 00:28:47 +01:00
David Laight	cf2c97423a	ipvs: Fix clamp() of ip_vs_conn_tab on small memory systems The 'max_avail' value is calculated from the system memory size using order_base_2(). order_base_2(x) is defined as '(x) ? fn(x) : 0'. The compiler generates two copies of the code that follows and then expands clamp(max, min, PAGE_SHIFT - 12) (11 on 32bit). This triggers a compile-time assert since min is 5. In reality a system would have to have less than 512MB memory for the bounds passed to clamp to be reversed. Swap the order of the arguments to clamp() to avoid the warning. Replace the clamp_val() on the line below with clamp(). clamp_val() is just 'an accident waiting to happen' and not needed here. Detected by compile time checks added to clamp(), specifically: minmax.h: use BUILD_BUG_ON_MSG() for the lo < hi test in clamp() Reported-by: Linux Kernel Functional Testing <lkft@linaro.org> Closes: https://lore.kernel.org/all/CA+G9fYsT34UkGFKxus63H6UVpYi5GRZkezT9MRLfAbM3f6ke0g@mail.gmail.com/ Fixes: `4f325e2627` ("ipvs: dynamically limit the connection hash table") Tested-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org> Reviewed-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org> Signed-off-by: David Laight <david.laight@aculab.com> Acked-by: Julian Anastasov <ja@ssi.bg> Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>	2024-12-18 23:37:27 +01:00
Linus Torvalds	eabcdba3ad	Merge tag 'for-6.13-rc3-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux Pull btrfs fixes from David Sterba: - tree-checker catches invalid number of inline extent references - zoned mode fixes: - enhance zone append IO command so it also detects emulated writes - handle bio splitting at sectorsize boundary - when deleting a snapshot, fix a condition for visiting nodes in reloc trees * tag 'for-6.13-rc3-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/kdave/linux: btrfs: tree-checker: reject inline extent items with 0 ref count btrfs: split bios to the fs sector size boundary btrfs: use bio_is_zone_append() in the completion handler btrfs: fix improper generation check in snapshot delete	2024-12-18 14:17:21 -08:00
Linus Torvalds	b69810f38c	Merge tag 'cxl-fixes-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/cxl/cxl Pull cxl fixes from Ira Weiny: - prevent probe failure when non-critical RAS unmasking fails - fix CXL 1.1 link status sysfs attribute - fix 4 way (and greater) switch interleave region creation * tag 'cxl-fixes-6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/cxl/cxl: cxl/region: Fix region creation for greater than x2 switches cxl/pci: Check dport->regs.rcd_pcie_cap availability before accessing cxl/pci: Fix potential bogus return value upon successful probing	2024-12-18 12:52:57 -08:00
Alex Deucher	3abb660f9e	drm/amdgpu/nbio7.0: fix IP version check Use the helper function rather than reading it directly. Reviewed-by: Yang Wang <kevinyang.wang@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `0ec43fbece`) Cc: stable@vger.kernel.org	2024-12-18 15:20:57 -05:00
Mario Limonciello	a7f9d98eb1	drm/amd: Update strapping for NBIO 2.5.0 This helps to avoid a spurious PME event on hotplug to Azalia. Cc: Vijendar Mukunda <Vijendar.Mukunda@amd.com> Reported-and-tested-by: ionut_n2001@yahoo.com Closes: https://bugzilla.kernel.org/show_bug.cgi?id=215884 Tested-by: Gabriel Marcano <gabemarcano@yahoo.com> Acked-by: Alex Deucher <alexander.deucher@amd.com> Link: https://lore.kernel.org/r/20241211024414.7840-1-mario.limonciello@amd.com Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `3f6f237b9d`) Cc: stable@vger.kernel.org	2024-12-18 15:20:06 -05:00
Linus Torvalds	397d1d88af	Merge tag 'selinux-pr-20241217' of git://git.kernel.org/pub/scm/linux/kernel/git/pcmoore/selinux Pull selinux fix from Paul Moore: "One small SELinux patch to improve our handling of unknown extended permissions by safely ignoring them. Not only does this make it easier to support newer SELinux policy on older kernels in the future, it removes two BUG() calls from the SELinux code." * tag 'selinux-pr-20241217' of git://git.kernel.org/pub/scm/linux/kernel/git/pcmoore/selinux: selinux: ignore unknown extended permissions	2024-12-18 12:10:15 -08:00
Huacai Chen	0674188f2f	ACPI: EC: Enable EC support on LoongArch by default Commit `a6021aa24f` ("ACPI: EC: make EC support compile-time conditional") only enables ACPI_EC on X86 by default, but the embedded controller is also widely used on LoongArch laptops so we also enable ACPI_EC for LoongArch. The laptop driver cannot work without EC, so also update the dependency of LOONGSON_LAPTOP to let it depend on ACPI_EC. Fixes: `a6021aa24f` ("ACPI: EC: make EC support compile-time conditional") Reported-by: Xiaotian Wu <wuxiaotian@loongson.cn> Tested-by: Binbin Zhou <zhoubinbin@loongson.cn> Signed-off-by: Huacai Chen <chenhuacai@loongson.cn> Link: https://patch.msgid.link/20241217073704.3339587-1-chenhuacai@loongson.cn [ rjw: Added Fixes: ] Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>	2024-12-18 20:23:59 +01:00
Steven Rostedt	8cd63406d0	trace/ring-buffer: Do not use TP_printk() formatting for boot mapped buffers The TP_printk() of a TRACE_EVENT() is a generic printf format that any developer can create for their event. It may include pointers to strings and such. A boot mapped buffer may contain data from a previous kernel where the strings addresses are different. One solution is to copy the event content and update the pointers by the recorded delta, but a simpler solution (for now) is to just use the print_fields() function to print these events. The print_fields() function just iterates the fields and prints them according to what type they are, and ignores the TP_printk() format from the event itself. To understand the difference, when printing via TP_printk() the output looks like this: 4582.696626: kmem_cache_alloc: call_site=getname_flags+0x47/0x1f0 ptr=00000000e70e10e0 bytes_req=4096 bytes_alloc=4096 gfp_flags=GFP_KERNEL node=-1 accounted=false 4582.696629: kmem_cache_alloc: call_site=alloc_empty_file+0x6b/0x110 ptr=0000000095808002 bytes_req=360 bytes_alloc=384 gfp_flags=GFP_KERNEL node=-1 accounted=false 4582.696630: kmem_cache_alloc: call_site=security_file_alloc+0x24/0x100 ptr=00000000576339c3 bytes_req=16 bytes_alloc=16 gfp_flags=GFP_KERNEL|__GFP_ZERO node=-1 accounted=false 4582.696653: kmem_cache_free: call_site=do_sys_openat2+0xa7/0xd0 ptr=00000000e70e10e0 name=names_cache But when printing via print_fields() (echo 1 > /sys/kernel/tracing/options/fields) the same event output looks like this: 4582.696626: kmem_cache_alloc: call_site=0xffffffff92d10d97 (-1831793257) ptr=0xffff9e0e8571e000 (-107689771147264) bytes_req=0x1000 (4096) bytes_alloc=0x1000 (4096) gfp_flags=0xcc0 (3264) node=0xffffffff (-1) accounted=(0) 4582.696629: kmem_cache_alloc: call_site=0xffffffff92d0250b (-1831852789) ptr=0xffff9e0e8577f800 (-107689770747904) bytes_req=0x168 (360) bytes_alloc=0x180 (384) gfp_flags=0xcc0 (3264) node=0xffffffff (-1) accounted=(0) 4582.696630: 
kmem_cache_alloc: call_site=0xffffffff92efca74 (-1829778828) ptr=0xffff9e0e8d35d3b0 (-107689640864848) bytes_req=0x10 (16) bytes_alloc=0x10 (16) gfp_flags=0xdc0 (3520) node=0xffffffff (-1) accounted=(0) 4582.696653: kmem_cache_free: call_site=0xffffffff92cfbea7 (-1831879001) ptr=0xffff9e0e8571e000 (-107689771147264) name=names_cache Cc: stable@vger.kernel.org Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Linus Torvalds <torvalds@linux-foundation.org> Link: https://lore.kernel.org/20241218141507.28389a1d@gandalf.local.home Fixes: `07714b4bb3` ("tracing: Handle old buffer mappings for event strings and functions") Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-18 14:20:38 -05:00
Edward Adam Davis	c58a812c8e	ring-buffer: Fix overflow in __rb_map_vma An overflow occurred when performing the following calculation: nr_pages = ((nr_subbufs + 1) << subbuf_order) - pgoff; Add a check before the calculation to avoid this problem. syzbot reported this as a slab-out-of-bounds in __rb_map_vma: BUG: KASAN: slab-out-of-bounds in __rb_map_vma+0x9ab/0xae0 kernel/trace/ring_buffer.c:7058 Read of size 8 at addr ffff8880767dd2b8 by task syz-executor187/5836 CPU: 0 UID: 0 PID: 5836 Comm: syz-executor187 Not tainted 6.13.0-rc2-syzkaller-00159-gf932fb9b4074 #0 Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 11/25/2024 Call Trace: <TASK> __dump_stack lib/dump_stack.c:94 [inline] dump_stack_lvl+0x116/0x1f0 lib/dump_stack.c:120 print_address_description mm/kasan/report.c:378 [inline] print_report+0xc3/0x620 mm/kasan/report.c:489 kasan_report+0xd9/0x110 mm/kasan/report.c:602 __rb_map_vma+0x9ab/0xae0 kernel/trace/ring_buffer.c:7058 ring_buffer_map+0x56e/0x9b0 kernel/trace/ring_buffer.c:7138 tracing_buffers_mmap+0xa6/0x120 kernel/trace/trace.c:8482 call_mmap include/linux/fs.h:2183 [inline] mmap_file mm/internal.h:124 [inline] __mmap_new_file_vma mm/vma.c:2291 [inline] __mmap_new_vma mm/vma.c:2355 [inline] __mmap_region+0x1786/0x2670 mm/vma.c:2456 mmap_region+0x127/0x320 mm/mmap.c:1348 do_mmap+0xc00/0xfc0 mm/mmap.c:496 vm_mmap_pgoff+0x1ba/0x360 mm/util.c:580 ksys_mmap_pgoff+0x32c/0x5c0 mm/mmap.c:542 __do_sys_mmap arch/x86/kernel/sys_x86_64.c:89 [inline] __se_sys_mmap arch/x86/kernel/sys_x86_64.c:82 [inline] __x64_sys_mmap+0x125/0x190 arch/x86/kernel/sys_x86_64.c:82 do_syscall_x64 arch/x86/entry/common.c:52 [inline] do_syscall_64+0xcd/0x250 arch/x86/entry/common.c:83 entry_SYSCALL_64_after_hwframe+0x77/0x7f The reproducer for this bug is: ------------------------8<------------------------- #include <fcntl.h> #include <stdlib.h> #include <unistd.h> #include <asm/types.h> #include <sys/mman.h> int main(int argc, char **argv) { int page_size = getpagesize(); int fd; void *meta; system("echo 1 > /sys/kernel/tracing/buffer_size_kb"); fd = open("/sys/kernel/tracing/per_cpu/cpu0/trace_pipe_raw", O_RDONLY); meta = mmap(NULL, page_size, PROT_READ, MAP_SHARED, fd, page_size * 5); } ------------------------>8------------------------- Cc: stable@vger.kernel.org Fixes: `117c39200d` ("ring-buffer: Introducing ring-buffer mapping functions") Link: https://lore.kernel.org/tencent_06924B6674ED771167C23CC336C097223609@qq.com Reported-by: syzbot+345e4443a21200874b18@syzkaller.appspotmail.com Closes: https://syzkaller.appspot.com/bug?extid=345e4443a21200874b18 Signed-off-by: Edward Adam Davis <eadavis@qq.com> Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-18 14:15:10 -05:00
Linus Torvalds	c061cf420d	Merge tag 'trace-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace Pull tracing fixes from Steven Rostedt: "Replace trace_check_vprintf() with test_event_printk() and ignore_event() The function test_event_printk() checks on boot up if the trace event printf() formats dereference any pointers, and if they do, it then looks at the arguments to make sure that the pointers they dereference will exist in the event on the ring buffer. If they do not, it issues a WARN_ON() as it is a likely bug. But this isn't the case for the strings that can be dereferenced with "%s", as some trace events (notably RCU and some IPI events) save a pointer to a static string in the ring buffer. As the string it points to lives as long as the kernel is running, it is not a bug to reference it, as it is guaranteed to be there when the event is read. But it is also possible (and a common bug) to point to some allocated string that could be freed before the trace event is read and the dereference is to bad memory. This case requires a run time check. The previous way to handle this was with trace_check_vprintf() that would process the printf format piece by piece and send what it didn't care about to vsnprintf() to handle arguments that were not strings. This kept it from having to reimplement vsnprintf(). But it relied on va_list implementation and for architectures that copied the va_list and did not pass it by reference, it wasn't even possible to do this check and it would be skipped. As 64bit x86 passed va_list by reference, most events were tested and this kept out bugs where strings would have been dereferenced after being freed. Instead of relying on the implementation of va_list, extend the boot up test_event_printk() function to validate all the "%s" strings that can be validated at boot, and for the few events that point to strings outside the ring buffer, flag both the event and the field that is dereferenced as "needs_test". 
Then before the event is printed, a call to ignore_event() is made, and if the event has the flag set, it iterates all its fields and for every field that is to be tested, it will read the pointer directly from the event in the ring buffer and make sure that it is valid. If the pointer is not valid, it will print a WARN_ON(), print out to the trace that the event has unsafe memory and ignore the print format. With this new update, the trace_check_vprintf() can be safely removed and now all events can be verified regardless of architecture" * tag 'trace-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace: tracing: Check "%s" dereference via the field and not the TP_printk format tracing: Add "%s" check in test_event_printk() tracing: Add missing helper functions in event pointer dereference check tracing: Fix test_event_printk() to process entire print argument	2024-12-18 10:03:33 -08:00
Michel Dänzer	85230ee36d	drm/amdgpu: Handle NULL bo->tbo.resource (again) in amdgpu_vm_bo_update Third time's the charm, I hope? Fixes: `d3116756a7` ("drm/ttm: rename bo->mem and make it a pointer") Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/3837 Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <mdaenzer@redhat.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `695c2c745e`) Cc: stable@vger.kernel.org	2024-12-18 13:02:03 -05:00
Christian König	8d1a13816e	drm/amdgpu: fix amdgpu_coredump The VM pointer might already be outdated when that function is called. Use the PASID to gather the information instead. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `57f812d171`) Cc: stable@vger.kernel.org	2024-12-18 13:01:54 -05:00
Alex Deucher	9e752ee26c	drm/amdgpu/smu14.0.2: fix IP version check Use the helper function rather than reading it directly. Reviewed-by: Yang Wang <kevinyang.wang@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `8f2cd1067a`) Cc: stable@vger.kernel.org	2024-12-18 13:01:48 -05:00
Alex Deucher	41be00f839	drm/amdgpu/gfx12: fix IP version check Use the helper function rather than reading it directly. Reviewed-by: Yang Wang <kevinyang.wang@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `f1fd1d0f40`) Cc: stable@vger.kernel.org	2024-12-18 13:01:43 -05:00
Alex Deucher	6ebc5b9219	drm/amdgpu/mmhub4.1: fix IP version check Use the helper function rather than reading it directly. Reviewed-by: Yang Wang <kevinyang.wang@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `63bfd24088`) Cc: stable@vger.kernel.org	2024-12-18 13:01:37 -05:00
Alex Deucher	8c1ecc7197	drm/amdgpu/nbio7.11: fix IP version check Use the helper function rather than reading it directly. Reviewed-by: Yang Wang <kevinyang.wang@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `2c8eeaaa0f`) Cc: stable@vger.kernel.org	2024-12-18 13:01:31 -05:00
Alex Deucher	458600da79	drm/amdgpu/nbio7.7: fix IP version check Use the helper function rather than reading it directly. Reviewed-by: Yang Wang <kevinyang.wang@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `22b9555bc9`) Cc: stable@vger.kernel.org	2024-12-18 13:01:06 -05:00
Pierre-Eric Pelloux-Prayer	a93b1020eb	drm/amdgpu: don't access invalid sched Since `2320c9e6a7` ("drm/sched: memset() 'job' in drm_sched_job_init()") accessing job->base.sched can produce unexpected results as the initialisation of (*job)->base.sched done in amdgpu_job_alloc is overwritten by the memset. This commit fixes an issue when a CS would fail validation and would be rejected after job->num_ibs is incremented. In this case, amdgpu_ib_free(ring->adev, ...) will be called, which would crash the machine because the ring value is bogus. To fix this, pass a NULL pointer to amdgpu_ib_free(): we can do this because the device is actually not used in this function. The next commit will remove the ring argument completely. Fixes: `2320c9e6a7` ("drm/sched: memset() 'job' in drm_sched_job_init()") Signed-off-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `2ae520cb12`)	2024-12-18 12:57:38 -05:00
Mario Limonciello	536ae08d7b	drm/amd: Require CONFIG_HOTPLUG_PCI_PCIE for BOCO If the kernel hasn't been compiled with PCIe hotplug support this can lead to problems with dGPUs that use BOCO because they effectively drop off the bus. To prevent issues, disable BOCO support when compiled without PCIe hotplug. Reported-by: Gabriel Marcano <gabemarcano@yahoo.com> Closes: https://gitlab.freedesktop.org/drm/amd/-/issues/1707#note_2696862 Acked-by: Alex Deucher <alexander.deucher@amd.com> Link: https://lore.kernel.org/r/20241211155601.3585256-1-superm1@kernel.org Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> (cherry picked from commit `1ad5bdc28b`)	2024-12-18 12:56:49 -05:00
Linus Torvalds	37cb0c76ac	Merge tag 'hyperv-fixes-signed-20241217' of git://git.kernel.org/pub/scm/linux/kernel/git/hyperv/linux Pull hyperv fixes from Wei Liu: - Various fixes to Hyper-V tools in the kernel tree (Dexuan Cui, Olaf Hering, Vitaly Kuznetsov) - Fix a bug in the Hyper-V TSC page based sched_clock() (Naman Jain) - Two bug fixes in the Hyper-V utility functions (Michael Kelley) - Convert open-coded timeouts to secs_to_jiffies() in Hyper-V drivers (Easwar Hariharan) * tag 'hyperv-fixes-signed-20241217' of git://git.kernel.org/pub/scm/linux/kernel/git/hyperv/linux: tools/hv: reduce resource usage in hv_kvp_daemon tools/hv: add a .gitignore file tools/hv: reduce resouce usage in hv_get_dns_info helper hv/hv_kvp_daemon: Pass NIC name to hv_get_dns_info as well Drivers: hv: util: Avoid accessing a ringbuffer not initialized yet Drivers: hv: util: Don't force error code to ENODEV in util_probe() tools/hv: terminate fcopy daemon if read from uio fails drivers: hv: Convert open-coded timeouts to secs_to_jiffies() tools: hv: change permissions of NetworkManager configuration file x86/hyperv: Fix hv tsc page based sched_clock for hibernation tools: hv: Fix a complier warning in the fcopy uio daemon	2024-12-18 09:55:55 -08:00
Juergen Gross	349f0086ba	x86/static-call: fix 32-bit build In 32-bit x86 builds CONFIG_STATIC_CALL_INLINE isn't set, leading to static_call_initialized not being available. Define it as "0" in that case. Reported-by: Stephen Rothwell <sfr@canb.auug.org.au> Fixes: `0ef8047b73` ("x86/static-call: provide a way to do very early static-call updates") Signed-off-by: Juergen Gross <jgross@suse.com> Acked-by: Peter Zijlstra (Intel) <peterz@infradead.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>	2024-12-18 09:47:43 -08:00
Jon Lin	7f9a1eed1a	spi: rockchip-sfc: Fix error in remove progress Fix error in remove progress: [ 43.026148] Call trace: [ 43.026370] klist_next+0x1c/0x1d4 [ 43.026671] device_for_each_child+0x48/0xac [ 43.027049] spi_unregister_controller+0x30/0x130 [ 43.027469] rockchip_sfc_remove+0x48/0x80 [spi_rockchip_sfc] Signed-off-by: Jon Lin <jon.lin@rock-chips.com> Link: https://patch.msgid.link/20241218154741.901591-1-jon.lin@rock-chips.com Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-18 16:02:08 +00:00
Rafael J. Wysocki	05648c2f58	Merge tag 'amd-pstate-v6.13-2024-12-11' of ssh://gitolite.kernel.org/pub/scm/linux/kernel/git/superm1/linux Merge amd-pstate driver fixes for 6.13-rc4 from Mario Limonciello: "Fix a problem where systems without preferred cores were misdetecting preferred cores. Fix issues with boost numerator handling leading to inconsistently programmed CPPC max performance values." * tag 'amd-pstate-v6.13-2024-12-11' of ssh://gitolite.kernel.org/pub/scm/linux/kernel/git/superm1/linux: cpufreq/amd-pstate: Use boost numerator for upper bound of frequencies cpufreq/amd-pstate: Store the boost numerator as highest perf again cpufreq/amd-pstate: Detect preferred core support before driver registration	2024-12-18 15:38:22 +01:00
Ming Lei	85672ca9ce	block: avoid to reuse `hctx` not removed from cpuhp callback list If the 'hctx' isn't removed from cpuhp callback list, we can't reuse it, otherwise use-after-free may be triggered. Reported-by: kernel test robot <oliver.sang@intel.com> Closes: https://lore.kernel.org/oe-lkp/202412172217.b906db7c-lkp@intel.com Tested-by: kernel test robot <oliver.sang@intel.com> Fixes: `22465bbac5` ("blk-mq: move cpuhp callback registering out of q->sysfs_lock") Signed-off-by: Ming Lei <ming.lei@redhat.com> Link: https://lore.kernel.org/r/20241218101617.3275704-3-ming.lei@redhat.com Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-18 07:25:37 -07:00
Ming Lei	224749be6c	block: Revert "block: Fix potential deadlock while freezing queue and acquiring sysfs_lock" This reverts commit `be26ba9642`. Commit `be26ba9642` ("block: Fix potential deadlock while freezing queue and acquiring sysfs_lock") actually reverts commit `22465bbac5` ("blk-mq: move cpuhp callback registering out of q->sysfs_lock"), and causes the original resctrl lockdep warning. So revert it; the issue needs to be fixed in another way. Cc: Nilay Shroff <nilay@linux.ibm.com> Fixes: `be26ba9642` ("block: Fix potential deadlock while freezing queue and acquiring sysfs_lock") Signed-off-by: Ming Lei <ming.lei@redhat.com> Link: https://lore.kernel.org/r/20241218101617.3275704-2-ming.lei@redhat.com Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-18 07:25:37 -07:00
Luis Chamberlain	51588b1b77	nvme: use blk_validate_block_size() for max LBA check The block layer already has support to validate proper block sizes with blk_validate_block_size(), so we can leverage that as well. No functional changes. Signed-off-by: Luis Chamberlain <mcgrof@kernel.org> Reviewed-by: John Garry <john.g.garry@oracle.com> Reviewed-by: Keith Busch <kbusch@kernel.org> Reviewed-by: Christoph Hellwig <hch@lst.de> Link: https://lore.kernel.org/r/20241218020212.3657139-3-mcgrof@kernel.org Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-18 07:22:30 -07:00
Luis Chamberlain	26fff8a443	block/bdev: use helper for max block size check We already have a helper for checking the limits on the block size both low and high, just use that. No functional changes. Reviewed-by: John Garry <john.g.garry@oracle.com> Signed-off-by: Luis Chamberlain <mcgrof@kernel.org> Reviewed-by: Keith Busch <kbusch@kernel.org> Reviewed-by: Christoph Hellwig <hch@lst.de> Link: https://lore.kernel.org/r/20241218020212.3657139-2-mcgrof@kernel.org Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-18 07:22:30 -07:00
Shuming Fan	c9e3ebdc52	ASoC: rt722: add delay time to wait for the calibration procedure The calibration procedure needs some time to finish. This patch adds the delay time to ensure the calibration procedure is completed correctly. Signed-off-by: Shuming Fan <shumingf@realtek.com> Link: https://patch.msgid.link/20241218091307.96656-1-shumingf@realtek.com Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-18 14:17:32 +00:00
Daniel Lezcano	4feaedf7d2	thermal/thresholds: Fix boundaries and detection routine The current implementation does not work if the thermal zone is interrupt driven only. The boundaries are not correctly checked and computed as it happens only when the temperature is increasing or decreasing. The problem arises because the routine to detect when we cross a threshold is correlated with the computation of the boundaries. We assume we have to recompute the boundaries when a threshold is crossed but actually we should do that even if that is not the case. Mixing the boundaries computation and the threshold detection for the sake of optimizing the routine is much more complex than it appears intuitively and is prone to errors. This fix separates the boundaries computation and the threshold crossing detection into different routines. The result is code that is much simpler to understand, and thus easier to maintain. The drawback is that we browse the thresholds list several times, but we can consider that negligible because it happens only when the temperature is updated. There are certainly some areas to improve in the temperature update routine, but that would not be appropriate here, as this change aims to fix the thresholds for v6.13. Fixes: `445936f9e2` ("thermal: core: Add user thresholds support") Tested-by: Daniel Lezcano <daniel.lezcano@linaro.org> # rock5b, Lenovo x13s Signed-off-by: Daniel Lezcano <daniel.lezcano@linaro.org> Link: https://patch.msgid.link/20241216212644.1145122-1-daniel.lezcano@linaro.org Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>	2024-12-18 14:51:31 +01:00
Fabrice Gasnier	edc19bd0e5	pwm: stm32: Fix complementary output in round_waveform_tohw() When the timer supports complementary output, the CCxNE bit must be set additionally to the CCxE bit. So to not overwrite the latter use \|= instead of = to set the former. Fixes: `deaba9cff8` ("pwm: stm32: Implementation of the waveform callbacks") Signed-off-by: Fabrice Gasnier <fabrice.gasnier@foss.st.com> Link: https://lore.kernel.org/r/20241217150021.2030213-1-fabrice.gasnier@foss.st.com [ukleinek: Slightly improve commit log] Signed-off-by: Uwe Kleine-König <ukleinek@kernel.org>	2024-12-18 11:08:36 +01:00
Haren Myneni	05aa156e15	powerpc/pseries/vas: Add close() callback in vas_vm_ops struct The mapping VMA address is saved in VAS window struct when the paste address is mapped. This VMA address is used during migration to unmap the paste address if the window is active. The paste address mapping will be removed when the window is closed or with the munmap(). But the VMA address in the VAS window is not updated with munmap() which is causing invalid access during migration. The KASAN report shows: [16386.254991] BUG: KASAN: slab-use-after-free in reconfig_close_windows+0x1a0/0x4e8 [16386.255043] Read of size 8 at addr c00000014a819670 by task drmgr/696928 [16386.255096] CPU: 29 UID: 0 PID: 696928 Comm: drmgr Kdump: loaded Tainted: G B 6.11.0-rc5-nxgzip #2 [16386.255128] Tainted: [B]=BAD_PAGE [16386.255148] Hardware name: IBM,9080-HEX Power11 (architected) 0x820200 0xf000007 of:IBM,FW1110.00 (NH1110_016) hv:phyp pSeries [16386.255181] Call Trace: [16386.255202] [c00000016b297660] [c0000000018ad0ac] dump_stack_lvl+0x84/0xe8 (unreliable) [16386.255246] [c00000016b297690] [c0000000006e8a90] print_report+0x19c/0x764 [16386.255285] [c00000016b297760] [c0000000006e9490] kasan_report+0x128/0x1f8 [16386.255309] [c00000016b297880] [c0000000006eb5c8] __asan_load8+0xac/0xe0 [16386.255326] [c00000016b2978a0] [c00000000013f898] reconfig_close_windows+0x1a0/0x4e8 [16386.255343] [c00000016b297990] [c000000000140e58] vas_migration_handler+0x3a4/0x3fc [16386.255368] [c00000016b297a90] [c000000000128848] pseries_migrate_partition+0x4c/0x4c4 ... 
[16386.256136] Allocated by task 696554 on cpu 31 at 16377.277618s: [16386.256149] kasan_save_stack+0x34/0x68 [16386.256163] kasan_save_track+0x34/0x80 [16386.256175] kasan_save_alloc_info+0x58/0x74 [16386.256196] __kasan_slab_alloc+0xb8/0xdc [16386.256209] kmem_cache_alloc_noprof+0x200/0x3d0 [16386.256225] vm_area_alloc+0x44/0x150 [16386.256245] mmap_region+0x214/0x10c4 [16386.256265] do_mmap+0x5fc/0x750 [16386.256277] vm_mmap_pgoff+0x14c/0x24c [16386.256292] ksys_mmap_pgoff+0x20c/0x348 [16386.256303] sys_mmap+0xd0/0x160 ... [16386.256350] Freed by task 0 on cpu 31 at 16386.204848s: [16386.256363] kasan_save_stack+0x34/0x68 [16386.256374] kasan_save_track+0x34/0x80 [16386.256384] kasan_save_free_info+0x64/0x10c [16386.256396] __kasan_slab_free+0x120/0x204 [16386.256415] kmem_cache_free+0x128/0x450 [16386.256428] vm_area_free_rcu_cb+0xa8/0xd8 [16386.256441] rcu_do_batch+0x2c8/0xcf0 [16386.256458] rcu_core+0x378/0x3c4 [16386.256473] handle_softirqs+0x20c/0x60c [16386.256495] do_softirq_own_stack+0x6c/0x88 [16386.256509] do_softirq_own_stack+0x58/0x88 [16386.256521] __irq_exit_rcu+0x1a4/0x20c [16386.256533] irq_exit+0x20/0x38 [16386.256544] interrupt_async_exit_prepare.constprop.0+0x18/0x2c ... 
[16386.256717] Last potentially related work creation: [16386.256729] kasan_save_stack+0x34/0x68 [16386.256741] __kasan_record_aux_stack+0xcc/0x12c [16386.256753] __call_rcu_common.constprop.0+0x94/0xd04 [16386.256766] vm_area_free+0x28/0x3c [16386.256778] remove_vma+0xf4/0x114 [16386.256797] do_vmi_align_munmap.constprop.0+0x684/0x870 [16386.256811] __vm_munmap+0xe0/0x1f8 [16386.256821] sys_munmap+0x54/0x6c [16386.256830] system_call_exception+0x1a0/0x4a0 [16386.256841] system_call_vectored_common+0x15c/0x2ec [16386.256868] The buggy address belongs to the object at c00000014a819670 which belongs to the cache vm_area_struct of size 168 [16386.256887] The buggy address is located 0 bytes inside of freed 168-byte region [c00000014a819670, c00000014a819718) [16386.256915] The buggy address belongs to the physical page: [16386.256928] page: refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x14a81 [16386.256950] memcg:c0000000ba430001 [16386.256961] anon flags: 0x43ffff800000000(node=4\|zone=0\|lastcpupid=0x7ffff) [16386.256975] page_type: 0xfdffffff(slab) [16386.256990] raw: 043ffff800000000 c00000000501c080 0000000000000000 5deadbee00000001 [16386.257003] raw: 0000000000000000 00000000011a011a 00000001fdffffff c0000000ba430001 [16386.257018] page dumped because: kasan: bad access detected This patch adds close() callback in vas_vm_ops vm_operations_struct which will be executed during munmap() before freeing VMA. The VMA address in the VAS window is set to NULL after holding the window mmap_mutex. Fixes: `37e6764895` ("powerpc/pseries/vas: Add VAS migration handler") Signed-off-by: Haren Myneni <haren@linux.ibm.com> Signed-off-by: Madhavan Srinivasan <maddy@linux.ibm.com> Link: https://patch.msgid.link/20241214051758.997759-1-haren@linux.ibm.com	2024-12-18 14:03:41 +05:30
Marc Kleine-Budde	87f54c1219	Merge patch series "can: m_can: set init flag earlier in probe" This series fixes problems in the m_can_pci driver found on the Intel Elkhart Lake processor. Link: https://patch.msgid.link/e247f331cb72829fcbdfda74f31a59cbad1a6006.1728288535.git.matthias.schiffer@ew.tq-group.com Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>	2024-12-18 09:32:14 +01:00
Matthias Schiffer	743375f8de	can: m_can: fix missed interrupts with m_can_pci The interrupt line of PCI devices is interpreted as edge-triggered, however the interrupt signal of the m_can controller integrated in Intel Elkhart Lake CPUs appears to be generated level-triggered. Consider the following sequence of events: - IR register is read, interrupt X is set - A new interrupt Y is triggered in the m_can controller - IR register is written to acknowledge interrupt X. Y remains set in IR As there is no point in this sequence at which no interrupt flag is set in IR, the m_can interrupt line will never become deasserted, and no edge will ever be observed to trigger another run of the ISR. This was observed to result in the TX queue of the EHL m_can getting stuck under high load, because frames were queued to the hardware in m_can_start_xmit(), but m_can_finish_tx() was never run to account for their successful transmission. On an Elkhart Lake based board with the two CAN interfaces connected to each other, the following script can reproduce the issue: ip link set can0 up type can bitrate 1000000 ip link set can1 up type can bitrate 1000000 cangen can0 -g 2 -I 000 -L 8 & cangen can0 -g 2 -I 001 -L 8 & cangen can0 -g 2 -I 002 -L 8 & cangen can0 -g 2 -I 003 -L 8 & cangen can0 -g 2 -I 004 -L 8 & cangen can0 -g 2 -I 005 -L 8 & cangen can0 -g 2 -I 006 -L 8 & cangen can0 -g 2 -I 007 -L 8 & cangen can1 -g 2 -I 100 -L 8 & cangen can1 -g 2 -I 101 -L 8 & cangen can1 -g 2 -I 102 -L 8 & cangen can1 -g 2 -I 103 -L 8 & cangen can1 -g 2 -I 104 -L 8 & cangen can1 -g 2 -I 105 -L 8 & cangen can1 -g 2 -I 106 -L 8 & cangen can1 -g 2 -I 107 -L 8 & stress-ng --matrix 0 & To fix the issue, repeatedly read and acknowledge interrupts at the start of the ISR until no interrupt flags are set, so the next incoming interrupt will also result in an edge on the interrupt line.
While we have received a report that even with this patch, the TX queue can become stuck under certain (currently unknown) circumstances on the Elkhart Lake, this patch completely fixes the issue with the above reproducer, and it is unclear whether the remaining issue has a similar cause at all. Fixes: `cab7ffc032` ("can: m_can: add PCI glue driver for Intel Elkhart Lake") Signed-off-by: Matthias Schiffer <matthias.schiffer@ew.tq-group.com> Reviewed-by: Markus Schneider-Pargmann <msp@baylibre.com> Link: https://patch.msgid.link/fdf0439c51bcb3a46c21e9fb21c7f1d06363be84.1728288535.git.matthias.schiffer@ew.tq-group.com Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>	2024-12-18 09:30:52 +01:00
Matthias Schiffer	fca2977629	can: m_can: set init flag earlier in probe While an m_can controller usually already has the init flag from a hardware reset, no such reset happens on the integrated m_can_pci of the Intel Elkhart Lake. If the CAN controller is found in an active state, m_can_dev_setup() would fail because m_can_niso_supported() calls m_can_cccr_update_bits(), which refuses to modify any other configuration bits when CCCR_INIT is not set. To avoid this issue, set CCCR_INIT before attempting to modify any other configuration flags. Fixes: `cd5a46ce6f` ("can: m_can: don't enable transceiver when probing") Signed-off-by: Matthias Schiffer <matthias.schiffer@ew.tq-group.com> Reviewed-by: Markus Schneider-Pargmann <msp@baylibre.com> Link: https://patch.msgid.link/e247f331cb72829fcbdfda74f31a59cbad1a6006.1728288535.git.matthias.schiffer@ew.tq-group.com Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>	2024-12-18 09:30:52 +01:00
Kuniyuki Iwashima	954a2b4071	rtnetlink: Try the outer netns attribute in rtnl_get_peer_net(). Xiao Liang reported that the cited commit changed netns handling in newlink() of netkit, veth, and vxcan. Before the patch, if we didn't find a netns attribute in the peer device attributes, we tried to find another netns attribute in the outer netlink attributes by passing it to rtnl_link_get_net(). Let's restore the original behaviour. Fixes: `4832756676` ("rtnetlink: fix double call of rtnl_link_get_net_ifla()") Reported-by: Xiao Liang <shaw.leon@gmail.com> Closes: https://lore.kernel.org/netdev/CABAhCORBVVU8P6AHcEkENMj+gD2d3ce9t=A_o48E0yOQp8_wUQ@mail.gmail.com/#t Signed-off-by: Kuniyuki Iwashima <kuniyu@amazon.com> Tested-by: Xiao Liang <shaw.leon@gmail.com> Link: https://patch.msgid.link/20241216110432.51488-1-kuniyu@amazon.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-17 17:54:18 -08:00
Eric Dumazet	b9b8301d36	net: netdevsim: fix nsim_pp_hold_write() nsim_pp_hold_write() has two problems: 1) It may return with rtnl held, as found by syzbot. 2) Its return value does not propagate an error if any. Fixes: `1580cbcbfe` ("net: netdevsim: add some fake page pool use") Reported-by: syzbot <syzkaller@googlegroups.com> Signed-off-by: Eric Dumazet <edumazet@google.com> Reviewed-by: Simon Horman <horms@kernel.org> Link: https://patch.msgid.link/20241216083703.1859921-1-edumazet@google.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-17 17:46:17 -08:00
Andrea Righi	23579010cf	bpf: Fix bpf_get_smp_processor_id() on !CONFIG_SMP On x86-64 calling bpf_get_smp_processor_id() in a kernel with CONFIG_SMP disabled can trigger the following bug, as pcpu_hot is unavailable: [ 8.471774] BUG: unable to handle page fault for address: 00000000936a290c [ 8.471849] #PF: supervisor read access in kernel mode [ 8.471881] #PF: error_code(0x0000) - not-present page Fix by inlining a return 0 in the !CONFIG_SMP case. Fixes: `1ae6921009` ("bpf: inline bpf_get_smp_processor_id() helper") Signed-off-by: Andrea Righi <arighi@nvidia.com> Signed-off-by: Andrii Nakryiko <andrii@kernel.org> Link: https://lore.kernel.org/bpf/20241217195813.622568-1-arighi@nvidia.com	2024-12-17 16:09:24 -08:00
Nathan Chancellor	aef25be35d	hexagon: Disable constant extender optimization for LLVM prior to 19.1.0 The Hexagon-specific constant extender optimization in LLVM may crash on Linux kernel code [1], such as fs/bcachefs/btree_io.c after commit `32ed4a620c` ("bcachefs: Btree path tracepoints") in 6.12: clang: llvm/lib/Target/Hexagon/HexagonConstExtenders.cpp:745: bool (anonymous namespace)::HexagonConstExtenders::ExtRoot::operator<(const HCE::ExtRoot &) const: Assertion `ThisB->getParent() == OtherB->getParent()' failed. Stack dump: 0. Program arguments: clang --target=hexagon-linux-musl ... fs/bcachefs/btree_io.c 1. <eof> parser at end of file 2. Code generation 3. Running pass 'Function Pass Manager' on module 'fs/bcachefs/btree_io.c'. 4. Running pass 'Hexagon constant-extender optimization' on function '@__btree_node_lock_nopath' Without assertions enabled, there is just a hang during compilation. This has been resolved in LLVM main (20.0.0) [2] and backported to LLVM 19.1.0 but the kernel supports LLVM 13.0.1 and newer, so disable the constant extender optimization using the '-mllvm' option when using a toolchain that is not fixed. Cc: stable@vger.kernel.org Link: https://github.com/llvm/llvm-project/issues/99714 [1] Link: `68df06a0b2` [2] Link: `2ab8d93061` [3] Reviewed-by: Brian Cain <bcain@quicinc.com> Signed-off-by: Nathan Chancellor <nathan@kernel.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>	2024-12-17 14:07:53 -08:00
Joshua Hay	0c1683c681	idpf: trigger SW interrupt when exiting wb_on_itr mode There is a race condition between exiting wb_on_itr and completion write backs. For example, we are in wb_on_itr mode and a Tx completion is generated by HW, ready to be written back, as we are re-enabling interrupts: HW SW \| \| \| \| idpf_tx_splitq_clean_all \| \| napi_complete_done \| \| \| tx_completion_wb \| idpf_vport_intr_update_itr_ena_irq That tx_completion_wb happens before the vector is fully re-enabled. Continuing with this example, it is a UDP stream and the tx_completion_wb is the last one in the flow (there are no rx packets). Because the HW generated the completion before the interrupt is fully enabled, the HW will not fire the interrupt once the timer expires and the write back will not happen. NAPI poll won't be called. We have indicated we're back in interrupt mode but nothing else will trigger the interrupt. Therefore, the completion goes unprocessed, triggering a Tx timeout. To mitigate this, fire a SW triggered interrupt upon exiting wb_on_itr. This interrupt will catch the rogue completion and avoid the timeout. Add logic to set the appropriate bits in the vector's dyn_ctl register. Fixes: `9c4a27da0e` ("idpf: enable WB_ON_ITR") Reviewed-by: Madhu Chittim <madhu.chittim@intel.com> Signed-off-by: Joshua Hay <joshua.a.hay@intel.com> Tested-by: Krishneil Singh <krishneil.k.singh@intel.com> Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>	2024-12-17 14:00:26 -08:00
Olga Kornievskaia	9048cf05a1	NFSD: fix management of pending async copies Currently the pending_async_copies count is decremented just before a struct nfsd4_copy is destroyed. Since commit `aa0ebd21df` ("NFSD: Add nfsd4_copy time-to-live"), nfsd4_copy structures stick around for 10 lease periods after the COPY itself has completed, so the pending_async_copies count stays high for a long time. This causes NFSD to avoid the use of background copy even though the actual background copy workload might no longer be running. In this patch, decrement pending_async_copies once the async copy thread is done processing the copy work. Fixes: `aa0ebd21df` ("NFSD: Add nfsd4_copy time-to-live") Signed-off-by: Olga Kornievskaia <okorniev@redhat.com> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>	2024-12-17 16:35:53 -05:00
Joshua Hay	93433c1d91	idpf: add support for SW triggered interrupts SW triggered interrupts are guaranteed to fire after their timer expires, unlike Tx and Rx interrupts which will only fire after the timer expires _and_ a descriptor write back is available to be processed by the driver. Add the necessary fields, defines, and initializations to enable a SW triggered interrupt in the vector's dyn_ctl register. Reviewed-by: Madhu Chittim <madhu.chittim@intel.com> Signed-off-by: Joshua Hay <joshua.a.hay@intel.com> Tested-by: Krishneil Singh <krishneil.k.singh@intel.com> Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>	2024-12-17 13:28:55 -08:00
Willow Cunningham	058387d9c6	arm64: dts: broadcom: Fix L2 linesize for Raspberry Pi 5 Set the cache-line-size parameter of the L2 cache for each core to the correct value of 64 bytes. Previously, the L2 cache line size was incorrectly set to 128 bytes for the Broadcom BCM2712. This causes validation tests for the Performance Application Programming Interface (PAPI) tool to fail as they depend on sysfs accurately reporting cache line sizes. The correct value of 64 bytes is stated in the official documentation of the ARM Cortex-A76, which is linked in the comments of arm64/boot/dts/broadcom/bcm2712.dtsi as the source for cache-line-size. Fixes: `faa3381267` ("arm64: dts: broadcom: Add minimal support for Raspberry Pi 5") Signed-off-by: Willow Cunningham <willow.e.cunningham@maine.edu> Link: https://lore.kernel.org/r/20241007212954.214724-1-willow.e.cunningham@maine.edu Signed-off-by: Florian Fainelli <florian.fainelli@broadcom.com>	2024-12-17 11:03:22 -08:00
Qu Wenruo	dfb92681a1	btrfs: tree-checker: reject inline extent items with 0 ref count [BUG] There is a bug report in the mailing list where btrfs_run_delayed_refs() failed to drop the ref count for logical 25870311358464 num_bytes 2113536. The involved leaf dump looks like this: item 166 key (25870311358464 168 2113536) itemoff 10091 itemsize 50 extent refs 1 gen 84178 flags 1 ref#0: shared data backref parent 32399126528000 count 0 <<< ref#1: shared data backref parent 31808973717504 count 1 Notice the count number is 0. [CAUSE] There is no concrete evidence yet, but considering 0 -> 1 is also a single bit flipped, it's possible that hardware memory bitflip is involved, causing the on-disk extent tree to be corrupted. [FIX] To prevent us reading such corrupted extent item, or writing such damaged extent item back to disk, enhance the handling of BTRFS_EXTENT_DATA_REF_KEY and BTRFS_SHARED_DATA_REF_KEY keys for both inlined and key items, to detect such 0 ref count and reject them. CC: stable@vger.kernel.org # 5.4+ Link: https://lore.kernel.org/linux-btrfs/7c69dd49-c346-4806-86e7-e6f863a66f48@app.fastmail.com/ Reported-by: Frankie Fisher <frankie@terrorise.me.uk> Reviewed-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: Qu Wenruo <wqu@suse.com> Reviewed-by: David Sterba <dsterba@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-17 19:54:32 +01:00
Christoph Hellwig	be691b5e59	btrfs: split bios to the fs sector size boundary Btrfs, like other file systems, can't really deal with I/O not aligned to its internal block size (which, strangely, is called sector size in btrfs for historical reasons), but the block layer split helper doesn't even know about that. Round down the split boundary so that all I/Os are aligned. Fixes: `d5e4377d50` ("btrfs: split zone append bios in btrfs_submit_bio") CC: stable@vger.kernel.org # 6.12 Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: Damien Le Moal <dlemoal@kernel.org> Reviewed-by: David Sterba <dsterba@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-17 19:54:32 +01:00
Christoph Hellwig	6c3864e055	btrfs: use bio_is_zone_append() in the completion handler Otherwise it won't catch bios turned into regular writes by the block level zone write plugging. The additional test it adds is for emulated zone append. Fixes: `9b1ce7f0c6` ("block: Implement zone append emulation") CC: stable@vger.kernel.org # 6.12 Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Reviewed-by: Damien Le Moal <dlemoal@kernel.org> Signed-off-by: Christoph Hellwig <hch@lst.de> Reviewed-by: David Sterba <dsterba@suse.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-17 19:54:32 +01:00
Josef Bacik	d75d72a858	btrfs: fix improper generation check in snapshot delete We have been using the following check if (generation <= root->root_key.offset) to make decisions about whether or not to visit a node during snapshot delete. This is because for normal subvolumes this is set to 0, and for snapshots it's set to the creation generation. The idea being that if the generation of the node is less than or equal to our creation generation then we don't need to visit that node, because it doesn't belong to us, we can simply drop our reference and move on. However reloc roots don't have their generation stored in root->root_key.offset, instead that is the objectid of their corresponding fs root. This means we can incorrectly not walk into nodes that need to be dropped when deleting a reloc root. There are a variety of consequences to making the wrong choice in two distinct areas. visit_node_for_delete() 1. False positive. We think we are newer than the block when we really aren't. We don't visit the node and drop our reference to the node and carry on. This would result in leaked space. 2. False negative. We do decide to walk down into a block that we should have just dropped our reference to. However this means that the child node will have refs > 1, so we will switch to UPDATE_BACKREF, and then the subsequent walk_down_proc() will notice that btrfs_header_owner(node) != root->root_key.objectid and it'll break out of the loop, and then walk_up_proc() will drop our reference, so this appears to be ok. do_walk_down() 1. False positive. We are in UPDATE_BACKREF and incorrectly decide that we are done and don't need to update the backref for our lower nodes. This is another case that simply won't happen with relocation, as we only have to do UPDATE_BACKREF if the node below us was shared and didn't have FULL_BACKREF set, and since we don't own that node because we're a reloc root we actually won't end up in this case. 2. False negative. 
Again this is tricky because as described above, we simply wouldn't be here from relocation, because we don't own any of the nodes because we never set btrfs_header_owner() to the reloc root objectid, and we always use FULL_BACKREF, we never actually need to set FULL_BACKREF on any children. Having spent a lot of time stressing relocation/snapshot delete recently I've not seen this pop in practice. But this is objectively incorrect, so fix this to get the correct starting generation based on the root we're dropping to keep me from thinking there's a problem here. CC: stable@vger.kernel.org Reviewed-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: Josef Bacik <josef@toxicpanda.com> Signed-off-by: David Sterba <dsterba@suse.com>	2024-12-17 19:54:32 +01:00
Arnd Bergmann	2182e0f200	drm: rework FB_CORE dependency The 'select FB_CORE' statement moved from CONFIG_DRM to DRM_CLIENT_LIB, but there are now configurations that have code calling into fb_core as built-in even though the client_lib itself is a loadable module: x86_64-linux-ld: drivers/gpu/drm/drm_fb_helper.o: in function `drm_fb_helper_set_suspend': drm_fb_helper.c:(.text+0x2c6): undefined reference to `fb_set_suspend' x86_64-linux-ld: drivers/gpu/drm/drm_fb_helper.o: in function `drm_fb_helper_resume_worker': drm_fb_helper.c:(.text+0x2e1): undefined reference to `fb_set_suspend' In addition to DRM_CLIENT_LIB, the 'select' needs to be at least in DRM_KMS_HELPER and DRM_GEM_SHMEM_HELPER, so add it here. This patch is the KMS_HELPER part of [1]. Fixes: `dadd28d414` ("drm/client: Add client-lib module") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Reviewed-by: Thomas Zimmermann <tzimmermann@suse.de> Link: https://patchwork.freedesktop.org/series/141411/ # 1 Link: https://patchwork.freedesktop.org/patch/msgid/20241216074450.8590-4-tzimmermann@suse.de Signed-off-by: Thomas Zimmermann <tzimmermann@suse.de>	2024-12-17 18:28:43 +01:00
Linus Torvalds	5529876063	Merge tag 'ftrace-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace Pull ftrace fixes from Steven Rostedt: - Always try to initialize the idle functions when graph tracer starts A bug was found that when a CPU is offline when graph tracing starts and then comes online, that CPU is not traced. The fix to that was to move the initialization of the idle shadow stack over to the hot plug online logic, which also handle onlined CPUs. The issue was that it removed the initialization of the shadow stack when graph tracing starts, but the callbacks to the hot plug logic do nothing if graph tracing isn't currently running. Although that fix fixed the onlining of a CPU during tracing, it broke the CPUs that were already online. - Have microblaze not try to get the "true parent" in function tracing If function tracing and graph tracing are both enabled at the same time the parent of the functions traced by the function tracer may sometimes be the graph tracing trampoline. The graph tracing hijacks the return pointer of the function to trace it, but that can interfere with the function tracing parent output. This was fixed by using the ftrace_graph_ret_addr() function passing in the kernel stack pointer using the ftrace_regs_get_stack_pointer() function. But Al Viro reported that Microblaze does not implement the kernel_stack_pointer(regs) helper function that ftrace_regs_get_stack_pointer() uses and fails to compile when function graph tracing is enabled. It was first thought that this was a microblaze issue, but the real cause is that this only works when an architecture implements HAVE_DYNAMIC_FTRACE_WITH_ARGS, as a requirement for that config is to have ftrace always pass a valid ftrace_regs to the callbacks. 
That also means that the architecture supports ftrace_regs_get_stack_pointer() Microblaze does not set HAVE_DYNAMIC_FTRACE_WITH_ARGS nor does it implement ftrace_regs_get_stack_pointer() which caused it to fail to build. Only implement the "true parent" logic if an architecture has that config set" * tag 'ftrace-v6.13-rc3' of git://git.kernel.org/pub/scm/linux/kernel/git/trace/linux-trace: ftrace: Do not find "true_parent" if HAVE_DYNAMIC_FTRACE_WITH_ARGS is not set fgraph: Still initialize idle shadow stacks when starting	2024-12-17 09:14:31 -08:00
Linus Torvalds	a241d7f0d3	Merge tag 's390-6.13-3' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux Pull s390 fixes from Alexander Gordeev: - Fix DirectMap accounting in /proc/meminfo file - Fix strscpy() return code handling that led to "unsigned 'len' is never less than zero" warning - Fix the calculation determining whether to use three- or four-level paging: account KMSAN modules metadata * tag 's390-6.13-3' of git://git.kernel.org/pub/scm/linux/kernel/git/s390/linux: s390/mm: Consider KMSAN modules metadata for paging levels s390/ipl: Fix never less than zero warning s390/mm: Fix DirectMap accounting	2024-12-17 09:09:32 -08:00
Thomas Zimmermann	8ce35bf0ef	drm/fbdev: Select FB_CORE dependency for fbdev on DMA and TTM Select FB_CORE if GEM's DMA and TTM implementations support fbdev emulation. Fixes linker errors about missing symbols from the fbdev subsystem. Also see [1] for a related SHMEM fix. Fixes: `dadd28d414` ("drm/client: Add client-lib module") Signed-off-by: Thomas Zimmermann <tzimmermann@suse.de> Link: https://patchwork.freedesktop.org/series/141411/ # 1 Reviewed-by: Arnd Bergmann <arnd@arndb.de> Link: https://patchwork.freedesktop.org/patch/msgid/20241216074450.8590-3-tzimmermann@suse.de	2024-12-17 18:06:23 +01:00
Thomas Zimmermann	8fc38062be	fbdev: Fix recursive dependencies wrt BACKLIGHT_CLASS_DEVICE Do not select BACKLIGHT_CLASS_DEVICE from FB_BACKLIGHT. The latter only controls backlight support within fbdev core code and data structures. Make fbdev drivers depend on BACKLIGHT_CLASS_DEVICE and let users select it explicitly. Fixes warnings about recursive dependencies, such as error: recursive dependency detected! symbol BACKLIGHT_CLASS_DEVICE is selected by FB_BACKLIGHT symbol FB_BACKLIGHT is selected by FB_SH_MOBILE_LCDC symbol FB_SH_MOBILE_LCDC depends on FB_DEVICE symbol FB_DEVICE depends on FB_CORE symbol FB_CORE is selected by DRM_GEM_DMA_HELPER symbol DRM_GEM_DMA_HELPER is selected by DRM_PANEL_ILITEK_ILI9341 symbol DRM_PANEL_ILITEK_ILI9341 depends on BACKLIGHT_CLASS_DEVICE BACKLIGHT_CLASS_DEVICE is user-selectable, so making drivers adapt to it is the correct approach in any case. For most drivers, backlight support is also configurable separately. v3: - Select BACKLIGHT_CLASS_DEVICE in PowerMac defconfigs (Christophe) - Fix PMAC_BACKLIGHT module dependency corner cases (Christophe) v2: - s/BACKLIGHT_DEVICE_CLASS/BACKLIGHT_CLASS_DEVICE (Helge) - Fix fbdev driver-dependency corner case (Arnd) Signed-off-by: Thomas Zimmermann <tzimmermann@suse.de> Reviewed-by: Arnd Bergmann <arnd@arndb.de> Link: https://patchwork.freedesktop.org/patch/msgid/20241216074450.8590-2-tzimmermann@suse.de	2024-12-17 18:06:10 +01:00
Linus Torvalds	ed90ed56e4	Merge tag 'erofs-for-6.13-rc4-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/xiang/erofs Pull erofs fixes from Gao Xiang: "The first one fixes a syzbot UAF report caused by a commit introduced in this cycle, but it also addresses a longstanding memory leak. The second one resolves a PSI memstall mis-accounting issue. The remaining patches switch file-backed mounts to use buffered I/Os by default instead of direct I/Os, since the page cache of underlay files is typically valid and maybe even dirty. This change also aligns with the default policy of loopback devices. A mount option has been added to try to use direct I/Os explicitly. Summary: - Fix (pcluster) memory leak and (sbi) UAF after umounting - Fix a case of PSI memstall mis-accounting - Use buffered I/Os by default for file-backed mounts" * tag 'erofs-for-6.13-rc4-fixes' of git://git.kernel.org/pub/scm/linux/kernel/git/xiang/erofs: erofs: use buffered I/O for file-backed mounts by default erofs: reference `struct erofs_device_info` for erofs_map_dev erofs: use `struct erofs_device_info` for the primary device erofs: add erofs_sb_free() helper MAINTAINERS: erofs: update Yue Hu's email address erofs: fix PSI memstall accounting erofs: fix rare pcluster memory leak after unmounting	2024-12-17 09:04:42 -08:00
John Stultz	4a07791457	locking/rtmutex: Make sure we wake anything on the wake_q when we release the lock->wait_lock Bert reported seeing occasional boot hangs when running with PREEMPT_RT and bisected it down to commit `894d1b3db4` ("locking/mutex: Remove wakeups from under mutex::wait_lock"). It looks like I missed a few spots where we drop the wait_lock and potentially call into schedule without waking up the tasks on the wake_q structure. Since the tasks being woken are ww_mutex tasks, they need to be able to run to release the mutex and unblock the task that currently is planning to wake them. Thus we can deadlock. So make sure we wake the wake_q tasks when we unlock the wait_lock. Closes: https://lore.kernel.org/lkml/20241211182502.2915-1-spasswolf@web.de Fixes: `894d1b3db4` ("locking/mutex: Remove wakeups from under mutex::wait_lock") Reported-by: Bert Karwatzki <spasswolf@web.de> Signed-off-by: John Stultz <jstultz@google.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lkml.kernel.org/r/20241212222138.2400498-1-jstultz@google.com	2024-12-17 17:47:24 +01:00
Kan Liang	b8c3a2502a	perf/x86/intel/ds: Add PEBS format 6 The only difference between formats 5 and 6 is the new counters snapshotting group. Without the subsequent counters snapshotting enabling patches, it's impossible to utilize the feature in a PEBS record. It's safe to share the same code path with format 5. Add format 6, so the end user can at least utilize the legacy PEBS features. Fixes: `a932aa0e86` ("perf/x86: Add Lunar Lake and Arrow Lake support") Signed-off-by: Kan Liang <kan.liang@linux.intel.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: stable@vger.kernel.org Link: https://lore.kernel.org/r/20241216204505.748363-1-kan.liang@linux.intel.com	2024-12-17 17:47:23 +01:00
Kan Liang	b6ccddd6fe	perf/x86/intel/uncore: Add Clearwater Forest support From the perspective of the uncore PMU, the Clearwater Forest is the same as the previous Sierra Forest. The only difference is the event list, which will be supported in the perf tool later. Signed-off-by: Kan Liang <kan.liang@linux.intel.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Link: https://lkml.kernel.org/r/20241211161146.235253-1-kan.liang@linux.intel.com	2024-12-17 17:47:23 +01:00
Linus Torvalds	1f13c38a85	Merge tag 'hardening-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux Pull hardening fix from Kees Cook: "Silence a GCC value-range warning that is being ironically triggered by bounds checking" * tag 'hardening-v6.13-rc4' of git://git.kernel.org/pub/scm/linux/kernel/git/kees/linux: fortify: Hide run-time copy size from value range tracking	2024-12-17 08:45:40 -08:00
Steven Rostedt	afd2627f72	tracing: Check "%s" dereference via the field and not the TP_printk format The TP_printk() portion of a trace event is executed at the time an event is read from the trace. This can happen seconds, minutes, hours, days, months, or possibly years after the event was recorded. If the print format contains a dereference to a string via "%s", and that string was allocated, there's a chance that string could be freed before it is read by the trace file. To protect against such bugs, there are two functions that verify the event. The first one is test_event_printk(), which is called when the event is created. It reads the TP_printk() format as well as its arguments to make sure nothing may be dereferencing a pointer that was not copied into the ring buffer along with the event. If it is, it will trigger a WARN_ON(). For strings that use "%s", it is not so easy. The string may not reside in the ring buffer but may still be valid. Strings that are static and part of the kernel proper, which will not be freed for the life of the running system, are safe to dereference. But whether it is a pointer to a static string or to something on the heap cannot be determined until the event is triggered. This brings us to the second function that tests for the bad dereferencing of strings, trace_check_vprintf(). It would walk through the printf format looking for "%s", and when it finds it, it would validate that the pointer is safe to read. If not, it would produce a WARN_ON() as well and write "[UNSAFE-MEMORY]" into the ring buffer. The problem with this is how it used va_list to have vsnprintf() handle all the cases that it didn't need to check. Instead of re-implementing vsnprintf(), it would make a copy of the format up to the %s part, and call vsnprintf() with the current va_list ap variable, where the ap would then be ready to point at the string in question. For architectures that passed va_list by reference this was possible. 
For architectures that passed it by copy it was not. A test_can_verify() function was used to differentiate between the two, and if it wasn't possible, it would disable it. Even for architectures where this was feasible, it was a stretch to rely on such a method that is undocumented, and could cause issues later on with new optimizations of the compiler. Instead, the first function test_event_printk() was updated to look at "%s" as well. If the "%s" argument is a pointer outside the event in the ring buffer, it would find the field type of the event that is the problem and mark the structure with a new flag called "needs_test". The event itself will be marked by TRACE_EVENT_FL_TEST_STR to let it be known that this event has a field that needs to be verified before the event can be printed using the printf format. When the event fields are created from the field type structure, the fields would copy the field type's "needs_test" value. Finally, before being printed, a new function ignore_event() is called which will check if the event has the TEST_STR flag set (if not, it returns false). If the flag is set, it then iterates through the events fields looking for the ones that have the "needs_test" flag set. Then it uses the offset field from the field structure to find the pointer in the ring buffer event. It runs the tests to make sure that pointer is safe to print and if not, it triggers the WARN_ON() and also adds to the trace output that the event in question has an unsafe memory access. The ignore_event() makes the trace_check_vprintf() obsolete so it is removed. 
Link: https://lore.kernel.org/all/CAHk-=wh3uOnqnZPpR0PeLZZtyWbZLboZ7cHLCKRWsocvs9Y7hQ@mail.gmail.com/ Cc: stable@vger.kernel.org Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Mark Rutland <mark.rutland@arm.com> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Al Viro <viro@ZenIV.linux.org.uk> Cc: Linus Torvalds <torvalds@linux-foundation.org> Link: https://lore.kernel.org/20241217024720.848621576@goodmis.org Fixes: `5013f454a3` ("tracing: Add check of trace event print fmts for dereferencing pointers") Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-17 11:40:11 -05:00
Steven Rostedt	65a25d9f7a	tracing: Add "%s" check in test_event_printk() The test_event_printk() code makes sure that when a trace event is registered, any dereferenced pointers from the event's TP_printk() are pointing to content in the ring buffer. But currently it does not handle "%s", as there are cases where the string pointer saved in the ring buffer points to a static string in the kernel that will never be freed. As that is a valid case, the pointer needs to be checked at runtime. Currently the runtime check is done via trace_check_vprintf(), but to not have to replicate everything in vsnprintf() it does some logic with the va_list that may not be reliable across architectures. In order to get rid of that logic, more work in the test_event_printk() needs to be done. Some of the strings can be validated at this time when it is obvious the string is valid because the string will be saved in the ring buffer content. Do all the validation of strings in the ring buffer at boot in test_event_printk(), and make sure that the field of the strings that point into the kernel are accessible. This will allow adding checks at runtime that will validate the fields themselves and not rely on parsing the TP_printk() format at runtime. Cc: stable@vger.kernel.org Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Mark Rutland <mark.rutland@arm.com> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Al Viro <viro@ZenIV.linux.org.uk> Cc: Linus Torvalds <torvalds@linux-foundation.org> Link: https://lore.kernel.org/20241217024720.685917008@goodmis.org Fixes: `5013f454a3` ("tracing: Add check of trace event print fmts for dereferencing pointers") Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-17 11:40:11 -05:00
Steven Rostedt	917110481f	tracing: Add missing helper functions in event pointer dereference check The process_pointer() helper function looks to see if various trace event macros are used. These macros are for storing data in the event. This makes it safe to dereference as the dereference will then point into the event on the ring buffer where the content of the data stays with the event itself. A few helper functions were missing. Those were: __get_rel_dynamic_array() __get_dynamic_array_len() __get_rel_dynamic_array_len() __get_rel_sockaddr() Also add a helper function find_print_string() to not need to use a middle man variable to test if the string exists. Cc: stable@vger.kernel.org Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Mark Rutland <mark.rutland@arm.com> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Al Viro <viro@ZenIV.linux.org.uk> Cc: Linus Torvalds <torvalds@linux-foundation.org> Link: https://lore.kernel.org/20241217024720.521836792@goodmis.org Fixes: `5013f454a3` ("tracing: Add check of trace event print fmts for dereferencing pointers") Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-17 11:40:11 -05:00
Steven Rostedt	a6629626c5	tracing: Fix test_event_printk() to process entire print argument The test_event_printk() analyzes print formats of trace events looking for cases where it may dereference a pointer that is not in the ring buffer which can possibly be a bug when the trace event is read from the ring buffer and the content of that pointer no longer exists. The function needs to accurately go from one print format argument to the next. It handles quotes and parenthesis that may be included in an argument. When it finds the start of the next argument, it uses a simple "c = strstr(fmt + i, ',')" to find the end of that argument! In order to include "%s" dereferencing, it needs to process the entire content of the print format argument and not just the content of the first ',' it finds. As there may be content like: ({ const char saved_ptr = trace_seq_buffer_ptr(p); static const char access_str[] = { "---", "--x", "w--", "w-x", "-u-", "-ux", "wu-", "wux" }; union kvm_mmu_page_role role; role.word = REC->role; trace_seq_printf(p, "sp gen %u gfn %llx l%u %u-byte q%u%s %s%s" " %snxe %sad root %u %s%c", REC->mmu_valid_gen, REC->gfn, role.level, role.has_4_byte_gpte ? 4 : 8, role.quadrant, role.direct ? " direct" : "", access_str[role.access], role.invalid ? " invalid" : "", role.efer_nx ? "" : "!", role.ad_disabled ? "!" : "", REC->root_count, REC->unsync ? "unsync" : "sync", 0); saved_ptr; }) Which is an example of a full argument of an existing event. As the code already handles finding the next print format argument, process the argument at the end of it and not the start of it. This way it has both the start of the argument as well as the end of it. Add a helper function "process_pointer()" that will do the processing during the loop as well as at the end. It also makes the code cleaner and easier to read. 
Cc: stable@vger.kernel.org Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Mark Rutland <mark.rutland@arm.com> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Al Viro <viro@ZenIV.linux.org.uk> Cc: Linus Torvalds <torvalds@linux-foundation.org> Link: https://lore.kernel.org/20241217024720.362271189@goodmis.org Fixes: `5013f454a3` ("tracing: Add check of trace event print fmts for dereferencing pointers") Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-17 11:40:11 -05:00
Linus Torvalds	59dbb9d81a	Merge tag 'xsa465+xsa466-6.13-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/xen/tip Pull xen fixes from Juergen Gross: "Fix xen netfront crash (XSA-465) and avoid using the hypercall page that doesn't do speculation mitigations (XSA-466)" * tag 'xsa465+xsa466-6.13-tag' of git://git.kernel.org/pub/scm/linux/kernel/git/xen/tip: x86/xen: remove hypercall page x86/xen: use new hypercall functions instead of hypercall page x86/xen: add central hypercall functions x86/xen: don't do PV iret hypercall through hypercall page x86/static-call: provide a way to do very early static-call updates objtool/x86: allow syscall instruction x86: make get_cpu_vendor() accessible from Xen code xen/netfront: fix crash when removing device	2024-12-17 08:29:58 -08:00
Zhang Kunbo	bedb4e6088	fs/nfs: fix missing declaration of nfs_idmap_cache_timeout fs/nfs/super.c should include fs/nfs/nfs4idmap.h for declaration of nfs_idmap_cache_timeout. This fixes the sparse warning: fs/nfs/super.c:1397:14: warning: symbol 'nfs_idmap_cache_timeout' was not declared. Should it be static? Signed-off-by: Zhang Kunbo <zhangkunbo@huawei.com> Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>	2024-12-17 11:14:20 -05:00
Trond Myklebust	62e2a47cea	NFS/pnfs: Fix a live lock between recalled layouts and layoutget When the server is recalling a layout, we should ignore the count of outstanding layoutget calls, since the server is expected to return either NFS4ERR_RECALLCONFLICT or NFS4ERR_RETURNCONFLICT for as long as the recall is outstanding. Currently, we may end up livelocking, causing the layout to eventually be forcibly revoked. Fixes: `bf0291dd22` ("pNFS: Ensure LAYOUTGET and LAYOUTRETURN are properly serialised") Cc: stable@vger.kernel.org Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>	2024-12-17 11:10:55 -05:00
Jens Axboe	020b40f356	io_uring: make ctx->timeout_lock a raw spinlock Chase reports that their tester complaints about a locking context mismatch: ============================= [ BUG: Invalid wait context ] 6.13.0-rc1-gf137f14b7ccb-dirty #9 Not tainted ----------------------------- syz.1.25198/182604 is trying to lock: ffff88805e66a358 (&ctx->timeout_lock){-.-.}-{3:3}, at: spin_lock_irq include/linux/spinlock.h:376 [inline] ffff88805e66a358 (&ctx->timeout_lock){-.-.}-{3:3}, at: io_match_task_safe io_uring/io_uring.c:218 [inline] ffff88805e66a358 (&ctx->timeout_lock){-.-.}-{3:3}, at: io_match_task_safe+0x187/0x250 io_uring/io_uring.c:204 other info that might help us debug this: context-{5:5} 1 lock held by syz.1.25198/182604: #0: ffff88802b7d48c0 (&acct->lock){+.+.}-{2:2}, at: io_acct_cancel_pending_work+0x2d/0x6b0 io_uring/io-wq.c:1049 stack backtrace: CPU: 0 UID: 0 PID: 182604 Comm: syz.1.25198 Not tainted 6.13.0-rc1-gf137f14b7ccb-dirty #9 Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.15.0-1 04/01/2014 Call Trace: <TASK> __dump_stack lib/dump_stack.c:94 [inline] dump_stack_lvl+0x82/0xd0 lib/dump_stack.c:120 print_lock_invalid_wait_context kernel/locking/lockdep.c:4826 [inline] check_wait_context kernel/locking/lockdep.c:4898 [inline] __lock_acquire+0x883/0x3c80 kernel/locking/lockdep.c:5176 lock_acquire.part.0+0x11b/0x370 kernel/locking/lockdep.c:5849 __raw_spin_lock_irq include/linux/spinlock_api_smp.h:119 [inline] _raw_spin_lock_irq+0x36/0x50 kernel/locking/spinlock.c:170 spin_lock_irq include/linux/spinlock.h:376 [inline] io_match_task_safe io_uring/io_uring.c:218 [inline] io_match_task_safe+0x187/0x250 io_uring/io_uring.c:204 io_acct_cancel_pending_work+0xb8/0x6b0 io_uring/io-wq.c:1052 io_wq_cancel_pending_work io_uring/io-wq.c:1074 [inline] io_wq_cancel_cb+0xb0/0x390 io_uring/io-wq.c:1112 io_uring_try_cancel_requests+0x15e/0xd70 io_uring/io_uring.c:3062 io_uring_cancel_generic+0x6ec/0x8c0 io_uring/io_uring.c:3140 io_uring_files_cancel 
include/linux/io_uring.h:20 [inline] do_exit+0x494/0x27a0 kernel/exit.c:894 do_group_exit+0xb3/0x250 kernel/exit.c:1087 get_signal+0x1d77/0x1ef0 kernel/signal.c:3017 arch_do_signal_or_restart+0x79/0x5b0 arch/x86/kernel/signal.c:337 exit_to_user_mode_loop kernel/entry/common.c:111 [inline] exit_to_user_mode_prepare include/linux/entry-common.h:329 [inline] __syscall_exit_to_user_mode_work kernel/entry/common.c:207 [inline] syscall_exit_to_user_mode+0x150/0x2a0 kernel/entry/common.c:218 do_syscall_64+0xd8/0x250 arch/x86/entry/common.c:89 entry_SYSCALL_64_after_hwframe+0x77/0x7f which is because io_uring has ctx->timeout_lock nesting inside the io-wq acct lock, the latter of which is used from inside the scheduler and hence is a raw spinlock, while the former is a "normal" spinlock and can hence be sleeping on PREEMPT_RT. Change ctx->timeout_lock to be a raw spinlock to solve this nesting dependency on PREEMPT_RT=y. Reported-by: chase xd <sl1589472800@gmail.com> Signed-off-by: Jens Axboe <axboe@kernel.dk>	2024-12-17 08:21:46 -07:00
Yang Erkun	69d803c40e	nfsd: Revert "nfsd: release svc_expkey/svc_export with rcu_work" This reverts commit `f8c989a0c8`. Before that commit, svc_export_put or expkey_put called path_put synchronously. After that commit, path_put is called asynchronously, which can lead to unexpected results, as shown below. mkfs.xfs -f /dev/sda echo "/ (rw,no_root_squash,fsid=0)" > /etc/exports echo "/mnt (rw,no_root_squash,fsid=1)" >> /etc/exports exportfs -ra service nfs-server start mount -t nfs -o vers=4.0 127.0.0.1:/mnt /mnt1 mount /dev/sda /mnt/sda touch /mnt1/sda/file exportfs -r umount /mnt/sda # fails unexpectedly The touch will finally call nfsd_cross_mnt, add a refcount to the mount, and then add a cache_head. Before that commit, exportfs -r called cache_flush to clean up all cache_head entries, and path_put in svc_export_put/expkey_put finished synchronously, so the later umount always succeeded. After that commit, path_put is called asynchronously, so the later umount may fail; if we add some delay, umount succeeds too. Personally I think this is a bug and should be fixed. We first revert the earlier bugfix patch, and then fix the original bug in a different way. Fixes: `f8c989a0c8` ("nfsd: release svc_expkey/svc_export with rcu_work") Signed-off-by: Yang Erkun <yangerkun@huawei.com> Signed-off-by: Chuck Lever <chuck.lever@oracle.com>	2024-12-17 09:45:23 -05:00
Gianfranco Trad	7ed2d91588	qed: fix possible uninit pointer read in qed_mcp_nvm_info_populate() Coverity reports an uninit pointer read in qed_mcp_nvm_info_populate(). If EOPNOTSUPP is returned from qed_mcp_bist_nvm_get_num_images() ensure nvm_info.num_images is set to 0 to avoid possible uninit assignment to p_hwfn->nvm_info.image_att later on in out label. Closes: https://scan5.scan.coverity.com/#/project-view/63204/10063?selectedIssue=1636666 Suggested-by: Simon Horman <horms@kernel.org> Signed-off-by: Gianfranco Trad <gianf.trad@gmail.com> Reviewed-by: Simon Horman <horms@kernel.org> Link: https://patch.msgid.link/20241215011733.351325-2-gianf.trad@gmail.com Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-17 15:08:52 +01:00
Peter Ujfalusi	e8d0ba147d	ASoC: SOF: Intel: hda-dai: Do not release the link DMA on STOP The linkDMA should not be released on stop trigger since a stream re-start might happen without closing the stream. This leaves a short time for other streams to 'steal' the linkDMA since it has been released. This issue is not easy to reproduce under normal conditions as usually after stop the stream is closed, or the same stream is restarted, but if another stream got in between the stop and start, like this: aplay -Dhw:0,3 -c2 -r48000 -fS32_LE /dev/zero -d 120 CTRL+z aplay -Dhw:0,0 -c2 -r48000 -fS32_LE /dev/zero -d 120 then the link DMA channels will be mixed up, resulting in a firmware error or crash. Fixes: `ab5593793e` ("ASoC: SOF: Intel: hda: Always clean up link DMA during stop") Cc: stable@vger.kernel.org Closes: https://github.com/thesofproject/sof/issues/9695 Signed-off-by: Peter Ujfalusi <peter.ujfalusi@linux.intel.com> Reviewed-by: Ranjani Sridharan <ranjani.sridharan@linux.intel.com> Reviewed-by: Liam Girdwood <liam.r.girdwood@intel.com> Reviewed-by: Bard Liao <yung-chuan.liao@linux.intel.com> Link: https://patch.msgid.link/20241217091019.31798-1-peter.ujfalusi@linux.intel.com Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-17 13:21:10 +00:00
Joe Hattori	0cb2c504d7	net: ethernet: bgmac-platform: fix an OF node reference leak The OF node obtained by of_parse_phandle() is not freed. Call of_node_put() to balance the refcount. This bug was found by an experimental static analysis tool that I am developing. Fixes: `1676aba5ef` ("net: ethernet: bgmac: device tree phy enablement") Signed-off-by: Joe Hattori <joe@pf.is.s.u-tokyo.ac.jp> Reviewed-by: Simon Horman <horms@kernel.org> Link: https://patch.msgid.link/20241214014912.2810315-1-joe@pf.is.s.u-tokyo.ac.jp Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-17 13:22:05 +01:00
Paolo Abeni	90d130aadc	Merge branch 'fixes-on-the-open-alliance-tc6-10base-t1x-mac-phy-support-generic-lib' Parthiban Veerasooran says: ==================== Fixes on the OPEN Alliance TC6 10BASE-T1x MAC-PHY support generic lib This patch series contains the below fixes. - Infinite loop error when tx credits become 0. - Race condition between tx skb reference pointers. v2: - Added mutex lock to protect tx skb reference handling. v3: - Added mutex protection in assigning new tx skb to waiting_tx_skb pointer. - Explained the possible scenario for the race condition with the time diagram in the commit message. v4: - Replaced mutex with spin_lock_bh() variants as the start_xmit runs in BH/softirq context which can't take sleeping locks. ==================== Link: https://patch.msgid.link/20241213123159.439739-1-parthiban.veerasooran@microchip.com Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-17 13:11:42 +01:00
Parthiban Veerasooran	e592b5110b	net: ethernet: oa_tc6: fix tx skb race condition between reference pointers There are two skb pointers to manage tx skb's enqueued from n/w stack. waiting_tx_skb pointer points to the tx skb which needs to be processed and ongoing_tx_skb pointer points to the tx skb which is being processed. SPI thread prepares the tx data chunks from the tx skb pointed by the ongoing_tx_skb pointer. When the tx skb pointed by the ongoing_tx_skb is processed, the tx skb pointed by the waiting_tx_skb is assigned to ongoing_tx_skb and the waiting_tx_skb pointer is assigned with NULL. Whenever there is a new tx skb from n/w stack, it will be assigned to waiting_tx_skb pointer if it is NULL. Enqueuing and processing of a tx skb are handled in two different threads. Consider a scenario where the SPI thread processed an ongoing_tx_skb and it moves next tx skb from waiting_tx_skb pointer to ongoing_tx_skb pointer without doing any NULL check. At this time, if the waiting_tx_skb pointer is NULL then ongoing_tx_skb pointer is also assigned with NULL. After that, if a new tx skb is assigned to waiting_tx_skb pointer by the n/w stack, there is a chance that the SPI thread overwrites the tx skb pointer with NULL. Finally one of the tx skbs will be left unhandled, resulting in a missing packet and a memory leak. - Consider the below scenario where the TXC reported from the previous transfer is 10 and ongoing_tx_skb holds a tx ethernet frame which can be transported in 20 TXCs and waiting_tx_skb is still NULL. tx_credits = 10; /* 21 are filled in the previous transfer */ ongoing_tx_skb = 20; waiting_tx_skb = NULL; /* Still NULL */ - So, (tc6->ongoing_tx_skb \|\| tc6->waiting_tx_skb) becomes true. - After oa_tc6_prepare_spi_tx_buf_for_tx_skbs() ongoing_tx_skb = 10; waiting_tx_skb = NULL; /* Still NULL */ - Perform SPI transfer. - Process SPI rx buffer to get the TXC from footers. 
- Now let's assume previously filled 21 TXCs are freed so we are good to transport the next remaining 10 tx chunks from ongoing_tx_skb. tx_credits = 21; ongoing_tx_skb = 10; waiting_tx_skb = NULL; - So, (tc6->ongoing_tx_skb \|\| tc6->waiting_tx_skb) becomes true again. - In the oa_tc6_prepare_spi_tx_buf_for_tx_skbs() ongoing_tx_skb = NULL; waiting_tx_skb = NULL; - Now the below bad case might happen, Thread1 (oa_tc6_start_xmit) Thread2 (oa_tc6_spi_thread_handler) --------------------------- ----------------------------------- - if waiting_tx_skb is NULL - if ongoing_tx_skb is NULL - ongoing_tx_skb = waiting_tx_skb - waiting_tx_skb = skb - waiting_tx_skb = NULL ... - ongoing_tx_skb = NULL - if waiting_tx_skb is NULL - waiting_tx_skb = skb To overcome the above issue, protect the moving of tx skb reference from waiting_tx_skb pointer to ongoing_tx_skb pointer and assigning new tx skb to waiting_tx_skb pointer, so that the other thread can't access the waiting_tx_skb pointer until the current thread completes moving the tx skb reference safely. Fixes: `53fbde8ab2` ("net: ethernet: oa_tc6: implement transmit path to transfer tx ethernet frames") Signed-off-by: Parthiban Veerasooran <parthiban.veerasooran@microchip.com> Reviewed-by: Larysa Zaremba <larysa.zaremba@intel.com> Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-17 13:11:22 +01:00
Parthiban Veerasooran	7d2f320e12	net: ethernet: oa_tc6: fix infinite loop error when tx credits becomes 0 SPI thread wakes up to perform SPI transfer whenever there is a TX skb from n/w stack or an interrupt from MAC-PHY. Ethernet frame from TX skb is transferred based on the availability of tx credits in the MAC-PHY, which is reported from the previous SPI transfer. Sometimes a TX skb is available to transmit but there are no tx credits from MAC-PHY. In this case, there will not be any SPI transfer but the thread will be running in an endless loop until tx credits are available again. So checking the availability of tx credits along with TX skb will prevent the above infinite loop. When the tx credits are available again, that will be notified through an interrupt which will trigger the SPI transfer to get the available tx credits. Fixes: `53fbde8ab2` ("net: ethernet: oa_tc6: implement transmit path to transfer tx ethernet frames") Reviewed-by: Jacob Keller <jacob.e.keller@intel.com> Signed-off-by: Parthiban Veerasooran <parthiban.veerasooran@microchip.com> Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-17 13:11:22 +01:00
Niklas Neronin	b9252f80b8	usb: xhci: fix ring expansion regression in 6.13-rc1 The source and destination rings were incorrectly assigned during the ring linking process. The "source" ring, which contains the new segments, was not spliced into the "destination" ring, leading to incorrect ring expansion. Fixes: `fe688e5006` ("usb: xhci: refactor xhci_link_rings() to use source and destination rings") Reported-by: Jeff Chua <jeff.chua.linux@gmail.com> Closes: https://lore.kernel.org/lkml/CAAJw_ZtppNqC9XA=-WVQDr+vaAS=di7jo15CzSqONeX48H75MA@mail.gmail.com/ Signed-off-by: Niklas Neronin <niklas.neronin@linux.intel.com> Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com> Link: https://lore.kernel.org/r/20241217102122.2316814-3-mathias.nyman@linux.intel.com Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2024-12-17 11:59:09 +01:00
Mathias Nyman	e21ebe51af	xhci: Turn NEC specific quirk for handling Stop Endpoint errors generic xHC hosts from several vendors have the same issue where endpoints start so slowly that a later queued 'Stop Endpoint' command may complete before endpoint is up and running. The 'Stop Endpoint' command fails with context state error as the endpoint still appears as stopped. See commit `42b7581376` ("usb: xhci: Limit Stop Endpoint retries") for details CC: stable@vger.kernel.org Signed-off-by: Mathias Nyman <mathias.nyman@linux.intel.com> Link: https://lore.kernel.org/r/20241217102122.2316814-2-mathias.nyman@linux.intel.com Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>	2024-12-17 11:59:09 +01:00
Umesh Nerlige Ramappa	1622ed27d2	i915/guc: Accumulate active runtime on gt reset On gt reset, if a context is running, then accumulate its active time into the busyness counter since there will be no chance for the context to switch out and update its run time. v2: Move comment right above the if (John) Fixes: `77cdd054dd` ("drm/i915/pmu: Connect engine busyness stats from GuC to pmu") Signed-off-by: Umesh Nerlige Ramappa <umesh.nerlige.ramappa@intel.com> Reviewed-by: John Harrison <John.C.Harrison@Intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241127174006.190128-4-umesh.nerlige.ramappa@intel.com (cherry picked from commit `7ed047da59`) Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>	2024-12-17 10:15:15 +00:00
Umesh Nerlige Ramappa	59a0b46788	i915/guc: Ensure busyness counter increases motonically Active busyness of an engine is calculated using gt timestamp and the context switch in time. While capturing the gt timestamp, it's possible that the context switches out. This race could result in an active busyness value that is greater than the actual context runtime value by a small amount. This leads to a negative delta and throws off busyness calculations for the user. If a subsequent count is smaller than the previous one, just return the previous one, since we expect the busyness to catch up. Fixes: `77cdd054dd` ("drm/i915/pmu: Connect engine busyness stats from GuC to pmu") Signed-off-by: Umesh Nerlige Ramappa <umesh.nerlige.ramappa@intel.com> Reviewed-by: John Harrison <John.C.Harrison@Intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241127174006.190128-3-umesh.nerlige.ramappa@intel.com (cherry picked from commit `cf907f6d29`) Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>	2024-12-17 10:15:10 +00:00
Umesh Nerlige Ramappa	abcc2ddae5	i915/guc: Reset engine utilization buffer before registration On GT reset, we store total busyness counts for all engines and re-register the utilization buffer with GuC. At that time we should reset the buffer, so that we don't get spurious busyness counts on subsequent queries. To repro this issue, run igt@perf_pmu@busy-hang followed by igt@perf_pmu@most-busy-idle-check-all for a couple iterations. Fixes: `77cdd054dd` ("drm/i915/pmu: Connect engine busyness stats from GuC to pmu") Signed-off-by: Umesh Nerlige Ramappa <umesh.nerlige.ramappa@intel.com> Reviewed-by: John Harrison <John.C.Harrison@Intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241127174006.190128-2-umesh.nerlige.ramappa@intel.com (cherry picked from commit `abd318237f`) Signed-off-by: Tvrtko Ursulin <tursulin@ursulin.net>	2024-12-17 10:15:03 +00:00
FUJITA Tomonori	94901b7a74	rust: net::phy fix module autoloading The alias symbol name was renamed. Adjust module_phy_driver macro to create the proper symbol name to fix module autoloading. Fixes: `054a9cd395` ("modpost: rename alias symbol for MODULE_DEVICE_TABLE()") Signed-off-by: FUJITA Tomonori <fujita.tomonori@gmail.com> Link: https://patch.msgid.link/20241212130015.238863-1-fujita.tomonori@gmail.com Signed-off-by: Paolo Abeni <pabeni@redhat.com>	2024-12-17 09:20:07 +01:00
Juergen Gross	7fa0da5373	x86/xen: remove hypercall page The hypercall page is no longer needed. It can be removed, as from the Xen perspective it is optional. But, from Linux's perspective, it removes naked RET instructions that escape the speculative protections that Call Depth Tracking and/or Untrain Ret are trying to achieve. This is part of XSA-466 / CVE-2024-53241. Reported-by: Andrew Cooper <andrew.cooper3@citrix.com> Signed-off-by: Juergen Gross <jgross@suse.com> Reviewed-by: Andrew Cooper <andrew.cooper3@citrix.com> Reviewed-by: Jan Beulich <jbeulich@suse.com>	2024-12-17 08:23:42 +01:00
Juergen Gross	b1c2cb86f4	x86/xen: use new hypercall functions instead of hypercall page Call the Xen hypervisor via the new xen_hypercall_func static-call instead of the hypercall page. This is part of XSA-466 / CVE-2024-53241. Reported-by: Andrew Cooper <andrew.cooper3@citrix.com> Signed-off-by: Juergen Gross <jgross@suse.com> Co-developed-by: Peter Zijlstra <peterz@infradead.org> Co-developed-by: Josh Poimboeuf <jpoimboe@redhat.com>	2024-12-17 08:23:41 +01:00
Juergen Gross	b4845bb638	x86/xen: add central hypercall functions Add generic hypercall functions usable for all normal (i.e. not iret) hypercalls. Depending on the guest type and the processor vendor different functions need to be used due to the to be used instruction for entering the hypervisor: - PV guests need to use syscall - HVM/PVH guests on Intel need to use vmcall - HVM/PVH guests on AMD and Hygon need to use vmmcall As PVH guests need to issue hypercalls very early during boot, there is a 4th hypercall function needed for HVM/PVH which can be used on Intel and AMD processors. It will check the vendor type and then set the Intel or AMD specific function to use via static_call(). This is part of XSA-466 / CVE-2024-53241. Reported-by: Andrew Cooper <andrew.cooper3@citrix.com> Signed-off-by: Juergen Gross <jgross@suse.com> Co-developed-by: Peter Zijlstra <peterz@infradead.org>	2024-12-17 08:23:29 +01:00
Dan Carpenter	7203d10e93	net: hinic: Fix cleanup in create_rxqs/txqs() There is a check for NULL at the start of create_txqs() and create_rxqs() which tests if "nic_dev->txqs" is non-NULL. The intention is that if the device is already open and the queues are already created then we don't create them a second time. However, the bug is that if we have an error in create_txqs() then the pointer doesn't get set back to NULL. The NULL check at the start of the function will say that it's already open when it's not and the device can't be used. Set ->txqs back to NULL on cleanup on error. Fixes: `c3e79baf1b` ("net-next/hinic: Add logical Txq and Rxq") Signed-off-by: Dan Carpenter <dan.carpenter@linaro.org> Reviewed-by: Simon Horman <horms@kernel.org> Link: https://patch.msgid.link/0cc98faf-a0ed-4565-a55b-0fa2734bc205@stanley.mountain Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 18:26:17 -08:00
Daniel Borkmann	e78c20f327	team: Fix feature exposure when no ports are present Small follow-up to align this to an equivalent behavior as the bond driver. The change in `3625920b62` ("teaming: fix vlan_features computing") removed the netdevice vlan_features when there is no team port attached, yet it leaves the full set of enc_features intact. Instead, leave the default features as pre `3625920b62`, and recompute once we do have ports attached. Also, similarly as in bonding case, call the netdev_base_features() helper on the enc_features. Fixes: `3625920b62` ("teaming: fix vlan_features computing") Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Reviewed-by: Nikolay Aleksandrov <razor@blackwall.org> Link: https://patch.msgid.link/20241213123657.401868-1-daniel@iogearbox.net Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 18:23:12 -08:00
Dan Carpenter	fbbd84af6b	chelsio/chtls: prevent potential integer overflow on 32bit The "gl->tot_len" variable is controlled by the user. It comes from process_responses(). On 32bit systems, the "gl->tot_len + sizeof(struct cpl_pass_accept_req) + sizeof(struct rss_header)" addition could have an integer wrapping bug. Use size_add() to prevent this. Fixes: `a089439478` ("crypto: chtls - Register chtls with net tls") Cc: stable@vger.kernel.org Signed-off-by: Dan Carpenter <dan.carpenter@linaro.org> Reviewed-by: Simon Horman <horms@kernel.org> Link: https://patch.msgid.link/c6bfb23c-2db2-4e1b-b8ab-ba3925c82ef5@stanley.mountain Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 18:08:11 -08:00
Jakub Kicinski	c8eb0c3ffd	Merge branch 'netdev-fix-repeated-netlink-messages-in-queue-dumps' Jakub Kicinski says: ==================== netdev: fix repeated netlink messages in queue dumps Fix dump continuation for queues and queue stats in the netdev family. Because we used post-increment when saving id of dumped queue next skb would re-dump the already dumped queue. ==================== Link: https://patch.msgid.link/20241213152244.3080955-1-kuba@kernel.org Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 17:30:14 -08:00
Jakub Kicinski	5712e323d4	selftests: net-drv: stats: sanity check netlink dumps Sanity check netlink dumps, to make sure dumps don't have repeated entries or gaps in IDs. Reviewed-by: Petr Machata <petrm@nvidia.com> Link: https://patch.msgid.link/20241213152244.3080955-6-kuba@kernel.org Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 17:30:13 -08:00
Jakub Kicinski	1234810b16	selftests: net-drv: queues: sanity check netlink dumps This test already catches a netlink bug fixed by this series, but only when running on HW with many queues. Make sure the netdevsim instance created has a lot of queues, and constrain the size of the recv_buffer used by netlink. While at it test both rx and tx queues. Reviewed-by: Joe Damato <jdamato@fastly.com> Reviewed-by: Petr Machata <petrm@nvidia.com> Link: https://patch.msgid.link/20241213152244.3080955-5-kuba@kernel.org Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 17:30:13 -08:00
Jakub Kicinski	0518863407	selftests: net: support setting recv_size in YNL recv_size parameter allows constraining the buffer size for dumps. It's useful in testing kernel handling of dump continuation, IOW testing dumps which span multiple skbs. Let the tests set this parameter when initializing the YNL family. Keep the normal default, we don't want tests to unintentionally behave very differently than normal code. Reviewed-by: Joe Damato <jdamato@fastly.com> Reviewed-by: Petr Machata <petrm@nvidia.com> Link: https://patch.msgid.link/20241213152244.3080955-4-kuba@kernel.org Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 17:30:13 -08:00
Jakub Kicinski	ecc391a541	netdev: fix repeated netlink messages in queue stats The context is supposed to record the next queue to dump, not last dumped. If the dump doesn't fit we will restart from the already-dumped queue, duplicating the message. Before this fix and with the selftest improvements later in this series we see: # ./run_kselftest.sh -t drivers/net:stats.py timeout set to 45 selftests: drivers/net: stats.py KTAP version 1 1..5 ok 1 stats.check_pause ok 2 stats.check_fec ok 3 stats.pkt_byte_sum # Check\| At /root/ksft-net-drv/drivers/net/./stats.py, line 125, in qstat_by_ifindex: # Check\| ksft_eq(len(queues[qtype]), len(set(queues[qtype])), # Check failed 45 != 44 repeated queue keys # Check\| At /root/ksft-net-drv/drivers/net/./stats.py, line 127, in qstat_by_ifindex: # Check\| ksft_eq(len(queues[qtype]), max(queues[qtype]) + 1, # Check failed 45 != 44 missing queue keys # Check\| At /root/ksft-net-drv/drivers/net/./stats.py, line 125, in qstat_by_ifindex: # Check\| ksft_eq(len(queues[qtype]), len(set(queues[qtype])), # Check failed 45 != 44 repeated queue keys # Check\| At /root/ksft-net-drv/drivers/net/./stats.py, line 127, in qstat_by_ifindex: # Check\| ksft_eq(len(queues[qtype]), max(queues[qtype]) + 1, # Check failed 45 != 44 missing queue keys # Check\| At /root/ksft-net-drv/drivers/net/./stats.py, line 125, in qstat_by_ifindex: # Check\| ksft_eq(len(queues[qtype]), len(set(queues[qtype])), # Check failed 103 != 100 repeated queue keys # Check\| At /root/ksft-net-drv/drivers/net/./stats.py, line 127, in qstat_by_ifindex: # Check\| ksft_eq(len(queues[qtype]), max(queues[qtype]) + 1, # Check failed 103 != 100 missing queue keys # Check\| At /root/ksft-net-drv/drivers/net/./stats.py, line 125, in qstat_by_ifindex: # Check\| ksft_eq(len(queues[qtype]), len(set(queues[qtype])), # Check failed 102 != 100 repeated queue keys # Check\| At /root/ksft-net-drv/drivers/net/./stats.py, line 127, in qstat_by_ifindex: # Check\| ksft_eq(len(queues[qtype]), 
max(queues[qtype]) + 1, # Check failed 102 != 100 missing queue keys not ok 4 stats.qstat_by_ifindex ok 5 stats.check_down # Totals: pass:4 fail:1 xfail:0 xpass:0 skip:0 error:0 With the fix: # ./ksft-net-drv/run_kselftest.sh -t drivers/net:stats.py timeout set to 45 selftests: drivers/net: stats.py KTAP version 1 1..5 ok 1 stats.check_pause ok 2 stats.check_fec ok 3 stats.pkt_byte_sum ok 4 stats.qstat_by_ifindex ok 5 stats.check_down # Totals: pass:5 fail:0 xfail:0 xpass:0 skip:0 error:0 Fixes: `ab63a2387c` ("netdev: add per-queue statistics") Reviewed-by: Joe Damato <jdamato@fastly.com> Link: https://patch.msgid.link/20241213152244.3080955-3-kuba@kernel.org Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 17:30:11 -08:00
Jakub Kicinski	b1f3a2f5a7	netdev: fix repeated netlink messages in queue dump The context is supposed to record the next queue to dump, not last dumped. If the dump doesn't fit we will restart from the already-dumped queue, duplicating the message. Before this fix and with the selftest improvements later in this series we see: # ./run_kselftest.sh -t drivers/net:queues.py timeout set to 45 selftests: drivers/net: queues.py KTAP version 1 1..2 # Check\| At /root/ksft-net-drv/drivers/net/./queues.py, line 32, in get_queues: # Check\| ksft_eq(queues, expected) # Check failed 102 != 100 # Check\| At /root/ksft-net-drv/drivers/net/./queues.py, line 32, in get_queues: # Check\| ksft_eq(queues, expected) # Check failed 101 != 100 not ok 1 queues.get_queues ok 2 queues.addremove_queues # Totals: pass:1 fail:1 xfail:0 xpass:0 skip:0 error:0 not ok 1 selftests: drivers/net: queues.py # exit=1 With the fix: # ./ksft-net-drv/run_kselftest.sh -t drivers/net:queues.py timeout set to 45 selftests: drivers/net: queues.py KTAP version 1 1..2 ok 1 queues.get_queues ok 2 queues.addremove_queues # Totals: pass:2 fail:0 xfail:0 xpass:0 skip:0 error:0 Fixes: `6b6171db7f` ("netdev-genl: Add netlink framework functions for queue") Reviewed-by: Joe Damato <jdamato@fastly.com> Link: https://patch.msgid.link/20241213152244.3080955-2-kuba@kernel.org Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-16 17:30:07 -08:00
Kees Cook	239d87327d	fortify: Hide run-time copy size from value range tracking GCC performs value range tracking for variables as a way to provide better diagnostics. One place this is regularly seen is with warnings associated with bounds-checking, e.g. -Wstringop-overflow, -Wstringop-overread, -Warray-bounds, etc. In order to keep the signal-to-noise ratio high, warnings aren't emitted when a value range spans the entire value range representable by a given variable. For example: unsigned int len; char dst[8]; ... memcpy(dst, src, len); If len's value is unknown, it has the full "unsigned int" range of [0, UINT_MAX], and GCC's compile-time bounds checks against memcpy() will be ignored. However, when a code path has been able to narrow the range: if (len > 16) return; memcpy(dst, src, len); Then the range will be updated for the execution path. Above, len is now [0, 16] when reading memcpy(), so depending on other optimizations, we might see a -Wstringop-overflow warning like: error: '__builtin_memcpy' writing between 9 and 16 bytes into region of size 8 [-Werror=stringop-overflow] When building with CONFIG_FORTIFY_SOURCE, the fortified run-time bounds checking can appear to narrow value ranges of lengths for memcpy(), depending on how the compiler constructs the execution paths during optimization passes, due to the checks against the field sizes. For example: if (p_size_field != SIZE_MAX && p_size != p_size_field && p_size_field < size) As intentionally designed, these checks only affect the kernel warnings emitted at run-time and do not block the potentially overflowing memcpy(), so GCC thinks it needs to produce a warning about the resulting value range that might be reaching the memcpy(). 
We have seen this manifest a few times now, with the most recent being with cpumasks: In function ‘bitmap_copy’, inlined from ‘cpumask_copy’ at ./include/linux/cpumask.h:839:2, inlined from ‘__padata_set_cpumasks’ at kernel/padata.c:730:2: ./include/linux/fortify-string.h:114:33: error: ‘__builtin_memcpy’ reading between 257 and 536870904 bytes from a region of size 256 [-Werror=stringop-overread] 114 \| #define __underlying_memcpy __builtin_memcpy \| ^ ./include/linux/fortify-string.h:633:9: note: in expansion of macro ‘__underlying_memcpy’ 633 \| __underlying_##op(p, q, __fortify_size); \ \| ^~~~~~~~~~~~~ ./include/linux/fortify-string.h:678:26: note: in expansion of macro ‘__fortify_memcpy_chk’ 678 \| #define memcpy(p, q, s) __fortify_memcpy_chk(p, q, s, \ \| ^~~~~~~~~~~~~~~~~~~~ ./include/linux/bitmap.h:259:17: note: in expansion of macro ‘memcpy’ 259 \| memcpy(dst, src, len); \| ^~~~~~ kernel/padata.c: In function ‘__padata_set_cpumasks’: kernel/padata.c:713:48: note: source object ‘pcpumask’ of size [0, 256] 713 \| cpumask_var_t pcpumask, \| ~~~~~~~~~~~~~~^~~~~~~~ This warning is _not_ emitted when CONFIG_FORTIFY_SOURCE is disabled, and with the recent -fdiagnostics-details we can confirm the origin of the warning is due to FORTIFY's bounds checking: ../include/linux/bitmap.h:259:17: note: in expansion of macro 'memcpy' 259 \| memcpy(dst, src, len); \| ^~~~~~ '__padata_set_cpumasks': events 1-2 ../include/linux/fortify-string.h:613:36: 612 \| if (p_size_field != SIZE_MAX && \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 613 \| p_size != p_size_field && p_size_field < size) \| ~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~ \| \| \| (1) when the condition is evaluated to false \| (2) when the condition is evaluated to true '__padata_set_cpumasks': event 3 114 \| #define __underlying_memcpy __builtin_memcpy \| ^ \| \| \| (3) out of array bounds here Note that the cpumask warning started appearing since bitmap functions were recently marked __always_inline in commit `ed8cd2b3bd` 
("bitmap: Switch from inline to __always_inline"), which allowed GCC to gain visibility into the variables as they passed through the FORTIFY implementation. In order to silence these false positives but keep otherwise deterministic compile-time warnings intact, hide the length variable from GCC with OPTIMIZE_HIDE_VAR() before calling the builtin memcpy. Additionally add a comment about why all the macro args have copies with const storage. Reported-by: "Thomas Weißschuh" <linux@weissschuh.net> Closes: https://lore.kernel.org/all/db7190c8-d17f-4a0d-bc2f-5903c79f36c2@t-8ch.de/ Reported-by: Nilay Shroff <nilay@linux.ibm.com> Closes: https://lore.kernel.org/all/20241112124127.1666300-1-nilay@linux.ibm.com/ Tested-by: Nilay Shroff <nilay@linux.ibm.com> Acked-by: Yury Norov <yury.norov@gmail.com> Acked-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org> Signed-off-by: Kees Cook <kees@kernel.org>	2024-12-16 16:23:07 -08:00
Murad Masimov	dd471e2577	hwmon: (tmp513) Fix interpretation of values of Temperature Result and Limit Registers The values returned by the driver after processing the contents of the Temperature Result and the Temperature Limit Registers do not correspond to the TMP512/TMP513 specifications. A raw register value is converted to a signed integer value by a sign extension in accordance with the algorithm provided in the specification, but due to the off-by-one error in the sign bit index, the result is incorrect. According to the TMP512 and TMP513 datasheets, the Temperature Result (08h to 0Bh) and Limit (11h to 14h) Registers are 13-bit two's complement integer values, shifted left by 3 bits. The value is scaled by 0.0625 degrees Celsius per bit. E.g., if regval = 1 1110 0111 0000 000, the output should be -25 degrees, but the driver will return +487 degrees. Found by Linux Verification Center (linuxtesting.org) with SVACE. Fixes: `59dfa75e5d` ("hwmon: Add driver for Texas Instruments TMP512/513 sensor chips.") Signed-off-by: Murad Masimov <m.masimov@maxima.ru> Link: https://lore.kernel.org/r/20241216173648.526-4-m.masimov@maxima.ru [groeck: fixed description line length] Signed-off-by: Guenter Roeck <linux@roeck-us.net>	2024-12-16 15:58:25 -08:00
Murad Masimov	da1d0e6ba2	hwmon: (tmp513) Fix Current Register value interpretation The value returned by the driver after processing the contents of the Current Register does not correspond to the TMP512/TMP513 specifications. A raw register value is converted to a signed integer value by a sign extension in accordance with the algorithm provided in the specification, but due to the off-by-one error in the sign bit index, the result is incorrect. Moreover, negative values will be reported as large positive due to missing sign extension from u32 to long. According to the TMP512 and TMP513 datasheets, the Current Register (07h) is a 16-bit two's complement integer value. E.g., if regval = 1000 0011 0000 0000, then the value must be (-32000 * lsb), but the driver will return (33536 * lsb). Fix off-by-one bug, and also cast data->curr_lsb_ua (which is of type u32) to long to prevent incorrect cast for negative values. Found by Linux Verification Center (linuxtesting.org) with SVACE. Fixes: `59dfa75e5d` ("hwmon: Add driver for Texas Instruments TMP512/513 sensor chips.") Signed-off-by: Murad Masimov <m.masimov@maxima.ru> Link: https://lore.kernel.org/r/20241216173648.526-3-m.masimov@maxima.ru [groeck: Fixed description line length] Signed-off-by: Guenter Roeck <linux@roeck-us.net>	2024-12-16 15:58:25 -08:00
Murad Masimov	74d7e038fd	hwmon: (tmp513) Fix interpretation of values of Shunt Voltage and Limit Registers The values returned by the driver after processing the contents of the Shunt Voltage Register and the Shunt Limit Registers do not correspond to the TMP512/TMP513 specifications. A raw register value is converted to a signed integer value by a sign extension in accordance with the algorithm provided in the specification, but due to the off-by-one error in the sign bit index, the result is incorrect. Moreover, the PGA shift calculated with the tmp51x_get_pga_shift function is relevant only to the Shunt Voltage Register, but is also applied to the Shunt Limit Registers. According to the TMP512 and TMP513 datasheets, the Shunt Voltage Register (04h) is 13 to 16 bit two's complement integer value, depending on the PGA setting. The Shunt Positive (0Ch) and Negative (0Dh) Limit Registers are 16-bit two's complement integer values. Below are some examples: * Shunt Voltage Register If PGA = 8, and regval = 1000 0011 0000 0000, then the decimal value must be -32000, but the value calculated by the driver will be 33536. * Shunt Limit Register If regval = 1000 0011 0000 0000, then the decimal value must be -32000, but the value calculated by the driver will be 768, if PGA = 1. Fix sign bit index, and also correct misleading comment describing the tmp51x_get_pga_shift function. Found by Linux Verification Center (linuxtesting.org) with SVACE. Fixes: `59dfa75e5d` ("hwmon: Add driver for Texas Instruments TMP512/513 sensor chips.") Signed-off-by: Murad Masimov <m.masimov@maxima.ru> Link: https://lore.kernel.org/r/20241216173648.526-2-m.masimov@maxima.ru [groeck: Fixed description and multi-line alignments] Signed-off-by: Guenter Roeck <linux@roeck-us.net>	2024-12-16 15:58:24 -08:00
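The sign-bit arithmetic behind the three tmp513 fixes above can be checked with a small userspace sketch. This is not the driver code: `sign_extend`, `temp_millicelsius` and `current_microamps` are hypothetical helpers written for illustration, and the only facts taken from the commit messages are the register layouts and the worked examples.

```c
#include <stdint.h>

/*
 * Interpret bit `sign_bit` of `value` as the two's complement sign bit.
 * Hypothetical helper; valid for sign_bit < 31.
 */
static int32_t sign_extend(uint32_t value, unsigned int sign_bit)
{
	uint32_t mask = UINT32_C(1) << sign_bit;

	value &= (mask << 1) - 1;	/* keep bits 0..sign_bit only */
	return (int32_t)(value ^ mask) - (int32_t)mask;
}

/* Temperature: 13-bit value stored in bits 15..3, 0.0625 degC per bit. */
static long temp_millicelsius(uint16_t regval)
{
	/* A 13-bit value has its sign in bit 12, not bit 13. */
	return (long)sign_extend(regval >> 3, 12) * 625 / 10;
}

/* Current: full 16-bit value, sign in bit 15, scaled by an unsigned LSB. */
static long current_microamps(uint16_t regval, uint32_t lsb_ua)
{
	/* Widen both operands to long so a negative value stays negative. */
	return (long)sign_extend(regval, 15) * (long)lsb_ua;
}
```

With the temperature example from the log (`1 1110 0111 0000 000`, i.e. `0xF380`), `temp_millicelsius()` gives -25000, whereas extending from bit 13 leaves the raw value at 7792, which is 487 degrees after scaling. For the current example, `0x8300` extends to -32000 from bit 15 and stays at 33536 when the index is one too high.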
Ilya Dryomov	18d44c5d06	ceph: allocate sparse_ext map only for sparse reads If mounted with the sparseread option, ceph_direct_read_write() ends up making an unnecessary allocation for O_DIRECT writes. Fixes: `03bc06c7b0` ("ceph: add new mount option to enable sparse reads") Signed-off-by: Ilya Dryomov <idryomov@gmail.com> Reviewed-by: Alex Markuze <amarkuze@redhat.com>	2024-12-16 23:25:44 +01:00
Ilya Dryomov	66e0c4f914	ceph: fix memory leak in ceph_direct_read_write() The bvecs array which is allocated in iter_get_bvecs_alloc() is leaked and pages remain pinned if ceph_alloc_sparse_ext_map() fails. There is no need to delay the allocation of sparse_ext map until after the bvecs array is set up, so fix this by moving sparse_ext allocation a bit earlier. Also, make a similar adjustment in __ceph_sync_read() for consistency (a leak of the same kind in __ceph_sync_read() has been addressed differently). Cc: stable@vger.kernel.org Fixes: `03bc06c7b0` ("ceph: add new mount option to enable sparse reads") Signed-off-by: Ilya Dryomov <idryomov@gmail.com> Reviewed-by: Alex Markuze <amarkuze@redhat.com>	2024-12-16 23:25:44 +01:00
Alex Markuze	9abee47580	ceph: improve error handling and short/overflow-read logic in __ceph_sync_read() This patch refines the read logic in __ceph_sync_read() to ensure more predictable and efficient behavior in various edge cases. - Return early if the requested read length is zero or if the file size (`i_size`) is zero. - Initialize the index variable (`idx`) where needed and reorder some code to ensure it is always set before use. - Improve error handling by checking for negative return values earlier. - Remove redundant encrypted file checks after failures. Only attempt filesystem-level decryption if the read succeeded. - Simplify leftover calculations to correctly handle cases where the read extends beyond the end of the file or stops short. This can be hit by continuously reading a file while, on another client, we keep truncating and writing new data into it. - This resolves multiple issues caused by integer and consequent buffer overflow (`pages` array being accessed beyond `num_pages`): - https://tracker.ceph.com/issues/67524 - https://tracker.ceph.com/issues/68980 - https://tracker.ceph.com/issues/68981 Cc: stable@vger.kernel.org Fixes: `1065da21e5` ("ceph: stop copying to iter at EOF on sync reads") Reported-by: Luis Henriques (SUSE) <luis.henriques@linux.dev> Signed-off-by: Alex Markuze <amarkuze@redhat.com> Reviewed-by: Viacheslav Dubeyko <Slava.Dubeyko@ibm.com> Signed-off-by: Ilya Dryomov <idryomov@gmail.com>	2024-12-16 23:25:43 +01:00
Ilya Dryomov	12eb22a5a6	ceph: validate snapdirname option length when mounting It becomes a path component, so it shouldn't exceed NAME_MAX characters. This was hardened in commit `c152737be2` ("ceph: Use strscpy() instead of strcpy() in __get_snap_name()"), but no actual check was put in place. Cc: stable@vger.kernel.org Signed-off-by: Ilya Dryomov <idryomov@gmail.com> Reviewed-by: Alex Markuze <amarkuze@redhat.com>	2024-12-16 23:25:43 +01:00
Max Kellermann	550f7ca98e	ceph: give up on paths longer than PATH_MAX If the full path to be built by ceph_mdsc_build_path() happens to be longer than PATH_MAX, then this function will enter an endless (retry) loop, effectively blocking the whole task. Most of the machine becomes unusable, making this a very simple and effective DoS vulnerability. I cannot imagine why this retry was ever implemented, but it seems rather useless and harmful to me. Let's remove it and fail with ENAMETOOLONG instead. Cc: stable@vger.kernel.org Reported-by: Dario Weißer <dario@cure53.de> Signed-off-by: Max Kellermann <max.kellermann@ionos.com> Reviewed-by: Alex Markuze <amarkuze@redhat.com> Signed-off-by: Ilya Dryomov <idryomov@gmail.com>	2024-12-16 23:25:43 +01:00
Max Kellermann	d6fd6f8280	ceph: fix memory leaks in __ceph_sync_read() In two `break` statements, the call to ceph_release_page_vector() was missing, leaking the allocation from ceph_alloc_page_vector(). Instead of adding the missing ceph_release_page_vector() calls, the Ceph maintainers preferred to transfer page ownership to the `ceph_osd_request` by passing `own_pages=true` to osd_req_op_extent_osd_data_pages(). This requires postponing the ceph_osdc_put_request() call until after the block that accesses the `pages`. Cc: stable@vger.kernel.org Fixes: `03bc06c7b0` ("ceph: add new mount option to enable sparse reads") Fixes: `f0fe1e54cf` ("ceph: plumb in decryption during reads") Signed-off-by: Max Kellermann <max.kellermann@ionos.com> Reviewed-by: Ilya Dryomov <idryomov@gmail.com> Signed-off-by: Ilya Dryomov <idryomov@gmail.com>	2024-12-16 23:25:43 +01:00
Steven Rostedt	166438a432	ftrace: Do not find "true_parent" if HAVE_DYNAMIC_FTRACE_WITH_ARGS is not set When function tracing and function graph tracing are both enabled (in different instances) the "parent" of some of the function tracing events is "return_to_handler" which is the trampoline used by function graph tracing. To fix this, ftrace_get_true_parent_ip() was introduced that returns the "true" parent ip instead of the trampoline. To do this, the ftrace_regs_get_stack_pointer() is used, which uses kernel_stack_pointer(). The problem is that microblaze does not implement kernel_stack_pointer() so when function graph tracing is enabled, the build fails. But microblaze also does not enable HAVE_DYNAMIC_FTRACE_WITH_ARGS. That option has to be enabled by the architecture to reliably get the values from the fregs parameter passed in. When that config is not set, the architecture can also pass in NULL, which is not tested for in that function and could cause the kernel to crash. Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Cc: Mark Rutland <mark.rutland@arm.com> Cc: Al Viro <viro@ZenIV.linux.org.uk> Cc: Michal Simek <monstr@monstr.eu> Cc: Jeff Xie <jeff.xie@linux.dev> Link: https://lore.kernel.org/20241216164633.6df18e87@gandalf.local.home Fixes: `60b1f578b5` ("ftrace: Get the true parent ip for function tracer") Reported-by: Al Viro <viro@zeniv.linux.org.uk> Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-16 17:22:26 -05:00
Steven Rostedt	cc252bb592	fgraph: Still initialize idle shadow stacks when starting A bug was discovered where the idle shadow stacks were not initialized for offline CPUs when starting function graph tracer, and when they came online they were not traced due to the missing shadow stack. To fix this, the idle task shadow stack initialization was moved to using the CPU hotplug callbacks. But it removed the initialization when the function graph was enabled. The problem here is that the hotplug callbacks are called when the CPUs come online, but the idle shadow stack initialization only happens if function graph is currently active. This caused the online CPUs to not get their shadow stack initialized. The idle shadow stack initialization still needs to be done when the function graph is registered, as they will not be allocated if function graph is not registered. Cc: stable@vger.kernel.org Cc: Masami Hiramatsu <mhiramat@kernel.org> Cc: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Link: https://lore.kernel.org/20241211135335.094ba282@batman.local.home Fixes: `2c02f7375e` ("fgraph: Use CPU hotplug mechanism to initialize idle shadow stacks") Reported-by: Linus Walleij <linus.walleij@linaro.org> Tested-by: Linus Walleij <linus.walleij@linaro.org> Closes: https://lore.kernel.org/all/CACRpkdaTBrHwRbbrphVy-=SeDz6MSsXhTKypOtLrTQ+DgGAOcQ@mail.gmail.com/ Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>	2024-12-16 16:03:33 -05:00
Daniel Lezcano	65c8c78cc7	thermal/thresholds: Fix uapi header macros leading to a compilation error The macros giving the direction of the crossing thresholds use the BIT macro which is not exported to the userspace. Consequently when a userspace program includes the header, it fails to compile. Replace the macros by their literal values to allow the compilation of userspace programs using this header. Fixes: `445936f9e2` ("thermal: core: Add user thresholds support") Signed-off-by: Daniel Lezcano <daniel.lezcano@linaro.org> Link: https://patch.msgid.link/20241212201311.4143196-1-daniel.lezcano@linaro.org [ rjw: Add Fixes: ] Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>	2024-12-16 21:30:20 +01:00
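The thermal header problem above is easy to reproduce in miniature: a header meant for userspace cannot rely on the kernel-internal `BIT()` helper, so flag values have to be spelled out. A minimal sketch follows; the macro and function names are hypothetical and are not the identifiers used in the real uapi header.

```c
/* Hypothetical flag names, for illustration only. */
#define EXAMPLE_THRESHOLD_WAY_UP	0x1	/* literal value, where kernel code might write BIT(0) */
#define EXAMPLE_THRESHOLD_WAY_DOWN	0x2	/* literal value, where kernel code might write BIT(1) */

/* Builds in userspace because nothing here depends on kernel-only macros. */
static int crossed_upwards(unsigned int direction)
{
	return (direction & EXAMPLE_THRESHOLD_WAY_UP) != 0;
}
```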
Linus Torvalds	f44d154d6e	Merge tag 'soc-fixes-6.13' of git://git.kernel.org/pub/scm/linux/kernel/git/soc/soc Pull SoC fixes from Arnd Bergmann: "Three small fixes for the soc tree: - devicetree fix for the Arm Juno reference machine, to allow more interesting PCI configurations - build fix for SCMI firmware on the NXP i.MX platform - fix for a race condition in Arm FF-A firmware" * tag 'soc-fixes-6.13' of git://git.kernel.org/pub/scm/linux/kernel/git/soc/soc: arm64: dts: fvp: Update PCIe bus-range property firmware: arm_ffa: Fix the race around setting ffa_dev->properties firmware: arm_scmi: Fix i.MX build dependency	2024-12-16 10:10:53 -08:00
Linus Torvalds	dc690bc256	Merge tag 'platform-drivers-x86-v6.13-3' of git://git.kernel.org/pub/scm/linux/kernel/git/pdx86/platform-drivers-x86 Pull x86 platform driver fixes from Ilpo Järvinen: - alienware-wmi: - Add support for Alienware m16 R1 AMD - Do not setup legacy LED control with X and G Series - intel/ifs: Clearwater Forest support - intel/vsec: Panther Lake support - p2sb: Do not hide the device if BIOS left it unhidden - touchscreen_dmi: Add SARY Tab 3 tablet information * tag 'platform-drivers-x86-v6.13-3' of git://git.kernel.org/pub/scm/linux/kernel/git/pdx86/platform-drivers-x86: platform/x86/intel/vsec: Add support for Panther Lake platform/x86/intel/ifs: Add Clearwater Forest to CPU support list platform/x86: touchscreen_dmi: Add info for SARY Tab 3 tablet p2sb: Do not scan and remove the P2SB device when it is unhidden p2sb: Move P2SB hide and unhide code to p2sb_scan_and_cache() p2sb: Introduce the global flag p2sb_hidden_by_bios p2sb: Factor out p2sb_read_from_cache() alienware-wmi: Adds support to Alienware m16 R1 AMD alienware-wmi: Fix X Series and G Series quirks	2024-12-16 10:01:57 -08:00
Mark Brown	001a3d5e8b	ASoC: Intel: sof_sdw: Update DMI matches for Lenovo Merge series from Bard Liao <yung-chuan.liao@linux.intel.com>: The DMI match information for these models has changed so the match entries need updates.	2024-12-16 17:10:37 +00:00
Greg Kroah-Hartman	59275b7633	Merge tag 'usb-serial-6.13-rc3' of ssh://gitolite.kernel.org/pub/scm/linux/kernel/git/johan/usb-serial into usb-linus Johan writes: USB-serial device ids for 6.13-rc3 Here are some new modem device ids. All have been in linux-next with no reported issues. * tag 'usb-serial-6.13-rc3' of ssh://gitolite.kernel.org/pub/scm/linux/kernel/git/johan/usb-serial: USB: serial: option: add Telit FE910C04 rmnet compositions USB: serial: option: add MediaTek T7XX compositions USB: serial: option: add Netprisma LCUK54 modules for WWAN Ready USB: serial: option: add MeiG Smart SLM770A USB: serial: option: add TCL IK512 MBIM & ECM	2024-12-16 16:31:47 +01:00
Chen-Yu Tsai	6f4a0fd03c	ASoC: dt-bindings: realtek,rt5645: Fix CPVDD voltage comment Both the ALC5645 and ALC5650 datasheets specify a recommended voltage of 1.8V for CPVDD, not 3.5V. Fix the comment. Cc: Matthias Brugger <matthias.bgg@gmail.com> Fixes: `26aa19174f` ("ASoC: dt-bindings: rt5645: add suppliers") Fixes: `83d43ab0a1` ("ASoC: dt-bindings: realtek,rt5645: Convert to dtschema") Signed-off-by: Chen-Yu Tsai <wenst@chromium.org> Acked-by: Krzysztof Kozlowski <krzysztof.kozlowski@linaro.org> Link: https://patch.msgid.link/20241211035403.4157760-1-wenst@chromium.org Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-16 15:12:40 +00:00
Richard Fitzgerald	ba7d47a54b	ASoC: Intel: sof_sdw: Fix DMI match for Lenovo 21QA and 21QB Update the DMI match for a Lenovo laptop to the new DMI identifier. This laptop ships with a different DMI identifier to what was expected, and now has two identifiers. Signed-off-by: Richard Fitzgerald <rf@opensource.cirrus.com> Fixes: `ea657f6b24` ("ASoC: Intel: sof_sdw: Add quirk for cs42l43 system using host DMICs") Signed-off-by: Bard Liao <yung-chuan.liao@linux.intel.com> Link: https://patch.msgid.link/20241216140821.153670-3-yung-chuan.liao@linux.intel.com Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-16 14:28:58 +00:00
Richard Fitzgerald	7c449ef0fd	ASoC: Intel: sof_sdw: Fix DMI match for Lenovo 21Q6 and 21Q7 Update the DMI match for a Lenovo laptop to the new DMI identifier. This laptop ships with a different DMI identifier to what was expected, and now has two identifiers. Signed-off-by: Richard Fitzgerald <rf@opensource.cirrus.com> Fixes: `83c062ae81` ("ASoC: Intel: sof_sdw: Add quirks for some new Lenovo laptops") Signed-off-by: Bard Liao <yung-chuan.liao@linux.intel.com> Link: https://patch.msgid.link/20241216140821.153670-2-yung-chuan.liao@linux.intel.com Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-16 14:28:57 +00:00
Gao Xiang	6422cde1b0	erofs: use buffered I/O for file-backed mounts by default For many use cases (e.g. container images are just fetched from remote), performance will be impacted if underlay page cache is up-to-date but direct i/o flushes dirty pages first. Instead, let's use buffered I/O by default to keep in sync with loop devices and add a (re)mount option to explicitly give a try to use direct I/O if supported by the underlying files. The container startup time is improved as below: [workload] docker.io/library/workpress:latest unpack 1st run non-1st runs EROFS snapshotter buffered I/O file 4.586404265s 0.308s 0.198s EROFS snapshotter direct I/O file 4.581742849s 2.238s 0.222s EROFS snapshotter loop 4.596023152s 0.346s 0.201s Overlayfs snapshotter 5.382851037s 0.206s 0.214s Fixes: `fb17675026` ("erofs: add file-backed mount support") Cc: Derek McGowan <derek@mcg.dev> Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com> Link: https://lore.kernel.org/r/20241212134336.2059899-1-hsiangkao@linux.alibaba.com	2024-12-16 21:02:07 +08:00
Gao Xiang	f8d920a402	erofs: reference `struct erofs_device_info` for erofs_map_dev Record `m_sb` and `m_dif` to replace `m_fscache`, `m_daxdev`, `m_fp` and `m_dax_part_off` in order to simplify the codebase. Note that `m_bdev` is still left since it can be assigned from `sb->s_bdev` directly. Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com> Link: https://lore.kernel.org/r/20241212235401.2857246-1-hsiangkao@linux.alibaba.com	2024-12-16 21:02:06 +08:00
Gao Xiang	7b00af2c54	erofs: use `struct erofs_device_info` for the primary device Instead of just listing each one directly in `struct erofs_sb_info` except that we still use `sb->s_bdev` for the primary block device. Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com> Link: https://lore.kernel.org/r/20241216125310.930933-2-hsiangkao@linux.alibaba.com	2024-12-16 21:01:59 +08:00
Venkata Prasad Potturu	88438444fd	ASoC: amd: ps: Fix for enabling DMIC on acp63 platform via _DSD entry Add condition check to register ACP PDM sound card by reading _WOV acpi entry. Fixes: `0386d765f2` ("ASoC: amd: ps: refactor acp device configuration read logic") Signed-off-by: Venkata Prasad Potturu <venkataprasad.potturu@amd.com> Link: https://patch.msgid.link/20241213061147.1060451-1-venkataprasad.potturu@amd.com Signed-off-by: Mark Brown <broonie@kernel.org>	2024-12-16 12:31:03 +00:00
Thomas Gleixner	a60b990798	PCI/MSI: Handle lack of irqdomain gracefully Alexandre observed a warning emitted from pci_msi_setup_msi_irqs() on a RISCV platform which does not provide PCI/MSI support: WARNING: CPU: 1 PID: 1 at drivers/pci/msi/msi.h:121 pci_msi_setup_msi_irqs+0x2c/0x32 __pci_enable_msix_range+0x30c/0x596 pci_msi_setup_msi_irqs+0x2c/0x32 pci_alloc_irq_vectors_affinity+0xb8/0xe2 RISCV uses hierarchical interrupt domains and correctly does not implement the legacy fallback. The warning triggers from the legacy fallback stub. That warning is bogus as the PCI/MSI layer knows whether a PCI/MSI parent domain is associated with the device or not. There is a check for MSI-X, which has a legacy assumption. But that legacy fallback assumption is only valid when legacy support is enabled, but otherwise the check should simply return -ENOTSUPP. Loongarch tripped over the same problem and blindly enabled legacy support without implementing the legacy fallbacks. There are weak implementations which return an error, so the problem was papered over. Correct pci_msi_domain_supports() to evaluate the legacy mode and add the missing supported check into the MSI enable path to complete it. Fixes: `d2a463b297` ("PCI/MSI: Reject multi-MSI early") Reported-by: Alexandre Ghiti <alexghiti@rivosinc.com> Signed-off-by: Thomas Gleixner <tglx@linutronix.de> Tested-by: Alexandre Ghiti <alexghiti@rivosinc.com> Cc: stable@vger.kernel.org Link: https://lore.kernel.org/all/87ed2a8ow5.ffs@tglx	2024-12-16 10:59:47 +01:00
Mika Westerberg	24740385cb	thunderbolt: Improve redrive mode handling When a USB-C monitor is connected directly to an Intel Barlow Ridge host, it goes into "redrive" mode that basically routes the DisplayPort signals directly from the GPU to the USB-C monitor without any tunneling needed. However, the host router must be powered on for this to work. Aaron reported that there are a couple of cases where this will not work with the current code: - Booting with USB-C monitor plugged in. - Plugging in USB-C monitor when the host router is in sleep state (runtime suspended). - Plugging in USB-C device while the system is in system sleep state. In all these cases once the host router is runtime suspended the picture on the connected USB-C display disappears too. This is certainly not what the user expected. For this reason improve the redrive mode handling to keep the host router from runtime suspending when it detects that any of the above cases is happening. Fixes: `a75e0684ef` ("thunderbolt: Keep the domain powered when USB4 port is in redrive mode") Reported-by: Aaron Rainbolt <arainbolt@kfocus.org> Closes: https://lore.kernel.org/linux-usb/20241009220118.70bfedd0@kf-ir16/ Cc: stable@vger.kernel.org Signed-off-by: Mika Westerberg <mika.westerberg@linux.intel.com>	2024-12-16 09:59:38 +02:00
Namjae Jeon	fe4ed2f09b	ksmbd: conn lock to serialize smb2 negotiate If a client sends parallel smb2 negotiate requests on the same connection, ksmbd_conn can be racy. smb2 negotiate handling, which is not performance-related, can be serialized with the conn lock. Signed-off-by: Namjae Jeon <linkinjeon@kernel.org> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-15 22:20:03 -06:00
Marios Makassikis	43fb7bce88	ksmbd: fix broken transfers when exceeding max simultaneous operations Since commit `0a77d947f5` ("ksmbd: check outstanding simultaneous SMB operations"), ksmbd enforces a maximum number of simultaneous operations for a connection. The problem is that reaching the limit causes ksmbd to close the socket, and the client has no indication that it should have slowed down. This behaviour can be reproduced by setting "smb2 max credits = 128" (or lower), and transferring a large file (25GB). smbclient fails as below: $ smbclient //192.168.1.254/testshare -U user%pass smb: \> put file.bin cli_push returned NT_STATUS_USER_SESSION_DELETED putting file file.bin as \file.bin smb2cli_req_compound_submit: Insufficient credits. 0 available, 1 needed NT_STATUS_INTERNAL_ERROR closing remote file \file.bin smb: \> smb2cli_req_compound_submit: Insufficient credits. 0 available, 1 needed Windows clients fail with 0x8007003b (with smaller files even). Fix this by delaying reading from the socket until there's room to allocate a request. This effectively applies backpressure on the client, so the transfer completes, albeit at a slower rate. Fixes: `0a77d947f5` ("ksmbd: check outstanding simultaneous SMB operations") Signed-off-by: Marios Makassikis <mmakassikis@freebox.fr> Signed-off-by: Namjae Jeon <linkinjeon@kernel.org> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-15 22:20:03 -06:00
Marios Makassikis	83c47d9e0c	ksmbd: count all requests in req_running counter This changes the semantics of req_running to count all in-flight requests on a given connection, rather than the number of elements in the conn->request list. The latter is used only in smb2_cancel, and the counter is not used Signed-off-by: Marios Makassikis <mmakassikis@freebox.fr> Acked-by: Namjae Jeon <linkinjeon@kernel.org> Signed-off-by: Steve French <stfrench@microsoft.com>	2024-12-15 22:20:03 -06:00
Thiébaud Weksteen	900f83cf37	selinux: ignore unknown extended permissions When evaluating extended permissions, ignore unknown permissions instead of calling BUG(). This commit ensures that future permissions can be added without interfering with older kernels. Cc: stable@vger.kernel.org Fixes: `fa1aa143ac` ("selinux: extended permissions for ioctls") Signed-off-by: Thiébaud Weksteen <tweek@google.com> Signed-off-by: Paul Moore <paul@paul-moore.com>	2024-12-15 21:59:03 -05:00
Nikita Yushchenko	922b4b955a	net: renesas: rswitch: rework ts tags management The existing linked list based implementation of how ts tags are assigned and managed is unsafe against concurrency and corner cases: - element addition in tx processing can race against element removal in ts queue completion, - element removal in ts queue completion can race against element removal in device close, - if a large number of frames gets added to tx queue without ts queue completions in between, elements with duplicate tag values can get added. Use a different implementation, based on per-port used tags bitmaps and saved skb arrays. Safety for addition in tx processing vs removal in ts completion is provided by: tag = find_first_zero_bit(...); smp_mb(); <write rdev->ts_skb[tag]> set_bit(...); vs <read rdev->ts_skb[tag]> smp_mb(); clear_bit(...); Safety for removal in ts completion vs removal in device close is provided by using atomic read-and-clear for rdev->ts_skb[tag]: ts_skb = xchg(&rdev->ts_skb[tag], NULL); if (ts_skb) <handle it> Fixes: `33f5d733b5` ("net: renesas: rswitch: Improve TX timestamp accuracy") Signed-off-by: Nikita Yushchenko <nikita.yoush@cogentembedded.com> Link: https://patch.msgid.link/20241212062558.436455-1-nikita.yoush@cogentembedded.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-15 14:39:10 -08:00
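The bitmap-plus-array scheme described in the rswitch commit above can be approximated in userspace with C11 atomics. This is a sketch under stated assumptions, not the driver: `struct ts_tags`, `ts_tag_add`, `ts_tag_take` and `TS_TAG_COUNT` are hypothetical names, sequentially consistent atomics stand in for the kernel's `smp_mb()`, `set_bit()`, `clear_bit()` and `xchg()`, and a single adder per port is assumed.

```c
#include <stdatomic.h>
#include <stddef.h>

#define TS_TAG_COUNT 8	/* hypothetical size; the real value is not given in the log */

struct ts_tags {
	atomic_uint used;			/* bit n set: tag n is in flight */
	_Atomic(void *) skb[TS_TAG_COUNT];	/* saved frame per tag */
};

/* TX side (single adder assumed): store the frame first, then mark the tag used. */
static int ts_tag_add(struct ts_tags *t, void *skb)
{
	unsigned int used = atomic_load(&t->used);

	for (int tag = 0; tag < TS_TAG_COUNT; tag++) {
		if (used & (1u << tag))
			continue;
		atomic_store(&t->skb[tag], skb);
		atomic_fetch_or(&t->used, 1u << tag);
		return tag;
	}
	return -1;	/* every tag is in flight */
}

/*
 * Completion or close side: the atomic exchange guarantees that only one
 * caller receives the frame; the other sees NULL and does nothing.
 */
static void *ts_tag_take(struct ts_tags *t, unsigned int tag)
{
	void *skb;

	if (tag >= TS_TAG_COUNT)
		return NULL;
	skb = atomic_exchange(&t->skb[tag], NULL);
	if (skb)
		atomic_fetch_and(&t->used, ~(1u << tag));
	return skb;
}
```

Because a tag only becomes reusable after its bit is cleared, a burst of transmissions without completions exhausts the tags instead of producing duplicates, which is the third corner case the commit message lists.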
Vasily Gorbik	282da38b46	s390/mm: Consider KMSAN modules metadata for paging levels The calculation determining whether to use three- or four-level paging didn't account for KMSAN modules metadata. Include this metadata in the virtual memory size calculation to ensure correct paging mode selection and avoid potentially unnecessary physical memory size limitations. Fixes: `65ca73f9fb` ("s390/mm: define KMSAN metadata for vmalloc and modules") Acked-by: Heiko Carstens <hca@linux.ibm.com> Reviewed-by: Alexander Gordeev <agordeev@linux.ibm.com> Reviewed-by: Ilya Leoshkevich <iii@linux.ibm.com> Signed-off-by: Vasily Gorbik <gor@linux.ibm.com> Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>	2024-12-15 23:35:09 +01:00
Jakub Kicinski	cb85f2b897	Merge branch 'ionic-minor-code-fixes' Shannon Nelson says: ==================== ionic: minor code fixes These are a couple of code fixes for the ionic driver. ==================== Link: https://patch.msgid.link/20241212213157.12212-1-shannon.nelson@amd.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-15 14:33:33 -08:00
Shannon Nelson	b096d62ba1	ionic: use ee->offset when returning sprom data Some calls into ionic_get_module_eeprom() don't use a single full buffer size, but instead multiple calls with an offset. Teach our driver to use the offset correctly so we can respond appropriately to the caller. Fixes: `4d03e00a21` ("ionic: Add initial ethtool support") Signed-off-by: Shannon Nelson <shannon.nelson@amd.com> Reviewed-by: Jacob Keller <jacob.e.keller@intel.com> Link: https://patch.msgid.link/20241212213157.12212-4-shannon.nelson@amd.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-15 14:33:31 -08:00
Shannon Nelson	746e6ae2e2	ionic: no double destroy workqueue There are some FW error handling paths that can cause us to try to destroy the workqueue more than once, so let's be sure we're checking for that. The case where this popped up was in an AER event where the handlers got called in such a way that ionic_reset_prepare() and thus ionic_dev_teardown() got called twice in a row. The second time through the workqueue was already destroyed, and destroy_workqueue() choked on the bad wq pointer. We didn't hit this in AER handler testing before because at that time we weren't using a private workqueue. Later we replaced the use of the system workqueue with our own private workqueue but hadn't rerun the AER handler testing since then. Fixes: `9e25450da7` ("ionic: add private workqueue per-device") Signed-off-by: Shannon Nelson <shannon.nelson@amd.com> Reviewed-by: Jacob Keller <jacob.e.keller@intel.com> Link: https://patch.msgid.link/20241212213157.12212-3-shannon.nelson@amd.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-15 14:33:31 -08:00
Brett Creeley	9590d32e09	ionic: Fix netdev notifier unregister on failure If register_netdev() fails, then the driver leaks the netdev notifier. Fix this by calling ionic_lif_unregister() on register_netdev() failure. This will also call ionic_lif_unregister_phc() if it has already been registered. Fixes: `30b87ab4c0` ("ionic: remove lif list concept") Signed-off-by: Brett Creeley <brett.creeley@amd.com> Signed-off-by: Shannon Nelson <shannon.nelson@amd.com> Reviewed-by: Jacob Keller <jacob.e.keller@intel.com> Link: https://patch.msgid.link/20241212213157.12212-2-shannon.nelson@amd.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-15 14:33:31 -08:00
Donald Hunter	663ad7481f	tools/net/ynl: fix sub-message key lookup for nested attributes Use the correct attribute space for sub-message key lookup in nested attributes when adding attributes. This fixes rt_link where the "kind" key and "data" sub-message are nested attributes in "linkinfo". For example: ./tools/net/ynl/cli.py \ --create \ --spec Documentation/netlink/specs/rt_link.yaml \ --do newlink \ --json '{"link": 99, "linkinfo": { "kind": "vlan", "data": {"id": 4 } } }' Signed-off-by: Donald Hunter <donald.hunter@gmail.com> Fixes: `ab463c4342` ("tools/net/ynl: Add support for encoding sub-messages") Link: https://patch.msgid.link/20241213130711.40267-1-donald.hunter@gmail.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-15 13:30:43 -08:00
Eric Dumazet	ee76746387	netdevsim: prevent bad user input in nsim_dev_health_break_write() If either a zero count or a large one is provided, kernel can crash. Fixes: `82c93a87bf` ("netdevsim: implement couple of testing devlink health reporters") Reported-by: syzbot+ea40e4294e58b0292f74@syzkaller.appspotmail.com Closes: https://lore.kernel.org/netdev/675c6862.050a0220.37aaf.00b1.GAE@google.com/T/#u Signed-off-by: Eric Dumazet <edumazet@google.com> Cc: Jiri Pirko <jiri@nvidia.com> Reviewed-by: Joe Damato <jdamato@fastly.com> Link: https://patch.msgid.link/20241213172518.2415666-1-edumazet@google.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-15 13:26:47 -08:00
Vladimir Oltean	2d5df3a680	net: mscc: ocelot: fix incorrect IFH SRC_PORT field in ocelot_ifh_set_basic() Packets injected by the CPU should have a SRC_PORT field equal to the CPU port module index in the Analyzer block (ocelot->num_phys_ports). The blamed commit copied the ocelot_ifh_set_basic() call incorrectly from ocelot_xmit_common() in net/dsa/tag_ocelot.c. Instead of calling with "x", it calls with BIT_ULL(x), but the field is not a port mask, but rather a single port index. [ side note: this is the technical debt of code duplication :( ] The error used to be silent and doesn't appear to have other user-visible manifestations, but with new changes in the packing library, it now fails loudly as follows: ------------[ cut here ]------------ Cannot store 0x40 inside bits 46-43 - will truncate sja1105 spi2.0: xmit timed out WARNING: CPU: 1 PID: 102 at lib/packing.c:98 __pack+0x90/0x198 sja1105 spi2.0: timed out polling for tstamp CPU: 1 UID: 0 PID: 102 Comm: felix_xmit Tainted: G W N 6.13.0-rc1-00372-gf706b85d972d-dirty #2605 Call trace: __pack+0x90/0x198 (P) __pack+0x90/0x198 (L) packing+0x78/0x98 ocelot_ifh_set_basic+0x260/0x368 ocelot_port_inject_frame+0xa8/0x250 felix_port_deferred_xmit+0x14c/0x258 kthread_worker_fn+0x134/0x350 kthread+0x114/0x138 The code path pertains to the ocelot switchdev driver and to the felix secondary DSA tag protocol, ocelot-8021q. Here seen with ocelot-8021q. The messenger (packing) is not really to blame, so fix the original commit instead. Fixes: `e1b9e80236` ("net: mscc: ocelot: fix QoS class for injected packets with "ocelot-8021q"") Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com> Reviewed-by: Simon Horman <horms@kernel.org> Link: https://patch.msgid.link/20241212165546.879567-1-vladimir.oltean@nxp.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-15 13:12:58 -08:00
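The ocelot warning above ("Cannot store 0x40 inside bits 46-43") comes down to passing a one-hot mask where a small index is expected. A minimal sketch of the width check, assuming a hypothetical helper `fits_in_field` (this is not the packing library's API):

```c
#include <stdbool.h>
#include <stdint.h>

/* True if `value` can be stored in the bit field [hi:lo] without truncation. */
static bool fits_in_field(uint64_t value, unsigned int hi, unsigned int lo)
{
	unsigned int width = hi - lo + 1;

	return width >= 64 || (value >> width) == 0;
}
```

Bits 46-43 form a 4-bit field, so a port index up to 15 fits, while `BIT_ULL(6)` (0x40) needs seven bits and would be truncated: `fits_in_field(6, 46, 43)` is true and `fits_in_field(1ULL << 6, 46, 43)` is false.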
David S. Miller	c296c0bf45	Merge branch 'smc-fixes' Guangguan Wang says: ==================== net: several fixes for smc v1 -> v2: rewrite patch #2 suggested by Paolo. ==================== Signed-off-by: David S. Miller <davem@davemloft.net>	2024-12-15 12:35:00 +00:00
Guangguan Wang	c5b8ee5022	net/smc: check return value of sock_recvmsg when draining clc data When receiving clc msg, the field length in smc_clc_msg_hdr indicates the length of msg should be received from network and the value should not be fully trusted as it is from the network. Once the value of length exceeds the value of buflen in function smc_clc_wait_msg it may run into deadloop when trying to drain the remaining data exceeding buflen. This patch checks the return value of sock_recvmsg when draining data in case of deadloop in draining. Fixes: `fb4f79264c` ("net/smc: tolerate future SMCD versions") Signed-off-by: Guangguan Wang <guangguan.wang@linux.alibaba.com> Reviewed-by: Wen Gu <guwen@linux.alibaba.com> Reviewed-by: D. Wythe <alibuda@linux.alibaba.com> Signed-off-by: David S. Miller <davem@davemloft.net>	2024-12-15 12:34:59 +00:00
Guangguan Wang	9ab332deb6	net/smc: check smcd_v2_ext_offset when receiving proposal msg When the server receives a proposal message, the smcd_v2_ext_offset field comes from the remote client and cannot be fully trusted. If the value of smcd_v2_ext_offset exceeds the maximum value, a wrong address may be accessed and a crash may happen. This patch checks the value of smcd_v2_ext_offset before using it. Fixes: `5c21c4ccaf` ("net/smc: determine accepted ISM devices") Signed-off-by: Guangguan Wang <guangguan.wang@linux.alibaba.com> Reviewed-by: Wen Gu <guwen@linux.alibaba.com> Reviewed-by: D. Wythe <alibuda@linux.alibaba.com> Signed-off-by: David S. Miller <davem@davemloft.net>	2024-12-15 12:34:59 +00:00
Guangguan Wang	7863c9f3d2	net/smc: check v2_ext_offset/eid_cnt/ism_gid_cnt when receiving proposal msg When the server receives a proposal message, the fields v2_ext_offset/eid_cnt/ism_gid_cnt come from the remote client and cannot be fully trusted. In particular, if v2_ext_offset exceeds the maximum value, a wrong address may be accessed and a crash may happen. This patch checks the fields v2_ext_offset/eid_cnt/ism_gid_cnt before using them. Fixes: `8c3dca341a` ("net/smc: build and send V2 CLC proposal") Signed-off-by: Guangguan Wang <guangguan.wang@linux.alibaba.com> Reviewed-by: Wen Gu <guwen@linux.alibaba.com> Reviewed-by: D. Wythe <alibuda@linux.alibaba.com> Signed-off-by: David S. Miller <davem@davemloft.net>	2024-12-15 12:34:59 +00:00
Guangguan Wang	a29e220d3c	net/smc: check iparea_offset and ipv6_prefixes_cnt when receiving proposal msg When the server receives a proposal message, the fields iparea_offset and ipv6_prefixes_cnt come from the remote client and cannot be fully trusted. In particular, if iparea_offset exceeds the maximum value, a wrong address may be accessed and a crash may happen. This patch checks iparea_offset and ipv6_prefixes_cnt before using them. Fixes: `e7b7a64a84` ("smc: support variable CLC proposal messages") Signed-off-by: Guangguan Wang <guangguan.wang@linux.alibaba.com> Reviewed-by: Wen Gu <guwen@linux.alibaba.com> Reviewed-by: D. Wythe <alibuda@linux.alibaba.com> Signed-off-by: David S. Miller <davem@davemloft.net>	2024-12-15 12:34:59 +00:00
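The three proposal-message commits above share one idea: an offset or count taken from the peer must be validated against the received length before it is used for pointer arithmetic. A minimal sketch of such a check follows; `area_in_bounds()` and its parameters are hypothetical and do not reflect the actual SMC structures or the limits the patches enforce.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical check: an area of `count` entries, each `entry_size`
 * bytes, located at `base + offset` inside a message of `msg_len`
 * bytes, must lie entirely within the message. Written so that no
 * intermediate addition or multiplication can overflow.
 */
static bool area_in_bounds(size_t msg_len, size_t base, size_t offset,
			   size_t count, size_t entry_size)
{
	size_t start;

	if (offset > msg_len || base > msg_len - offset)
		return false;
	start = base + offset;

	if (entry_size != 0 && count > (msg_len - start) / entry_size)
		return false;
	return true;
}
```

The comparisons are arranged as subtractions and a division so that a hostile `offset` close to `SIZE_MAX` is rejected rather than wrapping around.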
Guangguan Wang	679e9ddcf9	net/smc: check sndbuf_space again after NOSPACE flag is set in smc_poll When an application sends more data than sndbuf_space, there is a chance that the application will sleep in epoll_wait and never be woken up again. This is caused by a race between smc_poll and smc_cdc_tx_handler. application tasklet smc_tx_sendmsg(len > sndbuf_space) \| epoll_wait for EPOLL_OUT,timeout=0 \| smc_poll \| if (!smc->conn.sndbuf_space) \| \| smc_cdc_tx_handler \| atomic_add sndbuf_space \| smc_tx_sndbuf_nonfull \| if (!test_bit SOCK_NOSPACE) \| do not sk_write_space; set_bit SOCK_NOSPACE; \| return mask=0; \| The application will sleep in epoll_wait as smc_poll returns 0, and smc_cdc_tx_handler will not call sk_write_space because SOCK_NOSPACE has not been set. If there is no inflight cdc msg, sk_write_space will not be called any more, and the application will sleep in epoll_wait forever. So check sndbuf_space again after the NOSPACE flag is set to break the race. Fixes: `8dce2786a2` ("net/smc: smc_poll improvements") Signed-off-by: Guangguan Wang <guangguan.wang@linux.alibaba.com> Suggested-by: Paolo Abeni <pabeni@redhat.com> Signed-off-by: David S. Miller <davem@davemloft.net>	2024-12-15 12:34:59 +00:00
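The "set the flag, then check again" pattern from this commit can be sketched with C11 atomics. This is a userspace illustration under stated assumptions, not the SMC code: `struct conn`, `MASK_WRITABLE`, `poll_writable()` and `tx_complete()` are hypothetical names, and sequentially consistent atomics stand in for the kernel's bit operations and barriers.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

#define MASK_WRITABLE 0x4u /* hypothetical poll mask bit */

struct conn {
	atomic_int sndbuf_space; /* free bytes in the send buffer */
	atomic_bool nospace;     /* "a poller is waiting for space" */
};

static void conn_init(struct conn *c, int space)
{
	atomic_init(&c->sndbuf_space, space);
	atomic_init(&c->nospace, false);
}

/* Poller side: returns MASK_WRITABLE if data can be sent, else 0. */
static unsigned int poll_writable(struct conn *c)
{
	if (atomic_load(&c->sndbuf_space) > 0)
		return MASK_WRITABLE;

	atomic_store(&c->nospace, true);

	/*
	 * Re-check after publishing the flag: the completion side may
	 * have added space between the first load and the store above
	 * and skipped the wakeup because the flag was still clear.
	 */
	if (atomic_load(&c->sndbuf_space) > 0)
		return MASK_WRITABLE;
	return 0;
}

/* Completion side: returns true if a waiting poller must be woken. */
static bool tx_complete(struct conn *c, int freed)
{
	atomic_fetch_add(&c->sndbuf_space, freed);
	return atomic_exchange(&c->nospace, false);
}
```

With this ordering, either the poller observes the freed space on its second load or the completion side observes the flag and issues the wakeup, so the poller cannot sleep on a mask of 0 with no pending notification.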
Guangguan Wang	2b33eb8f1b	net/smc: protect link down work from execute after lgr freed Link down work may be scheduled before the lgr is freed but execute after it is freed, which may result in a crash. So a reference needs to be held before scheduling link down work, and put after the work has executed or been canceled. The relevant crash call stack is as follows: list_del corruption. prev->next should be ffffb638c9c0fe20, but was 0000000000000000 ------------[ cut here ]------------ kernel BUG at lib/list_debug.c:51! invalid opcode: 0000 [#1] SMP NOPTI CPU: 6 PID: 978112 Comm: kworker/6:119 Kdump: loaded Tainted: G #1 Hardware name: Alibaba Cloud Alibaba Cloud ECS, BIOS 2221b89 04/01/2014 Workqueue: events smc_link_down_work [smc] RIP: 0010:__list_del_entry_valid.cold+0x31/0x47 RSP: 0018:ffffb638c9c0fdd8 EFLAGS: 00010086 RAX: 0000000000000054 RBX: ffff942fb75e5128 RCX: 0000000000000000 RDX: ffff943520930aa0 RSI: ffff94352091fc80 RDI: ffff94352091fc80 RBP: 0000000000000000 R08: 0000000000000000 R09: ffffb638c9c0fc38 R10: ffffb638c9c0fc30 R11: ffffffffa015eb28 R12: 0000000000000002 R13: ffffb638c9c0fe20 R14: 0000000000000001 R15: ffff942f9cd051c0 FS: 0000000000000000(0000) GS:ffff943520900000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 00007f4f25214000 CR3: 000000025fbae004 CR4: 00000000007706e0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 PKRU: 55555554 Call Trace: rwsem_down_write_slowpath+0x17e/0x470 smc_link_down_work+0x3c/0x60 [smc] process_one_work+0x1ac/0x350 worker_thread+0x49/0x2f0 ? rescuer_thread+0x360/0x360 kthread+0x118/0x140 ? __kthread_bind_mask+0x60/0x60 ret_from_fork+0x1f/0x30 Fixes: `541afa10c1` ("net/smc: add smcr_port_err() and smcr_link_down() processing") Signed-off-by: Guangguan Wang <guangguan.wang@linux.alibaba.com> Reviewed-by: Tony Lu <tonylu@linux.alibaba.com> Signed-off-by: David S. Miller <davem@davemloft.net>	2024-12-15 12:34:59 +00:00
Eric Dumazet	429fde2d81	net: tun: fix tun_napi_alloc_frags() syzbot reported the following crash [1] Issue came with the blamed commit. Instead of going through all the iov components, we keep using the first one and end up with a malformed skb. [1] kernel BUG at net/core/skbuff.c:2849 ! Oops: invalid opcode: 0000 [#1] PREEMPT SMP KASAN PTI CPU: 0 UID: 0 PID: 6230 Comm: syz-executor132 Not tainted 6.13.0-rc1-syzkaller-00407-g96b6fcc0ee41 #0 Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 11/25/2024 RIP: 0010:__pskb_pull_tail+0x1568/0x1570 net/core/skbuff.c:2848 Code: 38 c1 0f 8c 32 f1 ff ff 4c 89 f7 e8 92 96 74 f8 e9 25 f1 ff ff e8 e8 ae 09 f8 48 8b 5c 24 08 e9 eb fb ff ff e8 d9 ae 09 f8 90 <0f> 0b 66 0f 1f 44 00 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 RSP: 0018:ffffc90004cbef30 EFLAGS: 00010293 RAX: ffffffff8995c347 RBX: 00000000fffffff2 RCX: ffff88802cf45a00 RDX: 0000000000000000 RSI: 00000000fffffff2 RDI: 0000000000000000 RBP: ffff88807df0c06a R08: ffffffff8995b084 R09: 1ffff1100fbe185c R10: dffffc0000000000 R11: ffffed100fbe185d R12: ffff888076e85d50 R13: ffff888076e85c80 R14: ffff888076e85cf4 R15: ffff888076e85c80 FS: 00007f0dca6ea6c0(0000) GS:ffff8880b8600000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 00007f0dca6ead58 CR3: 00000000119da000 CR4: 00000000003526f0 DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 Call Trace: <TASK> skb_cow_data+0x2da/0xcb0 net/core/skbuff.c:5284 tipc_aead_decrypt net/tipc/crypto.c:894 [inline] tipc_crypto_rcv+0x402/0x24e0 net/tipc/crypto.c:1844 tipc_rcv+0x57e/0x12a0 net/tipc/node.c:2109 tipc_l2_rcv_msg+0x2bd/0x450 net/tipc/bearer.c:668 __netif_receive_skb_list_ptype net/core/dev.c:5720 [inline] __netif_receive_skb_list_core+0x8b7/0x980 net/core/dev.c:5762 __netif_receive_skb_list net/core/dev.c:5814 [inline] netif_receive_skb_list_internal+0xa51/0xe30 net/core/dev.c:5905 gro_normal_list include/net/gro.h:515 [inline] napi_complete_done+0x2b5/0x870 net/core/dev.c:6256 napi_complete include/linux/netdevice.h:567 [inline] tun_get_user+0x2ea0/0x4890 drivers/net/tun.c:1982 tun_chr_write_iter+0x10d/0x1f0 drivers/net/tun.c:2057 do_iter_readv_writev+0x600/0x880 vfs_writev+0x376/0xba0 fs/read_write.c:1050 do_writev+0x1b6/0x360 fs/read_write.c:1096 do_syscall_x64 arch/x86/entry/common.c:52 [inline] do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83 entry_SYSCALL_64_after_hwframe+0x77/0x7f Fixes: `de4f5fed3f` ("iov_iter: add iter_iovec() helper") Reported-by: syzbot+4f66250f6663c0c1d67e@syzkaller.appspotmail.com Closes: https://lore.kernel.org/netdev/675b61aa.050a0220.599f4.00bb.GAE@google.com/T/#u Cc: stable@vger.kernel.org Signed-off-by: Eric Dumazet <edumazet@google.com> Reviewed-by: Joe Damato <jdamato@fastly.com> Reviewed-by: Jens Axboe <axboe@kernel.dk> Acked-by: Willem de Bruijn <willemb@google.com> Acked-by: Michael S. Tsirkin <mst@redhat.com> Link: https://patch.msgid.link/20241212222247.724674-1-edumazet@google.com Signed-off-by: Jakub Kicinski <kuba@kernel.org>	2024-12-13 19:33:45 -08:00
Ville Syrjälä	9398332f23	drm/modes: Avoid divide by zero harder in drm_mode_vrefresh() drm_mode_vrefresh() is trying to avoid divide by zero by checking whether htotal or vtotal are zero. But we may still end up with a div-by-zero of vtotal*htotal*... Cc: stable@vger.kernel.org Reported-by: syzbot+622bba18029bcde672e1@syzkaller.appspotmail.com Closes: https://syzkaller.appspot.com/bug?extid=622bba18029bcde672e1 Signed-off-by: Ville Syrjälä <ville.syrjala@linux.intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241129042629.18280-2-ville.syrjala@linux.intel.com Reviewed-by: Jani Nikula <jani.nikula@intel.com>	2024-12-14 00:05:32 +02:00
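The commit above points out that checking the individual factors is not enough; the guard has to apply to the final denominator. A userspace sketch of that idea follows. `struct mode_timing` and `vrefresh_hz()` are hypothetical simplifications and not the DRM code (interlace and doublescan handling is omitted), and the remark about 32-bit wraparound is an assumption about how a product of non-zero factors could become zero, not something the commit message states.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical, simplified subset of display mode timings. */
struct mode_timing {
	uint32_t clock_khz; /* pixel clock in kHz */
	uint16_t htotal;
	uint16_t vtotal;
	uint16_t vscan;     /* values <= 1 mean "no extra scan factor" */
};

/* Returns the refresh rate in Hz, or 0 if it cannot be computed. */
static unsigned int vrefresh_hz(const struct mode_timing *m)
{
	uint64_t num = (uint64_t)m->clock_khz * 1000;
	/* 64-bit math: three 16-bit factors cannot wrap here. */
	uint64_t den = (uint64_t)m->htotal * m->vtotal;

	if (m->vscan > 1)
		den *= m->vscan;

	/* Guard the final denominator, not just the individual factors. */
	if (den == 0)
		return 0;

	return (unsigned int)((num + den / 2) / den); /* round to closest */
}
```

For a 148500 kHz clock with htotal 2200 and vtotal 1125 this yields 60. In 32-bit arithmetic, by contrast, 0x8000 * 0x8000 * 4 wraps to 0 even though every factor is non-zero.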
Krzysztof Karas	080b2e7b5e	drm/display: use ERR_PTR on DP tunnel manager creation fail Instead of returning a generic NULL on error from drm_dp_tunnel_mgr_create(), use error pointers with informative codes to align the function with stub that is executed when CONFIG_DRM_DISPLAY_DP_TUNNEL is unset. This will also trigger IS_ERR() in current caller (intel_dp_tunnel_mgr_init()) instead of bypassing it via NULL pointer. v2: use error codes inside drm_dp_tunnel_mgr_create() instead of handling on caller's side (Michal, Imre) v3: fixup commit message and add "CC"/"Fixes" lines (Andi), mention aligning function code with stub Fixes: `91888b5b1a` ("drm/i915/dp: Add support for DP tunnel BW allocation") Cc: Imre Deak <imre.deak@intel.com> Cc: <stable@vger.kernel.org> # v6.9+ Signed-off-by: Krzysztof Karas <krzysztof.karas@intel.com> Reviewed-by: Andi Shyti <andi.shyti@linux.intel.com> Signed-off-by: Imre Deak <imre.deak@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/7q4fpnmmztmchczjewgm6igy55qt6jsm7tfd4fl4ucfq6yg2oy@q4lxtsu6445c	2024-12-13 18:57:34 +02:00
Arnd Bergmann	8b55f88189	media: mediatek: vcodec: mark vdec_vp9_slice_map_counts_eob_coef noinline With KASAN enabled, clang fails to optimize the inline version of vdec_vp9_slice_map_counts_eob_coef() properly, leading to kilobytes of temporary values spilled to the stack: drivers/media/platform/mediatek/vcodec/decoder/vdec/vdec_vp9_req_lat_if.c:1526:12: error: stack frame size (2160) exceeds limit (2048) in 'vdec_vp9_slice_update_prob' [-Werror,-Wframe-larger-than] This seems to affect all versions of clang including the latest (clang-20), but the degree of stack overhead is different per release. Marking the function as noinline_for_stack is harmless here and avoids the problem completely. Signed-off-by: Arnd Bergmann <arnd@arndb.de> Reviewed-by: Nathan Chancellor <nathan@kernel.org> Signed-off-by: Sebastian Fricke <sebastian.fricke@collabora.com> Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org>	2024-12-13 17:51:35 +01:00
Arnd Bergmann	f578281000	Merge tag 'ffa-fix-6.13' of https://git.kernel.org/pub/scm/linux/kernel/git/sudeep.holla/linux into arm/fixes Arm FF-A fix for v6.13 A single fix to address a possible race around setting ffa_dev->properties in ffa_device_register() by updating ffa_device_register() to take all the partition information received from the firmware and updating the struct ffa_device accordingly before registering the device to the bus/driver model in the kernel. * tag 'ffa-fix-6.13' of https://git.kernel.org/pub/scm/linux/kernel/git/sudeep.holla/linux: firmware: arm_ffa: Fix the race around setting ffa_dev->properties Link: https://lore.kernel.org/r/20241210101113.3232602-1-sudeep.holla@arm.com Signed-off-by: Arnd Bergmann <arnd@arndb.de>	2024-12-13 14:26:32 +01:00
Michael Trimarchi	d2bd3fcb82	drm/panel: synaptics-r63353: Fix regulator unbalance The shutdown function can be called when the display is already unprepared. For example, during reboot this triggers a kernel warning. Calling drm_panel_unprepare() instead allows us to avoid triggering the kernel warning. Fixes: `2e87bad7cd` ("drm/panel: Add Synaptics R63353 panel driver") Tested-by: Dario Binacchi <dario.binacchi@amarulasolutions.com> Signed-off-by: Michael Trimarchi <michael@amarulasolutions.com> Signed-off-by: Dario Binacchi <dario.binacchi@amarulasolutions.com> Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org> Reviewed-by: Jessica Zhang <quic_jesszhan@quicinc.com> Link: https://lore.kernel.org/r/20241205163002.1804784-1-dario.binacchi@amarulasolutions.com Signed-off-by: Neil Armstrong <neil.armstrong@linaro.org> Link: https://patchwork.freedesktop.org/patch/msgid/20241205163002.1804784-1-dario.binacchi@amarulasolutions.com	2024-12-13 10:52:07 +01:00
Marek Vasut	406dd4c798	drm/panel: st7701: Add prepare_prev_first flag to drm_panel The DSI host must be enabled for the panel to be initialized in prepare(). Set the prepare_prev_first flag to guarantee this. This fixes the panel operation on NXP i.MX8MP SoC / Samsung DSIM DSI host. Fixes: `849b2e3ff9` ("drm/panel: Add Sitronix ST7701 panel driver") Signed-off-by: Marek Vasut <marex@denx.de> Reviewed-by: Jessica Zhang <quic_jesszhan@quicinc.com> Link: https://lore.kernel.org/r/20241124224812.150263-1-marex@denx.de Signed-off-by: Neil Armstrong <neil.armstrong@linaro.org> Link: https://patchwork.freedesktop.org/patch/msgid/20241124224812.150263-1-marex@denx.de	2024-12-13 10:51:56 +01:00
Yang Yingliang	f8fd0968ef	drm/panel: novatek-nt35950: fix return value check in nt35950_probe() mipi_dsi_device_register_full() never returns NULL pointer, it will return ERR_PTR() when it fails, so replace the check with IS_ERR(). Fixes: `623a3531e9` ("drm/panel: Add driver for Novatek NT35950 DSI DriverIC panels") Signed-off-by: Yang Yingliang <yangyingliang@huawei.com> Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org> Link: https://lore.kernel.org/r/20241029123957.1588-1-yangyingliang@huaweicloud.com Signed-off-by: Neil Armstrong <neil.armstrong@linaro.org> Link: https://patchwork.freedesktop.org/patch/msgid/20241029123957.1588-1-yangyingliang@huaweicloud.com	2024-12-13 10:51:39 +01:00
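Why a NULL check misses an error pointer can be shown with a userspace re-creation of the idea behind the kernel's `ERR_PTR()`/`IS_ERR()` helpers. The lowercase names `err_ptr()`, `ptr_err()`, `is_err()` and `register_device()` below are stand-ins written for this sketch, not the kernel functions themselves; only the encoding (a negative errno stored in the top 4095 values of the address space) mirrors the kernel convention.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define MAX_ERRNO 4095

/* Encode a negative errno value as a pointer. */
static inline void *err_ptr(long error)
{
	return (void *)(intptr_t)error;
}

/* Recover the errno value from an error pointer. */
static inline long ptr_err(const void *ptr)
{
	return (long)(intptr_t)ptr;
}

/* True if the pointer lies in the reserved error range. */
static inline bool is_err(const void *ptr)
{
	return (uintptr_t)ptr >= (uintptr_t)-MAX_ERRNO;
}

/* Hypothetical registration call: fails with an error pointer, never NULL. */
static void *register_device(bool fail)
{
	static int device;

	return fail ? err_ptr(-ENOMEM) : (void *)&device;
}
```

A caller that tests `if (!dev)` treats the failed `register_device(true)` result as valid, because the error pointer is non-NULL; testing `is_err(dev)` catches it and `ptr_err(dev)` recovers `-ENOMEM`.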
Zhang Zekun	e1e1af9148	drm/panel: himax-hx83102: Add a check to prevent NULL pointer dereference drm_mode_duplicate() could return NULL due to lack of memory, which would then cause a NULL pointer dereference. Add a check to prevent it. Fixes: `0ef94554dc` ("drm/panel: himax-hx83102: Break out as separate driver") Signed-off-by: Zhang Zekun <zhangzekun11@huawei.com> Reviewed-by: Neil Armstrong <neil.armstrong@linaro.org> Link: https://lore.kernel.org/r/20241025073408.27481-3-zhangzekun11@huawei.com Signed-off-by: Neil Armstrong <neil.armstrong@linaro.org> Link: https://patchwork.freedesktop.org/patch/msgid/20241025073408.27481-3-zhangzekun11@huawei.com	2024-12-13 10:51:24 +01:00
Juergen Gross	a2796dff62	x86/xen: don't do PV iret hypercall through hypercall page Instead of jumping to the Xen hypercall page for doing the iret hypercall, directly code the required sequence in xen-asm.S. This is done in preparation of no longer using hypercall page at all, as it has shown to cause problems with speculation mitigations. This is part of XSA-466 / CVE-2024-53241. Reported-by: Andrew Cooper <andrew.cooper3@citrix.com> Signed-off-by: Juergen Gross <jgross@suse.com> Reviewed-by: Jan Beulich <jbeulich@suse.com>	2024-12-13 09:28:43 +01:00
Juergen Gross	0ef8047b73	x86/static-call: provide a way to do very early static-call updates Add static_call_update_early() for updating static-call targets in very early boot. This will be needed for support of Xen guest type specific hypercall functions. This is part of XSA-466 / CVE-2024-53241. Reported-by: Andrew Cooper <andrew.cooper3@citrix.com> Signed-off-by: Juergen Gross <jgross@suse.com> Co-developed-by: Peter Zijlstra <peterz@infradead.org> Co-developed-by: Josh Poimboeuf <jpoimboe@redhat.com>	2024-12-13 09:28:32 +01:00
Juergen Gross	dda014ba59	objtool/x86: allow syscall instruction The syscall instruction is used in Xen PV mode for doing hypercalls. Allow syscall to be used in the kernel in case it is tagged with an unwind hint for objtool. This is part of XSA-466 / CVE-2024-53241. Reported-by: Andrew Cooper <andrew.cooper3@citrix.com> Signed-off-by: Juergen Gross <jgross@suse.com> Co-developed-by: Peter Zijlstra <peterz@infradead.org>	2024-12-13 09:28:21 +01:00
Juergen Gross	efbcd61d9b	x86: make get_cpu_vendor() accessible from Xen code In order to be able to differentiate between AMD and Intel based systems for very early hypercalls without having to rely on the Xen hypercall page, make get_cpu_vendor() non-static. Refactor early_cpu_init() for the same reason by splitting out the loop initializing cpu_devs() into an externally callable function. This is part of XSA-466 / CVE-2024-53241. Reported-by: Andrew Cooper <andrew.cooper3@citrix.com> Signed-off-by: Juergen Gross <jgross@suse.com>	2024-12-13 09:28:10 +01:00
Juergen Gross	f9244fb55f	xen/netfront: fix crash when removing device When removing a netfront device directly after a suspend/resume cycle it might happen that the queues have not been setup again, causing a crash during the attempt to stop the queues another time. Fix that by checking the queues are existing before trying to stop them. This is XSA-465 / CVE-2024-53240. Reported-by: Marek Marczykowski-Górecki <marmarek@invisiblethingslab.com> Fixes: `d50b7914fa` ("xen-netfront: Fix NULL sring after live migration") Signed-off-by: Juergen Gross <jgross@suse.com>	2024-12-13 09:12:24 +01:00
Gao Xiang	e2de3c1bf6	erofs: add erofs_sb_free() helper Unify the common parts of erofs_fc_free() and erofs_kill_sb() as erofs_sb_free(). Thus, fput() in erofs_fc_get_tree() is no longer needed, too. Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com> Link: https://lore.kernel.org/r/20241212133504.2047178-1-hsiangkao@linux.alibaba.com	2024-12-13 00:26:27 +08:00
Yue Hu	6d1917045e	MAINTAINERS: erofs: update Yue Hu's email address The current email address is no longer valid, use my gmail instead. Signed-off-by: Yue Hu <zbestahu@gmail.com> Acked-by: Gao Xiang <xiang@kernel.org> Acked-by: Chao Yu <chao@kernel.org> Link: https://lore.kernel.org/r/20241211080918.8512-1-zbestahu@163.com Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com>	2024-12-13 00:25:28 +08:00
Gao Xiang	1a2180f685	erofs: fix PSI memstall accounting Max Kellermann recently reported psi_group_cpu.tasks[NR_MEMSTALL] is incorrect in the 6.11.9 kernel. The root cause appears to be that, since the problematic commit, bio can be NULL, causing psi_memstall_leave() to be skipped in z_erofs_submit_queue(). Reported-by: Max Kellermann <max.kellermann@ionos.com> Closes: https://lore.kernel.org/r/CAKPOu+8tvSowiJADW2RuKyofL_CSkm_SuyZA7ME5vMLWmL6pqw@mail.gmail.com Fixes: `9e2f9d34dd` ("erofs: handle overlapped pclusters out of crafted images properly") Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com> Link: https://lore.kernel.org/r/20241127085236.3538334-1-hsiangkao@linux.alibaba.com	2024-12-13 00:24:40 +08:00
Gao Xiang	b10a1e5643	erofs: fix rare pcluster memory leak after unmounting There may still exist some pcluster with valid reference counts during unmounting. Instead of introducing another synchronization primitive, just try again as unmounting is relatively rare. This approach is similar to z_erofs_cache_invalidate_folio(). It was also reported by syzbot as a UAF due to commit `f5ad9f9a60` ("erofs: free pclusters if no cached folio is attached"): BUG: KASAN: slab-use-after-free in do_raw_spin_trylock+0x72/0x1f0 kernel/locking/spinlock_debug.c:123 .. queued_spin_trylock include/asm-generic/qspinlock.h:92 [inline] do_raw_spin_trylock+0x72/0x1f0 kernel/locking/spinlock_debug.c:123 __raw_spin_trylock include/linux/spinlock_api_smp.h:89 [inline] _raw_spin_trylock+0x20/0x80 kernel/locking/spinlock.c:138 spin_trylock include/linux/spinlock.h:361 [inline] z_erofs_put_pcluster fs/erofs/zdata.c:959 [inline] z_erofs_decompress_pcluster fs/erofs/zdata.c:1403 [inline] z_erofs_decompress_queue+0x3798/0x3ef0 fs/erofs/zdata.c:1425 z_erofs_decompressqueue_work+0x99/0xe0 fs/erofs/zdata.c:1437 process_one_work kernel/workqueue.c:3229 [inline] process_scheduled_works+0xa68/0x1840 kernel/workqueue.c:3310 worker_thread+0x870/0xd30 kernel/workqueue.c:3391 kthread+0x2f2/0x390 kernel/kthread.c:389 ret_from_fork+0x4d/0x80 arch/x86/kernel/process.c:147 ret_from_fork_asm+0x1a/0x30 arch/x86/entry/entry_64.S:244 </TASK> However, it seems a long outstanding memory leak. Fix it now. Fixes: `f5ad9f9a60` ("erofs: free pclusters if no cached folio is attached") Reported-by: syzbot+7ff87b095e7ca0c5ac39@syzkaller.appspotmail.com Closes: https://lore.kernel.org/r/674c1235.050a0220.ad585.0032.GAE@google.com Reviewed-by: Chao Yu <chao@kernel.org> Signed-off-by: Gao Xiang <hsiangkao@linux.alibaba.com> Link: https://lore.kernel.org/r/20241203072821.1885740-1-hsiangkao@linux.alibaba.com	2024-12-13 00:24:12 +08:00
T.J. Mercier	0cff90dec6	dma-buf: Fix __dma_buf_debugfs_list_del argument for !CONFIG_DEBUG_FS The arguments for __dma_buf_debugfs_list_del do not match for both the CONFIG_DEBUG_FS case and the !CONFIG_DEBUG_FS case. The !CONFIG_DEBUG_FS case should take a struct dma_buf *, but it's currently struct file *. This can lead to the build error: error: passing argument 1 of ‘__dma_buf_debugfs_list_del’ from incompatible pointer type [-Werror=incompatible-pointer-types] dma-buf.c:63:53: note: expected ‘struct file *’ but argument is of type ‘struct dma_buf *’ 63 \| static void __dma_buf_debugfs_list_del(struct file *file) Fixes: `bfc7bc5393` ("dma-buf: Do not build debugfs related code when !CONFIG_DEBUG_FS") Signed-off-by: T.J. Mercier <tjmercier@google.com> Reviewed-by: Tvrtko Ursulin <tvrtko.ursulin@igalia.com> Signed-off-by: Sumit Semwal <sumit.semwal@linaro.org> Link: https://patchwork.freedesktop.org/patch/msgid/20241117170326.1971113-1-tjmercier@google.com	2024-12-12 18:53:45 +05:30
Jann Horn	f49856f525	udmabuf: fix memory leak on last export_udmabuf() error path In export_udmabuf(), if dma_buf_fd() fails because the FD table is full, a dma_buf owning the udmabuf has already been created; but the error handling in udmabuf_create() will tear down the udmabuf without doing anything about the containing dma_buf. This leaves a dma_buf in memory that contains a dangling pointer; though that doesn't seem to lead to anything bad except a memory leak. Fix it by moving the dma_buf_fd() call out of export_udmabuf() so that we can give it different error handling. Note that the shape of this code changed a lot in commit `5e72b2b41a` ("udmabuf: convert udmabuf driver to use folios"); but the memory leak seems to have existed since the introduction of udmabuf. Fixes: `fbb0de7950` ("Add udmabuf misc device") Acked-by: Vivek Kasireddy <vivek.kasireddy@intel.com> Signed-off-by: Jann Horn <jannh@google.com> Signed-off-by: Vivek Kasireddy <vivek.kasireddy@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241204-udmabuf-fixes-v2-3-23887289de1c@google.com	2024-12-11 16:47:41 -08:00
Jann Horn	0a16e24e34	udmabuf: also check for F_SEAL_FUTURE_WRITE When F_SEAL_FUTURE_WRITE was introduced, it was overlooked that udmabuf must reject memfds with this flag, just like ones with F_SEAL_WRITE. Fix it by adding F_SEAL_FUTURE_WRITE to SEALS_DENIED. Fixes: `ab3948f58f` ("mm/memfd: add an F_SEAL_FUTURE_WRITE seal to memfd") Cc: stable@vger.kernel.org Acked-by: Vivek Kasireddy <vivek.kasireddy@intel.com> Signed-off-by: Jann Horn <jannh@google.com> Reviewed-by: Joel Fernandes (Google) <joel@joelfernandes.org> Signed-off-by: Vivek Kasireddy <vivek.kasireddy@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241204-udmabuf-fixes-v2-2-23887289de1c@google.com	2024-12-11 16:47:40 -08:00
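The seal check described in this commit amounts to a bitmask test. The sketch below mirrors the idea in userspace: the `SEAL_*` values match the Linux UAPI `F_SEAL_*` constants but are redefined locally so the example stands alone, `seals_acceptable()` is a hypothetical helper, and treating `F_SEAL_SHRINK` as the required seal is an assumption about udmabuf's policy rather than something stated in the commit message.

```c
#include <assert.h>
#include <stdbool.h>

/* Local copies of the Linux F_SEAL_* bit values, for a standalone sketch. */
enum {
	SEAL_SEAL         = 0x0001,
	SEAL_SHRINK       = 0x0002,
	SEAL_GROW         = 0x0004,
	SEAL_WRITE        = 0x0008,
	SEAL_FUTURE_WRITE = 0x0010,
};

/* Assumed policy: shrinking must be sealed, any write seal is rejected. */
#define SEALS_WANTED (SEAL_SHRINK)
#define SEALS_DENIED (SEAL_WRITE | SEAL_FUTURE_WRITE)

/* Hypothetical helper: is a memfd with these seals acceptable? */
static bool seals_acceptable(unsigned int seals)
{
	if ((seals & SEALS_WANTED) != SEALS_WANTED)
		return false;
	if (seals & SEALS_DENIED)
		return false;
	return true;
}
```

Before the fix, the denied set in this sketch would contain only `SEAL_WRITE`, so a memfd sealed with `SEAL_FUTURE_WRITE` would pass the check.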
Jann Horn	9cb189a882	udmabuf: fix racy memfd sealing check The current check_memfd_seals() is racy: Since we first do check_memfd_seals() and then udmabuf_pin_folios() without holding any relevant lock across both, F_SEAL_WRITE can be set in between. This is problematic because we can end up holding pins to pages in a write-sealed memfd. Fix it using the inode lock, that's probably the easiest way. In the future, we might want to consider moving this logic into memfd, especially if anyone else wants to use memfd_pin_folios(). Reported-by: Julian Orth <ju.orth@gmail.com> Closes: https://bugzilla.kernel.org/show_bug.cgi?id=219106 Closes: https://lore.kernel.org/r/CAG48ez0w8HrFEZtJkfmkVKFDhE5aP7nz=obrimeTgpD+StkV9w@mail.gmail.com Fixes: `fbb0de7950` ("Add udmabuf misc device") Cc: stable@vger.kernel.org Signed-off-by: Jann Horn <jannh@google.com> Acked-by: Joel Fernandes (Google) <joel@joelfernandes.org> Acked-by: Vivek Kasireddy <vivek.kasireddy@intel.com> Signed-off-by: Vivek Kasireddy <vivek.kasireddy@intel.com> Link: https://patchwork.freedesktop.org/patch/msgid/20241204-udmabuf-fixes-v2-1-23887289de1c@google.com	2024-12-11 16:47:40 -08:00
Alexander Gordeev	5fa49dd8e5	s390/ipl: Fix never less than zero warning DEFINE_IPL_ATTR_STR_RW() macro produces "unsigned 'len' is never less than zero." warning when sys_vmcmd_on_*_store() callbacks are defined. Reported-by: kernel test robot <lkp@intel.com> Closes: https://lore.kernel.org/oe-kbuild-all/202412081614.5uel8F6W-lkp@intel.com/ Fixes: `247576bf62` ("s390/ipl: Do not accept z/VM CP diag X'008' cmds longer than max length") Reviewed-by: Heiko Carstens <hca@linux.ibm.com> Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>	2024-12-11 18:05:49 +01:00
Nikita Zhandarovich	2dd59fe0e1	media: dvb-frontends: dib3000mb: fix uninit-value in dib3000_write_reg Syzbot reports [1] an uninitialized value issue found by KMSAN in dib3000_read_reg(). Local u8 rb[2] is used in i2c_transfer() as a read buffer; in case that call fails, the buffer may end up with some undefined values. Since no elaborate error handling is expected in dib3000_write_reg(), simply zero out rb buffer to mitigate the problem. [1] Syzkaller report dvb-usb: bulk message failed: -22 (6/0) ===================================================== BUG: KMSAN: uninit-value in dib3000mb_attach+0x2d8/0x3c0 drivers/media/dvb-frontends/dib3000mb.c:758 dib3000mb_attach+0x2d8/0x3c0 drivers/media/dvb-frontends/dib3000mb.c:758 dibusb_dib3000mb_frontend_attach+0x155/0x2f0 drivers/media/usb/dvb-usb/dibusb-mb.c:31 dvb_usb_adapter_frontend_init+0xed/0x9a0 drivers/media/usb/dvb-usb/dvb-usb-dvb.c:290 dvb_usb_adapter_init drivers/media/usb/dvb-usb/dvb-usb-init.c:90 [inline] dvb_usb_init drivers/media/usb/dvb-usb/dvb-usb-init.c:186 [inline] dvb_usb_device_init+0x25a8/0x3760 drivers/media/usb/dvb-usb/dvb-usb-init.c:310 dibusb_probe+0x46/0x250 drivers/media/usb/dvb-usb/dibusb-mb.c:110 ... Local variable rb created at: dib3000_read_reg+0x86/0x4e0 drivers/media/dvb-frontends/dib3000mb.c:54 dib3000mb_attach+0x123/0x3c0 drivers/media/dvb-frontends/dib3000mb.c:758 ... Fixes: `74340b0a8b` ("V4L/DVB (4457): Remove dib3000-common-module") Reported-by: syzbot+c88fc0ebe0d5935c70da@syzkaller.appspotmail.com Signed-off-by: Nikita Zhandarovich <n.zhandarovich@fintech.ru> Link: https://lore.kernel.org/r/20240517155800.9881-1-n.zhandarovich@fintech.ru Signed-off-by: Mauro Carvalho Chehab <mchehab@kernel.org>	2024-12-11 17:54:19 +01:00
Xi Pardee	83848e37f6	platform/x86/intel/vsec: Add support for Panther Lake Add Panther Lake PMT telemetry support. Signed-off-by: Xi Pardee <xi.pardee@linux.intel.com> Link: https://lore.kernel.org/r/20241210212646.239211-1-xi.pardee@linux.intel.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-11 16:00:56 +02:00
Jithu Joseph	6c0a473fc5	platform/x86/intel/ifs: Add Clearwater Forest to CPU support list Add Clearwater Forest (INTEL_ATOM_DARKMONT_X) to the x86 match table of Intel In Field Scan (IFS) driver, enabling IFS functionality on this processor. Signed-off-by: Jithu Joseph <jithu.joseph@intel.com> Link: https://lore.kernel.org/r/20241210203152.1136463-1-jithu.joseph@intel.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-11 16:00:36 +02:00
Huy Minh	220326c465	platform/x86: touchscreen_dmi: Add info for SARY Tab 3 tablet There's no info about the OEM behind the tablet, only online stores listing. This tablet uses an Intel Atom x5-Z8300, 4GB of RAM & 64GB of storage. Signed-off-by: Huy Minh <buingoc67@gmail.com> Link: https://lore.kernel.org/r/20241210154500.32124-1-buingoc67@gmail.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-11 15:58:35 +02:00
Daniele Palmas	8366e64a44	USB: serial: option: add Telit FE910C04 rmnet compositions Add the following Telit FE910C04 compositions: 0x10c0: rmnet + tty (AT/NMEA) + tty (AT) + tty (diag) T: Bus=02 Lev=01 Prnt=03 Port=06 Cnt=01 Dev#= 13 Spd=480 MxCh= 0 D: Ver= 2.00 Cls=00(>ifc ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=1bc7 ProdID=10c0 Rev=05.15 S: Manufacturer=Telit Cinterion S: Product=FE910 S: SerialNumber=f71b8b32 C: #Ifs= 4 Cfg#= 1 Atr=e0 MxPwr=500mA I: If#= 0 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=50 Driver=qmi_wwan E: Ad=01(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=81(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=82(I) Atr=03(Int.) MxPS= 8 Ivl=32ms I: If#= 1 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=60 Driver=option E: Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=83(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=84(I) Atr=03(Int.) MxPS= 10 Ivl=32ms I: If#= 2 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=03(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=85(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=86(I) Atr=03(Int.) MxPS= 10 Ivl=32ms I: If#= 3 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=ff Prot=30 Driver=option E: Ad=04(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=87(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms 0x10c4: rmnet + tty (AT) + tty (AT) + tty (diag) T: Bus=02 Lev=01 Prnt=03 Port=06 Cnt=01 Dev#= 14 Spd=480 MxCh= 0 D: Ver= 2.00 Cls=00(>ifc ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=1bc7 ProdID=10c4 Rev=05.15 S: Manufacturer=Telit Cinterion S: Product=FE910 S: SerialNumber=f71b8b32 C: #Ifs= 4 Cfg#= 1 Atr=e0 MxPwr=500mA I: If#= 0 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=50 Driver=qmi_wwan E: Ad=01(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=81(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=82(I) Atr=03(Int.) MxPS= 8 Ivl=32ms I: If#= 1 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=83(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=84(I) Atr=03(Int.) MxPS= 10 Ivl=32ms I: If#= 2 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=03(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=85(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=86(I) Atr=03(Int.) MxPS= 10 Ivl=32ms I: If#= 3 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=ff Prot=30 Driver=option E: Ad=04(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=87(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms 0x10c8: rmnet + tty (AT) + tty (diag) + DPL (data packet logging) + adb T: Bus=02 Lev=01 Prnt=03 Port=06 Cnt=01 Dev#= 17 Spd=480 MxCh= 0 D: Ver= 2.00 Cls=00(>ifc ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=1bc7 ProdID=10c8 Rev=05.15 S: Manufacturer=Telit Cinterion S: Product=FE910 S: SerialNumber=f71b8b32 C: #Ifs= 5 Cfg#= 1 Atr=e0 MxPwr=500mA I: If#= 0 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=50 Driver=qmi_wwan E: Ad=01(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=81(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=82(I) Atr=03(Int.) MxPS= 8 Ivl=32ms I: If#= 1 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=83(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=84(I) Atr=03(Int.) MxPS= 10 Ivl=32ms I: If#= 2 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=ff Prot=30 Driver=option E: Ad=03(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=85(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms I: If#= 3 Alt= 0 #EPs= 1 Cls=ff(vend.) Sub=ff Prot=80 Driver=(none) E: Ad=86(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms I: If#= 4 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=42 Prot=01 Driver=(none) E: Ad=04(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=87(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms Signed-off-by: Daniele Palmas <dnlplm@gmail.com> Cc: stable@vger.kernel.org Signed-off-by: Johan Hovold <johan@kernel.org>	2024-12-11 10:37:28 +01:00
Jack Wu	f07dfa6a1b	USB: serial: option: add MediaTek T7XX compositions Add the MediaTek T7XX compositions: T: Bus=03 Lev=01 Prnt=01 Port=05 Cnt=01 Dev#= 74 Spd=480 MxCh= 0 D: Ver= 2.10 Cls=ef(misc ) Sub=02 Prot=01 MxPS=64 #Cfgs= 1 P: Vendor=0e8d ProdID=7129 Rev= 0.01 S: Manufacturer=MediaTek Inc. S: Product=USB DATA CARD S: SerialNumber=004402459035402 C:* #Ifs=10 Cfg#= 1 Atr=a0 MxPwr=500mA A: FirstIf#= 0 IfCount= 2 Cls=02(comm.) Sub=0e Prot=00 I:* If#= 0 Alt= 0 #EPs= 1 Cls=02(comm.) Sub=0e Prot=00 Driver=cdc_mbim E: Ad=82(I) Atr=03(Int.) MxPS= 64 Ivl=32ms I: If#= 1 Alt= 0 #EPs= 0 Cls=0a(data ) Sub=00 Prot=02 Driver=cdc_mbim I:* If#= 1 Alt= 1 #EPs= 2 Cls=0a(data ) Sub=00 Prot=02 Driver=cdc_mbim E: Ad=81(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=01(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 2 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=83(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 3 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=84(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=03(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 4 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=85(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=04(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 5 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=42 Prot=01 Driver=(none) E: Ad=86(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=05(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 6 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=87(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=06(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 7 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=88(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=07(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 8 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=89(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=08(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 9 Alt= 0 #EPs= 2 Cls=ff(vend.) 
Sub=00 Prot=00 Driver=option E: Ad=8a(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=09(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms ------------------------------- \| If Number \| Function \| ------------------------------- \| 2 \| USB AP Log Port \| ------------------------------- \| 3 \| USB AP GNSS Port\| ------------------------------- \| 4 \| USB AP META Port\| ------------------------------- \| 5 \| ADB port \| ------------------------------- \| 6 \| USB MD AT Port \| ------------------------------ \| 7 \| USB MD META Port\| ------------------------------- \| 8 \| USB NTZ Port \| ------------------------------- \| 9 \| USB Debug port \| ------------------------------- Signed-off-by: Jack Wu <wojackbb@gmail.com> Cc: stable@vger.kernel.org Signed-off-by: Johan Hovold <johan@kernel.org>	2024-12-11 10:26:51 +01:00
Mank Wang	aa954ae082	USB: serial: option: add Netprisma LCUK54 modules for WWAN Ready LCUK54-WRD's pid/vid 0x3731/0x010a 0x3731/0x010c LCUK54-WWD's pid/vid 0x3731/0x010b 0x3731/0x010d Above products use the exact same interface layout and option driver: MBIM + GNSS + DIAG + NMEA + AT + QDSS + DPL T: Bus=01 Lev=01 Prnt=01 Port=01 Cnt=02 Dev#= 5 Spd=480 MxCh= 0 D: Ver= 2.00 Cls=00(>ifc ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=3731 ProdID=0101 Rev= 5.04 S: Manufacturer=NetPrisma S: Product=LCUK54-WRD S: SerialNumber=feeba631 C:* #Ifs= 8 Cfg#= 1 Atr=a0 MxPwr=500mA A: FirstIf#= 0 IfCount= 2 Cls=02(comm.) Sub=0e Prot=00 I:* If#= 0 Alt= 0 #EPs= 1 Cls=02(comm.) Sub=0e Prot=00 Driver=cdc_mbim E: Ad=81(I) Atr=03(Int.) MxPS= 64 Ivl=32ms I: If#= 1 Alt= 0 #EPs= 0 Cls=0a(data ) Sub=00 Prot=02 Driver=cdc_mbim I:* If#= 1 Alt= 1 #EPs= 2 Cls=0a(data ) Sub=00 Prot=02 Driver=cdc_mbim E: Ad=8e(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=0f(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 2 Alt= 0 #EPs= 1 Cls=ff(vend.) Sub=ff Prot=ff Driver=(none) E: Ad=82(I) Atr=03(Int.) MxPS= 64 Ivl=32ms I:* If#= 3 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=ff Prot=30 Driver=option E: Ad=01(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=83(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 4 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=00 Prot=40 Driver=option E: Ad=85(I) Atr=03(Int.) MxPS= 10 Ivl=32ms E: Ad=84(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=02(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 5 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=87(I) Atr=03(Int.) MxPS= 10 Ivl=32ms E: Ad=86(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=03(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 6 Alt= 0 #EPs= 1 Cls=ff(vend.) Sub=ff Prot=70 Driver=(none) E: Ad=88(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 7 Alt= 0 #EPs= 1 Cls=ff(vend.) 
Sub=ff Prot=80 Driver=(none) E: Ad=8f(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms Signed-off-by: Mank Wang <mank.wang@netprisma.com> [ johan: use lower case hex notation ] Cc: stable@vger.kernel.org Signed-off-by: Johan Hovold <johan@kernel.org>	2024-12-11 10:26:18 +01:00
Michal Hrusecky	724d461e44	USB: serial: option: add MeiG Smart SLM770A Update the USB serial option driver to support MeiG Smart SLM770A. ID 2dee:4d57 Marvell Mobile Composite Device Bus T: Bus=02 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#= 2 Spd=480 MxCh= 0 D: Ver= 2.00 Cls=ef(misc ) Sub=02 Prot=01 MxPS=64 #Cfgs= 1 P: Vendor=2dee ProdID=4d57 Rev= 1.00 S: Manufacturer=Marvell S: Product=Mobile Composite Device Bus C:* #Ifs= 6 Cfg#= 1 Atr=c0 MxPwr=500mA A: FirstIf#= 0 IfCount= 2 Cls=e0(wlcon) Sub=01 Prot=03 I:* If#= 0 Alt= 0 #EPs= 1 Cls=e0(wlcon) Sub=01 Prot=03 Driver=rndis_host E: Ad=87(I) Atr=03(Int.) MxPS= 64 Ivl=4096ms I:* If#= 1 Alt= 0 #EPs= 2 Cls=0a(data ) Sub=00 Prot=00 Driver=rndis_host E: Ad=83(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=0c(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 2 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=82(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=0b(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 3 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=88(I) Atr=03(Int.) MxPS= 64 Ivl=4096ms E: Ad=81(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=0a(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 4 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=89(I) Atr=03(Int.) MxPS= 64 Ivl=4096ms E: Ad=86(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=0f(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms I:* If#= 5 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=00 Prot=00 Driver=option E: Ad=85(I) Atr=02(Bulk) MxPS= 512 Ivl=0ms E: Ad=0e(O) Atr=02(Bulk) MxPS= 512 Ivl=0ms Tested successfully connecting to the Internet via rndis interface after dialing via AT commands on If#=3 or If#=4. Not sure of the purpose of the other serial interfaces. Signed-off-by: Michal Hrusecky <michal.hrusecky@turris.com> Cc: stable@vger.kernel.org Signed-off-by: Johan Hovold <johan@kernel.org>	2024-12-11 10:05:43 +01:00
Daniel Swanemar	fdad4fb7c5	USB: serial: option: add TCL IK512 MBIM & ECM Add the following TCL IK512 compositions: 0x0530: Modem + Diag + AT + MBIM T: Bus=04 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#= 3 Spd=10000 MxCh= 0 D: Ver= 3.20 Cls=00(>ifc ) Sub=00 Prot=00 MxPS= 9 #Cfgs= 1 P: Vendor=1bbb ProdID=0530 Rev=05.04 S: Manufacturer=TCL S: Product=TCL 5G USB Dongle S: SerialNumber=3136b91a C: #Ifs= 5 Cfg#= 1 Atr=80 MxPwr=896mA I: If#= 0 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=01(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=81(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=82(I) Atr=03(Int.) MxPS= 10 Ivl=32ms I: If#= 1 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=ff Prot=30 Driver=option E: Ad=02(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=83(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms I: If#= 2 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=03(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=84(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=85(I) Atr=03(Int.) MxPS= 10 Ivl=32ms I: If#= 3 Alt= 0 #EPs= 1 Cls=02(commc) Sub=0e Prot=00 Driver=cdc_mbim E: Ad=86(I) Atr=03(Int.) MxPS= 64 Ivl=32ms I: If#= 4 Alt= 1 #EPs= 2 Cls=0a(data ) Sub=00 Prot=02 Driver=cdc_mbim E: Ad=0f(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=8e(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms 0x0640: ECM + Modem + Diag + AT T: Bus=04 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#= 4 Spd=10000 MxCh= 0 D: Ver= 3.20 Cls=00(>ifc ) Sub=00 Prot=00 MxPS= 9 #Cfgs= 1 P: Vendor=1bbb ProdID=0640 Rev=05.04 S: Manufacturer=TCL S: Product=TCL 5G USB Dongle S: SerialNumber=3136b91a C: #Ifs= 5 Cfg#= 1 Atr=80 MxPwr=896mA I: If#= 0 Alt= 0 #EPs= 1 Cls=02(commc) Sub=06 Prot=00 Driver=cdc_ether E: Ad=81(I) Atr=03(Int.) MxPS= 16 Ivl=32ms I: If#= 1 Alt= 1 #EPs= 2 Cls=0a(data ) Sub=00 Prot=00 Driver=cdc_ether E: Ad=0f(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=8e(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms I: If#= 2 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=01(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=82(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=83(I) Atr=03(Int.) 
MxPS= 10 Ivl=32ms I: If#= 3 Alt= 0 #EPs= 2 Cls=ff(vend.) Sub=ff Prot=30 Driver=option E: Ad=02(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=84(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms I: If#= 4 Alt= 0 #EPs= 3 Cls=ff(vend.) Sub=ff Prot=40 Driver=option E: Ad=03(O) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=85(I) Atr=02(Bulk) MxPS=1024 Ivl=0ms E: Ad=86(I) Atr=03(Int.) MxPS= 10 Ivl=32ms Signed-off-by: Daniel Swanemar <d.swanemar@gmail.com> Cc: stable@vger.kernel.org Signed-off-by: Johan Hovold <johan@kernel.org>	2024-12-11 09:57:34 +01:00
Mario Limonciello	e34f1717ef	thunderbolt: Don't display nvm_version unless upgrade supported The read will never succeed if NVM wasn't initialized due to an unknown format. Add a new callback for visibility to only show when supported. Cc: stable@vger.kernel.org Fixes: `aef9c693e7` ("thunderbolt: Move vendor specific NVM handling into nvm.c") Reported-by: Richard Hughes <hughsient@gmail.com> Closes: https://github.com/fwupd/fwupd/issues/8200 Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> Signed-off-by: Mika Westerberg <mika.westerberg@linux.intel.com>	2024-12-11 09:11:51 +02:00
Danilo Krummrich	2872e21c47	MAINTAINERS: align Danilo's maintainer entries Some entries use my kernel.org address, while others use my Red Hat one. Since this is a bit of an inconvenience for me, align them to all use the same (kernel.org) address. Acked-by: Dave Airlie <airlied@redhat.com> Signed-off-by: Danilo Krummrich <dakr@kernel.org> Link: https://patchwork.freedesktop.org/patch/msgid/20241204152248.8644-1-dakr@kernel.org	2024-12-10 23:56:37 +01:00
Huaisheng Ye	76467a9481	cxl/region: Fix region creation for greater than x2 switches The cxl_port_setup_targets() algorithm fails to identify valid target list ordering in the presence of 4-way and above switches resulting in 'cxl create-region' failures of the form: $ cxl create-region -d decoder0.0 -g 1024 -s 2G -t ram -w 8 -m mem4 mem1 mem6 mem3 mem2 mem5 mem7 mem0 cxl region: create_region: region0: failed to set target7 to mem0 cxl region: cmd_create_region: created 0 regions [kernel debug message] check_last_peer:1213: cxl region0: pci0000:0c:port1: cannot host mem6:decoder7.0 at 2 bus_remove_device:574: bus: 'cxl': remove device region0 QEMU can create this failing topology: ACPI0017:00 [root0] \| HB_0 [port1] / \ RP_0 RP_1 \| \| USP [port2] USP [port3] / / \ \ / / \ \ DSP DSP DSP DSP DSP DSP DSP DSP \| \| \| \| \| \| \| \| mem4 mem6 mem2 mem7 mem1 mem3 mem5 mem0 Pos: 0 2 4 6 1 3 5 7 HB: Host Bridge RP: Root Port USP: Upstream Port DSP: Downstream Port ...with the following command steps: $ qemu-system-x86_64 -machine q35,cxl=on,accel=tcg \ -smp cpus=8 \ -m 8G \ -hda /home/work/vm-images/centos-stream8-02.qcow2 \ -object memory-backend-ram,size=4G,id=m0 \ -object memory-backend-ram,size=4G,id=m1 \ -object memory-backend-ram,size=2G,id=cxl-mem0 \ -object memory-backend-ram,size=2G,id=cxl-mem1 \ -object memory-backend-ram,size=2G,id=cxl-mem2 \ -object memory-backend-ram,size=2G,id=cxl-mem3 \ -object memory-backend-ram,size=2G,id=cxl-mem4 \ -object memory-backend-ram,size=2G,id=cxl-mem5 \ -object memory-backend-ram,size=2G,id=cxl-mem6 \ -object memory-backend-ram,size=2G,id=cxl-mem7 \ -numa node,memdev=m0,cpus=0-3,nodeid=0 \ -numa node,memdev=m1,cpus=4-7,nodeid=1 \ -netdev user,id=net0,hostfwd=tcp::2222-:22 \ -device virtio-net-pci,netdev=net0 \ -device pxb-cxl,bus_nr=12,bus=pcie.0,id=cxl.1 \ -device cxl-rp,port=0,bus=cxl.1,id=root_port0,chassis=0,slot=0 \ -device cxl-rp,port=1,bus=cxl.1,id=root_port1,chassis=0,slot=1 \ -device 
cxl-upstream,bus=root_port0,id=us0 \ -device cxl-downstream,port=0,bus=us0,id=swport0,chassis=0,slot=4 \ -device cxl-type3,bus=swport0,volatile-memdev=cxl-mem0,id=cxl-vmem0 \ -device cxl-downstream,port=1,bus=us0,id=swport1,chassis=0,slot=5 \ -device cxl-type3,bus=swport1,volatile-memdev=cxl-mem1,id=cxl-vmem1 \ -device cxl-downstream,port=2,bus=us0,id=swport2,chassis=0,slot=6 \ -device cxl-type3,bus=swport2,volatile-memdev=cxl-mem2,id=cxl-vmem2 \ -device cxl-downstream,port=3,bus=us0,id=swport3,chassis=0,slot=7 \ -device cxl-type3,bus=swport3,volatile-memdev=cxl-mem3,id=cxl-vmem3 \ -device cxl-upstream,bus=root_port1,id=us1 \ -device cxl-downstream,port=4,bus=us1,id=swport4,chassis=0,slot=8 \ -device cxl-type3,bus=swport4,volatile-memdev=cxl-mem4,id=cxl-vmem4 \ -device cxl-downstream,port=5,bus=us1,id=swport5,chassis=0,slot=9 \ -device cxl-type3,bus=swport5,volatile-memdev=cxl-mem5,id=cxl-vmem5 \ -device cxl-downstream,port=6,bus=us1,id=swport6,chassis=0,slot=10 \ -device cxl-type3,bus=swport6,volatile-memdev=cxl-mem6,id=cxl-vmem6 \ -device cxl-downstream,port=7,bus=us1,id=swport7,chassis=0,slot=11 \ -device cxl-type3,bus=swport7,volatile-memdev=cxl-mem7,id=cxl-vmem7 \ -M cxl-fmw.0.targets.0=cxl.1,cxl-fmw.0.size=32G & In Guest OS: $ cxl create-region -d decoder0.0 -g 1024 -s 2G -t ram -w 8 -m mem4 mem1 mem6 mem3 mem2 mem5 mem7 mem0 Fix the method to calculate @distance by iteratively multiplying the number of targets per switch port. This also follows the algorithm recommended here [1]. 
Fixes: `27b3f8d138` ("cxl/region: Program target lists") Link: http://lore.kernel.org/6538824b52349_7258329466@dwillia2-xfh.jf.intel.com.notmuch [1] Signed-off-by: Huaisheng Ye <huaisheng.ye@intel.com> Tested-by: Li Zhijian <lizhijian@fujitsu.com> [djbw: add a comment explaining 'distance'] Signed-off-by: Dan Williams <dan.j.williams@intel.com> Link: https://patch.msgid.link/173378716722.1270362.9546805175813426729.stgit@dwillia2-xfh.jf.intel.com Signed-off-by: Dave Jiang <dave.jiang@intel.com>	2024-12-10 14:50:34 -07:00
Li Ming	09ceba3a93	cxl/pci: Check dport->regs.rcd_pcie_cap availability before accessing RCD Upstream Port's PCI Express Capability is a component registers block stored in RCD Upstream Port RCRB. CXL PCI driver helps to map it during the RCD probing, but mapping failure is allowed for component registers blocks in CXL PCI driver. dport->regs.rcd_pcie_cap is used to store the virtual address of the RCD Upstream Port's PCI Express Capability. Add a dport->regs.rcd_pcie_cap check in rcd_pcie_cap_emit() in case a user accesses an invalid address via RCD sysfs. Fixes: `c5eaec79fa` ("cxl/pci: Add sysfs attribute for CXL 1.1 device link status") Signed-off-by: Li Ming <ming.li@zohomail.com> Reviewed-by: Alison Schofield <alison.schofield@intel.com> Reviewed-by: Dan Williams <dan.j.williams@intel.com> Link: https://patch.msgid.link/20241129132825.569237-1-ming.li@zohomail.com Signed-off-by: Dave Jiang <dave.jiang@intel.com>	2024-12-10 14:49:14 -07:00
Davidlohr Bueso	da4d8c8335	cxl/pci: Fix potential bogus return value upon successful probing If cxl_pci_ras_unmask() returns non-zero, cxl_pci_probe() will end up returning that value, instead of zero. Fixes: `248529edc8` ("cxl: add RAS status unmasking for CXL") Reviewed-by: Fan Ni <fan.ni@samsung.com> Signed-off-by: Davidlohr Bueso <dave@stgolabs.net> Reviewed-by: Ira Weiny <ira.weiny@intel.com> Link: https://patch.msgid.link/20241115170032.108445-1-dave@stgolabs.net Signed-off-by: Dave Jiang <dave.jiang@intel.com>	2024-12-10 14:30:51 -07:00
Zijun Hu	0f7ca6f693	of/irq: Fix using uninitialized variable @addr_len in API of_irq_parse_one() of_irq_parse_one() may use uninitialized variable @addr_len as shown below: // @addr_len is uninitialized int addr_len; // This operation does not touch @addr_len if it fails. addr = of_get_property(device, "reg", &addr_len); // Use uninitialized @addr_len if the operation fails. if (addr_len > sizeof(addr_buf)) addr_len = sizeof(addr_buf); // Check the operation result here. if (addr) memcpy(addr_buf, addr, addr_len); Fix by initializing @addr_len before the operation. Fixes: `b739dffa5d` ("of/irq: Prevent device address out-of-bounds read in interrupt map walk") Cc: stable@vger.kernel.org Signed-off-by: Zijun Hu <quic_zijuhu@quicinc.com> Link: https://lore.kernel.org/r/20241209-of_irq_fix-v1-4-782f1419c8a1@quicinc.com Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-12-10 10:52:45 -06:00
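The uninitialized-length pattern described in the commit above can be sketched outside the kernel as follows. This is a minimal illustration, not the kernel code: `mock_get_property()` and `copy_reg()` are hypothetical stand-ins, and initializing the length to 0 is an assumption, since the commit message only says the variable is initialized before the lookup.

```c
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical stand-in for of_get_property(): on failure it returns NULL
 * and does not write to *lenp, which is the behaviour the commit describes.
 */
static const void *mock_get_property(const void *prop, int prop_len, int *lenp)
{
	if (!prop)
		return NULL;	/* failure: *lenp is left untouched */
	*lenp = prop_len;
	return prop;
}

/* Copies at most buf_size bytes of the property; returns the bytes copied. */
static int copy_reg(const void *prop, int prop_len, void *buf, int buf_size)
{
	int addr_len = 0;	/* assumed fix: initialize before the lookup */
	const void *addr = mock_get_property(prop, prop_len, &addr_len);

	/* Safe to read addr_len even when the lookup failed. */
	if (addr_len > buf_size)
		addr_len = buf_size;
	if (addr)
		memcpy(buf, addr, (size_t)addr_len);
	return addr_len;
}
```

With the initializer in place, a failed lookup yields a length of 0 rather than whatever happened to be on the stack.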
Zijun Hu	fec3edc47d	of/irq: Fix interrupt-map cell length check in of_irq_parse_imap_parent() On a malformed interrupt-map property which is shorter than expected by 1 cell, we may read bogus data past the end of the property instead of returning an error in of_irq_parse_imap_parent(). Decrement the remaining length when skipping over the interrupt parent phandle cell. Fixes: `935df1bd40` ("of/irq: Factor out parsing of interrupt-map parent phandle+args from of_irq_parse_raw()") Cc: stable@vger.kernel.org Signed-off-by: Zijun Hu <quic_zijuhu@quicinc.com> Link: https://lore.kernel.org/r/20241209-of_irq_fix-v1-1-782f1419c8a1@quicinc.com [rh: reword commit msg] Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-12-10 10:52:45 -06:00
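The length accounting described in the commit above can be illustrated with a simplified parser. This is a sketch under stated assumptions: `parse_parent()` and its flat "one phandle cell followed by `ncells` argument cells" layout are hypothetical and much simpler than the real interrupt-map format.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical, simplified map entry: one parent phandle cell followed by
 * `ncells` argument cells. `len` is the number of cells still available.
 * Returns 0 on success, -1 if the entry is shorter than expected.
 */
static int parse_parent(const uint32_t *imap, int len, int ncells,
			uint32_t *phandle, uint32_t *args)
{
	if (len < 1)
		return -1;
	*phandle = *imap++;
	len--;		/* account for the phandle cell just consumed */

	/* Without the decrement above, a map short by one cell would pass. */
	if (len < ncells)
		return -1;
	for (int i = 0; i < ncells; i++)
		args[i] = imap[i];
	return 0;
}
```

A three-cell map with two argument cells parses cleanly, while the same request against only two remaining cells is rejected instead of reading past the end.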
Zijun Hu	5d009e0240	of: Fix refcount leakage for OF node returned by __of_get_dma_parent() __of_get_dma_parent() returns OF device node @args.np, but the node's refcount is increased twice, by both of_parse_phandle_with_args() and of_node_get(), so causes refcount leakage for the node. Fix by directly returning the node got by of_parse_phandle_with_args(). Fixes: `f83a6e5dea` ("of: address: Add support for the parent DMA bus") Cc: stable@vger.kernel.org Signed-off-by: Zijun Hu <quic_zijuhu@quicinc.com> Link: https://lore.kernel.org/r/20241206-of_core_fix-v1-4-dc28ed56bec3@quicinc.com Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-12-10 10:52:45 -06:00
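The double reference described in the commit above can be modelled with a toy refcount. Everything here is hypothetical (`struct node`, `node_get()`, `node_put()`, `lookup()`); `lookup()` merely plays the role of a helper that already returns its result with a reference held.

```c
#include <stddef.h>

/* Toy node with a plain integer refcount (no atomics, illustration only). */
struct node {
	int refcount;
};

static struct node *node_get(struct node *n)
{
	if (n)
		n->refcount++;
	return n;
}

static void node_put(struct node *n)
{
	if (n)
		n->refcount--;
}

/* Stand-in for a lookup that returns the node with a reference held. */
static struct node *lookup(struct node *n)
{
	return node_get(n);
}

/* Leaky variant: takes a second reference on top of the lookup's one. */
static struct node *get_parent_leaky(struct node *n)
{
	struct node *np = lookup(n);

	return node_get(np);
}

/* Fixed variant: returns the node obtained from the lookup directly. */
static struct node *get_parent_fixed(struct node *n)
{
	return lookup(n);
}
```

After one `node_put()` by the caller, the fixed variant leaves the count where it started, while the leaky variant leaves it one too high.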
Mario Limonciello	2993b29b2a	cpufreq/amd-pstate: Use boost numerator for upper bound of frequencies commit `18d9b52271` ("cpufreq/amd-pstate: Use nominal perf for limits when boost is disabled") introduced different semantics for min/max limits based upon whether the user turned off boost from sysfs. This however is not necessary when the highest perf value is the boost numerator. Suggested-by: Dhananjay Ugwekar <Dhananjay.Ugwekar@amd.com> Reviewed-by: Gautham R. Shenoy <gautham.shenoy@amd.com> Fixes: `18d9b52271` ("cpufreq/amd-pstate: Use nominal perf for limits when boost is disabled") Link: https://lore.kernel.org/r/20241209185248.16301-3-mario.limonciello@amd.com Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>	2024-12-10 10:17:43 -06:00
Mario Limonciello	50a062a762	cpufreq/amd-pstate: Store the boost numerator as highest perf again commit `ad4caad58d` ("cpufreq: amd-pstate: Merge amd_pstate_highest_perf_set() into amd_get_boost_ratio_numerator()") changed the semantics for highest perf and commit `18d9b52271` ("cpufreq/amd-pstate: Use nominal perf for limits when boost is disabled") worked around those semantic changes. This however is a confusing result and furthermore makes it awkward to change frequency limits and boost due to the scaling differences. Restore the boost numerator to highest perf again. Suggested-by: Dhananjay Ugwekar <Dhananjay.Ugwekar@amd.com> Reviewed-by: Gautham R. Shenoy <gautham.shenoy@amd.com> Fixes: `ad4caad58d` ("cpufreq: amd-pstate: Merge amd_pstate_highest_perf_set() into amd_get_boost_ratio_numerator()") Link: https://lore.kernel.org/r/20241209185248.16301-2-mario.limonciello@amd.com Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>	2024-12-10 10:17:43 -06:00
Joe Hattori	f3d87abe11	mmc: mtk-sd: disable wakeup in .remove() and in the error path of .probe() Current implementation leaves pdev->dev as a wakeup source. Add a device_init_wakeup(&pdev->dev, false) call in the .remove() function and in the error path of the .probe() function. Signed-off-by: Joe Hattori <joe@pf.is.s.u-tokyo.ac.jp> Fixes: `527f36f5ef` ("mmc: mediatek: add support for SDIO eint wakup IRQ") Cc: stable@vger.kernel.org Message-ID: <20241203023442.2434018-1-joe@pf.is.s.u-tokyo.ac.jp> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org>	2024-12-10 16:02:34 +01:00
Prathamesh Shete	a56335c85b	mmc: sdhci-tegra: Remove SDHCI_QUIRK_BROKEN_ADMA_ZEROLEN_DESC quirk Value 0 in ADMA length descriptor is interpreted as 65536 on new Tegra chips, remove SDHCI_QUIRK_BROKEN_ADMA_ZEROLEN_DESC quirk to make sure max ADMA2 length is 65536. Fixes: `4346b7c794` ("mmc: tegra: Add Tegra186 support") Cc: stable@vger.kernel.org Signed-off-by: Prathamesh Shete <pshete@nvidia.com> Acked-by: Thierry Reding <treding@nvidia.com> Acked-by: Adrian Hunter <adrian.hunter@intel.com> Message-ID: <20241209101009.22710-1-pshete@nvidia.com> Signed-off-by: Ulf Hansson <ulf.hansson@linaro.org>	2024-12-10 16:00:07 +01:00
Heiko Carstens	41856638e6	s390/mm: Fix DirectMap accounting With uncoupling of physical and virtual address spaces population of the identity mapping was changed to use the type POPULATE_IDENTITY instead of POPULATE_DIRECT. This breaks DirectMap accounting: > cat /proc/meminfo DirectMap4k: 55296 kB DirectMap1M: 18446744073709496320 kB Adjust all locations of update_page_count() in vmem.c to use POPULATE_IDENTITY instead of POPULATE_DIRECT as well. With this accounting is correct again: > cat /proc/meminfo DirectMap4k: 54264 kB DirectMap1M: 8334336 kB Fixes: `c98d2ecae0` ("s390/mm: Uncouple physical vs virtual address spaces") Cc: stable@vger.kernel.org Reviewed-by: Alexander Gordeev <agordeev@linux.ibm.com> Signed-off-by: Heiko Carstens <hca@linux.ibm.com> Signed-off-by: Alexander Gordeev <agordeev@linux.ibm.com>	2024-12-10 15:40:41 +01:00
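The bogus DirectMap1M value in the commit above equals 2^64 - 55296, which is what a 64-bit unsigned counter shows after wrapping below zero. The sketch below reproduces that number. It is one possible reading only: the helper names, the count of 54 unmatched decrements, and the "mappings times 1024 kB" scaling are assumptions, not taken from the commit.

```c
#include <stdint.h>

/* Hypothetical: a counter that is decremented n times from `start`. */
static uint64_t unbalanced_decrements(uint64_t start, uint64_t n)
{
	return start - n;	/* unsigned arithmetic wraps modulo 2^64 */
}

/* Hypothetical: convert a count of 1 MiB mappings to kB for display. */
static uint64_t count_1m_to_kb(uint64_t count)
{
	return count << 10;
}
```

Under these assumptions, `count_1m_to_kb(unbalanced_decrements(0, 54))` evaluates to 18446744073709496320, the figure quoted in the commit message.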
Shin'ichiro Kawasaki	360c400d0f	p2sb: Do not scan and remove the P2SB device when it is unhidden When drivers access P2SB device resources, they call p2sb_bar(). Before the commit `5913320eb0` ("platform/x86: p2sb: Allow p2sb_bar() calls during PCI device probe"), p2sb_bar() obtained the resources and then called pci_stop_and_remove_bus_device() for clean up. Then the P2SB device disappeared. The commit `5913320eb0` introduced the P2SB device resource cache feature in the boot process. During the resource cache, pci_stop_and_remove_bus_device() is called for the P2SB device, then the P2SB device disappears regardless of whether p2sb_bar() is called or not. Such P2SB device disappearance caused confusion [1]. To avoid the confusion, avoid the pci_stop_and_remove_bus_device() call when the BIOS does not hide the P2SB device. For that purpose, cache the P2SB device resources only if the BIOS hides the P2SB device. Call p2sb_scan_and_cache() only if p2sb_hidden_by_bios is true. This allows removing two branches from p2sb_scan_and_cache(). When p2sb_bar() is called, get the resources from the cache if the P2SB device is hidden. Otherwise, read the resources from the unhidden P2SB device. Reported-by: Daniel Walker (danielwa) <danielwa@cisco.com> Closes: https://lore.kernel.org/lkml/ZzTI+biIUTvFT6NC@goliath/ [1] Fixes: `5913320eb0` ("platform/x86: p2sb: Allow p2sb_bar() calls during PCI device probe") Signed-off-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com> Reviewed-by: Hans de Goede <hdegoede@redhat.com> Link: https://lore.kernel.org/r/20241128002836.373745-5-shinichiro.kawasaki@wdc.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-10 16:24:51 +02:00
Shin'ichiro Kawasaki	0286070c74	p2sb: Move P2SB hide and unhide code to p2sb_scan_and_cache() To prepare for the following fix, move the code to hide and unhide the P2SB device from p2sb_cache_resources() to p2sb_scan_and_cache(). Signed-off-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com> Reviewed-by: Hans de Goede <hdegoede@redhat.com> Link: https://lore.kernel.org/r/20241128002836.373745-4-shinichiro.kawasaki@wdc.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-10 16:24:49 +02:00
Shin'ichiro Kawasaki	ae3e6ebc5a	p2sb: Introduce the global flag p2sb_hidden_by_bios To prepare for the following fix, introduce the global flag p2sb_hidden_by_bios. Check if the BIOS hides the P2SB device and store the result in the flag. This allows the check result to be referenced across functions. Signed-off-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com> Reviewed-by: Hans de Goede <hdegoede@redhat.com> Link: https://lore.kernel.org/r/20241128002836.373745-3-shinichiro.kawasaki@wdc.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-10 16:24:48 +02:00
Shin'ichiro Kawasaki	9244524d60	p2sb: Factor out p2sb_read_from_cache() To prepare for the following fix, factor out the code to read the P2SB resource from the cache to the new function p2sb_read_from_cache(). Signed-off-by: Shin'ichiro Kawasaki <shinichiro.kawasaki@wdc.com> Reviewed-by: Hans de Goede <hdegoede@redhat.com> Link: https://lore.kernel.org/r/20241128002836.373745-2-shinichiro.kawasaki@wdc.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-10 16:24:45 +02:00
Kurt Borja	54a8cada2f	alienware-wmi: Adds support to Alienware m16 R1 AMD Adds support to Alienware m16 R1 AMD. Tested-by: Cihan Ozakca <cozakca@outlook.com> Signed-off-by: Kurt Borja <kuurtb@gmail.com> Reviewed-by: Armin Wolf <W_Armin@gmx.de> Link: https://lore.kernel.org/r/20241208003013.6490-3-kuurtb@gmail.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-10 16:22:55 +02:00
Kurt Borja	c1043cdb01	alienware-wmi: Fix X Series and G Series quirks Devices that are known to support the WMI thermal interface do not support the legacy LED control interface. Make `.num_zones = 0` and avoid calling alienware_zone_init() if that's the case. Fixes: `9f6c430415` ("alienware-wmi: added platform profile support") Fixes: `1c1eb70e7d` ("alienware-wmi: extends the list of supported models") Suggested-by: Armin Wolf <W_Armin@gmx.de> Reviewed-by: Armin Wolf <W_Armin@gmx.de> Signed-off-by: Kurt Borja <kuurtb@gmail.com> Link: https://lore.kernel.org/r/20241208002652.5885-4-kuurtb@gmail.com Reviewed-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@linux.intel.com>	2024-12-10 16:22:52 +02:00
Arnd Bergmann	efb113fc30	drm: rework FB_CORE dependency The 'select FB_CORE' statement moved from CONFIG_DRM to DRM_CLIENT_LIB, but there are now configurations that have code calling into fb_core as built-in even though the client_lib itself is a loadable module: x86_64-linux-ld: drivers/gpu/drm/drm_fbdev_shmem.o: in function `drm_fbdev_shmem_driver_fbdev_probe': drm_fbdev_shmem.c:(.text+0x1fc): undefined reference to `fb_deferred_io_init' x86_64-linux-ld: drivers/gpu/drm/drm_fbdev_shmem.o: in function `drm_fbdev_shmem_fb_destroy': drm_fbdev_shmem.c:(.text+0x2e1): undefined reference to `fb_deferred_io_cleanup' In addition to DRM_CLIENT_LIB, the 'select' needs to be at least in two more parts, DRM_KMS_HELPER and DRM_GEM_SHMEM_HELPER, so add those here. v3: - Remove FB_CORE from DRM_KMS_HELPER to avoid circular dependency Fixes: `dadd28d414` ("drm/client: Add client-lib module") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Reviewed-by: Thomas Zimmermann <tzimmermann@suse.de> Signed-off-by: Thomas Zimmermann <tzimmermann@suse.de> Link: https://patchwork.freedesktop.org/patch/msgid/20241115162323.3555229-1-arnd@kernel.org	2024-12-10 11:11:39 +01:00
Mika Westerberg	8644b48714	thunderbolt: Add support for Intel Panther Lake-M/P Intel Panther Lake-M/P has the same integrated Thunderbolt/USB4 controller as Lunar Lake. Add these PCI IDs to the driver list of supported devices. Cc: stable@vger.kernel.org Signed-off-by: Mika Westerberg <mika.westerberg@linux.intel.com>	2024-12-10 08:02:17 +02:00
K Prateek Nayak	919bfa9b2d	cpufreq/amd-pstate: Detect preferred core support before driver registration Booting with amd-pstate on 3rd Generation EPYC system incorrectly enabled ITMT support despite the system not supporting Preferred Core ranking. amd_pstate_init_prefcore() called during amd_pstate_cpu_init() requires "amd_pstate_prefcore" to be set correctly however the preferred core support is detected only after driver registration which is too late. Swap the function calls around to detect preferred core support before registering the driver via amd_pstate_register_driver(). This ensures amd_pstate_cpu_init() sees the correct value of "amd_pstate_prefcore" considering the platform support. Fixes: `279f838a61` ("x86/amd: Detect preferred cores in amd_get_boost_ratio_numerator()") Fixes: `ff2653ded4` ("cpufreq/amd-pstate: Move registration after static function call update") Signed-off-by: K Prateek Nayak <kprateek.nayak@amd.com> Acked-by: Mario Limonciello <mario.limonciello@amd.com> Link: https://lore.kernel.org/r/20241210032557.754-1-kprateek.nayak@amd.com Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>	2024-12-09 21:57:34 -06:00
Thomas Weißschuh	c28dc9fc24	power: supply: cros_charge-control: hide start threshold on v2 cmd ECs implementing the v2 command will not stop charging when the end threshold is reached. Instead they will begin discharging until the start threshold is reached, leading to permanent charge and discharge cycles. This defeats the point of the charge control mechanism. Avoid the issue by hiding the start threshold on v2 systems. Instead on those systems program the EC with start == end which forces the EC to reach and stay at that level. v1 does not support thresholds and v3 works correctly, at least judging from the code. Reported-by: Thomas Koch <linrunner@gmx.net> Fixes: `c6ed48ef52` ("power: supply: add ChromeOS EC based charge control driver") Cc: stable@vger.kernel.org Signed-off-by: Thomas Weißschuh <linux@weissschuh.net> Link: https://lore.kernel.org/r/20241208-cros_charge-control-v2-v1-3-8d168d0f08a3@weissschuh.net Signed-off-by: Sebastian Reichel <sebastian.reichel@collabora.com>	2024-12-10 02:51:45 +01:00
Thomas Weißschuh	e65a1b7fad	power: supply: cros_charge-control: allow start_threshold == end_threshold Allow setting the start and stop thresholds to the same value. There is no reason to disallow it. Suggested-by: Thomas Koch <linrunner@gmx.net> Fixes: `c6ed48ef52` ("power: supply: add ChromeOS EC based charge control driver") Cc: stable@vger.kernel.org Signed-off-by: Thomas Weißschuh <linux@weissschuh.net> Link: https://lore.kernel.org/r/20241208-cros_charge-control-v2-v1-2-8d168d0f08a3@weissschuh.net Signed-off-by: Sebastian Reichel <sebastian.reichel@collabora.com>	2024-12-10 02:51:45 +01:00
Thomas Weißschuh	e5f84d1cf5	power: supply: cros_charge-control: add mutex for driver data Concurrent accesses through sysfs may lead to inconsistent state in the priv data. Introduce a mutex to avoid this. Fixes: `c6ed48ef52` ("power: supply: add ChromeOS EC based charge control driver") Cc: stable@vger.kernel.org Signed-off-by: Thomas Weißschuh <linux@weissschuh.net> Link: https://lore.kernel.org/r/20241208-cros_charge-control-v2-v1-1-8d168d0f08a3@weissschuh.net Signed-off-by: Sebastian Reichel <sebastian.reichel@collabora.com>	2024-12-10 02:51:45 +01:00
Dimitri Fedrau	afc6e39e82	power: supply: gpio-charger: Fix set charge current limits Fix setting charge current limits for devices that allow the lowest charge current limit to be greater than zero. If the requested charge current limit is below the lowest limit, the index equals current_limit_map_size, which leads to accessing memory beyond the allocated memory. Fixes: `be2919d835` ("power: supply: gpio-charger: add charge-current-limit feature") Cc: stable@vger.kernel.org Signed-off-by: Dimitri Fedrau <dimitri.fedrau@liebherr.com> Link: https://lore.kernel.org/r/20241209-fix-charge-current-limit-v1-1-760d9b8f2af3@liebherr.com Signed-off-by: Sebastian Reichel <sebastian.reichel@collabora.com>	2024-12-10 02:51:24 +01:00
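The out-of-bounds index described in the commit above can be sketched with a small lookup over a descending limit table. This is an illustration, not the driver code: `struct limit_entry` and `pick_limit()` are hypothetical, and falling back to the last (smallest) entry when nothing matches is an assumed remedy.

```c
#include <stddef.h>

/* Hypothetical table entry: a current limit in microamps and its GPIO value. */
struct limit_entry {
	unsigned int limit_ua;
	unsigned int gpiodata;
};

/*
 * `map` is sorted from the highest limit to the lowest and has n > 0 entries.
 * Returns the index of the first entry whose limit does not exceed the
 * request. If the request is below every entry, the loop ends with i == n;
 * the clamp keeps the index inside the table.
 */
static size_t pick_limit(const struct limit_entry *map, size_t n,
			 unsigned int requested_ua)
{
	size_t i;

	for (i = 0; i < n; i++) {
		if (requested_ua >= map[i].limit_ua)
			break;
	}
	if (i >= n)
		i = n - 1;	/* assumed fallback: smallest supported limit */
	return i;
}
```

When the lowest table entry is 0, the loop always matches, which is why the problem only shows up for tables whose lowest limit is greater than zero.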
Olaf Hering	175c71c2ac	tools/hv: reduce resource usage in hv_kvp_daemon hv_kvp_daemon uses popen(3) and system(3) as convenience helpers to launch external helpers. These helpers are invoked via a temporary shell process. There is no need to keep this temporary process around while the helper runs. Replace this temporary shell with the actual helper process via 'exec'. Signed-off-by: Olaf Hering <olaf@aepfle.de> Link: https://lore.kernel.org/linux-hyperv/20241202123520.27812-1-olaf@aepfle.de/ Signed-off-by: Wei Liu <wei.liu@kernel.org>	2024-12-09 18:44:15 +00:00
Olaf Hering	becc7fe329	tools/hv: add a .gitignore file Remove generated files from 'git status' output after 'make -C tools/hv'. Signed-off-by: Olaf Hering <olaf@aepfle.de> Link: https://lore.kernel.org/r/20241202124107.28650-1-olaf@aepfle.de Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20241202124107.28650-1-olaf@aepfle.de>	2024-12-09 18:44:15 +00:00
Olaf Hering	a4d024fe2e	tools/hv: reduce resource usage in hv_get_dns_info helper Remove the usage of cat. Replace the shell process with awk via 'exec'. Also use a generic shell because no bash specific features will be used. Signed-off-by: Olaf Hering <olaf@aepfle.de> Acked-by: Wei Liu <wei.liu@kernel.org> Link: https://lore.kernel.org/r/20241202120432.21115-1-olaf@aepfle.de Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20241202120432.21115-1-olaf@aepfle.de>	2024-12-09 18:44:15 +00:00
Vitaly Kuznetsov	07dfa6e821	hv/hv_kvp_daemon: Pass NIC name to hv_get_dns_info as well The reference implementation of hv_get_dns_info which is in the tree uses /etc/resolv.conf to get DNS servers and this does not require to know which NIC is queried. Distro specific implementations, however, may want to provide per-NIC, fine grained information. E.g. NetworkManager keeps track of DNS servers per connection. Similar to hv_get_dhcp_info, pass NIC name as a parameter to hv_get_dns_info script. Signed-off-by: Vitaly Kuznetsov <vkuznets@redhat.com> Link: https://lore.kernel.org/r/20241112150401.217094-1-vkuznets@redhat.com Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20241112150401.217094-1-vkuznets@redhat.com>	2024-12-09 18:44:15 +00:00
Michael Kelley	07a756a49f	Drivers: hv: util: Avoid accessing a ringbuffer not initialized yet If the KVP (or VSS) daemon starts before the VMBus channel's ringbuffer is fully initialized, we can hit the panic below: hv_utils: Registering HyperV Utility Driver hv_vmbus: registering driver hv_utils ... BUG: kernel NULL pointer dereference, address: 0000000000000000 CPU: 44 UID: 0 PID: 2552 Comm: hv_kvp_daemon Tainted: G E 6.11.0-rc3+ #1 RIP: 0010:hv_pkt_iter_first+0x12/0xd0 Call Trace: ... vmbus_recvpacket hv_kvp_onchannelcallback vmbus_on_event tasklet_action_common tasklet_action handle_softirqs irq_exit_rcu sysvec_hyperv_stimer0 </IRQ> <TASK> asm_sysvec_hyperv_stimer0 ... kvp_register_done hvt_op_read vfs_read ksys_read __x64_sys_read This can happen because the KVP/VSS channel callback can be invoked even before the channel is fully opened: 1) as soon as hv_kvp_init() -> hvutil_transport_init() creates /dev/vmbus/hv_kvp, the kvp daemon can open the device file immediately and register itself to the driver by writing a message KVP_OP_REGISTER1 to the file (which is handled by kvp_on_msg() ->kvp_handle_handshake()) and reading the file for the driver's response, which is handled by hvt_op_read(), which calls hvt->on_read(), i.e. kvp_register_done(). 2) the problem with kvp_register_done() is that it can cause the channel callback to be called even before the channel is fully opened, and when the channel callback is starting to run, util_probe()-> vmbus_open() may have not initialized the ringbuffer yet, so the callback can hit the panic of NULL pointer dereference. To reproduce the panic consistently, we can add a "ssleep(10)" for KVP in __vmbus_open(), just before the first hv_ringbuffer_init(), and then we unload and reload the driver hv_utils, and run the daemon manually within the 10 seconds. Fix the panic by reordering the steps in util_probe() so the char dev entry used by the KVP or VSS daemon is not created until after vmbus_open() has completed. 
This reordering prevents the race condition from happening. Reported-by: Dexuan Cui <decui@microsoft.com> Fixes: `e0fa3e5e7d` ("Drivers: hv: utils: fix a race on userspace daemons registration") Cc: stable@vger.kernel.org Signed-off-by: Michael Kelley <mhklinux@outlook.com> Acked-by: Wei Liu <wei.liu@kernel.org> Link: https://lore.kernel.org/r/20241106154247.2271-3-mhklinux@outlook.com Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20241106154247.2271-3-mhklinux@outlook.com>	2024-12-09 18:44:15 +00:00
Michael Kelley	96e052d147	Drivers: hv: util: Don't force error code to ENODEV in util_probe() If the util_init function call in util_probe() returns an error code, util_probe() always returns ENODEV, and the error code from the util_init function is lost. The error message output in the caller, vmbus_probe(), doesn't show the real error code. Fix this by just returning the error code from the util_init function. There doesn't seem to be a reason to force ENODEV, as other errors such as ENOMEM can already be returned from util_probe(). And the code in call_driver_probe() implies that ENODEV should mean that a matching driver wasn't found, which is not the case here. Suggested-by: Dexuan Cui <decui@microsoft.com> Signed-off-by: Michael Kelley <mhklinux@outlook.com> Acked-by: Wei Liu <wei.liu@kernel.org> Link: https://lore.kernel.org/r/20241106154247.2271-2-mhklinux@outlook.com Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20241106154247.2271-2-mhklinux@outlook.com>	2024-12-09 18:44:14 +00:00
Olaf Hering	a9640fcdd4	tools/hv: terminate fcopy daemon if read from uio fails Terminate the endless loop if reading fails, to avoid flooding syslog. This happens if the state of the "Guest services" integration service is changed from "enabled" to "disabled" at runtime in the VM settings. In this case pread returns EIO. Also handle an interrupted system call, and continue in this case. Signed-off-by: Olaf Hering <olaf@aepfle.de> Reviewed-by: Saurabh Sengar <ssengar@linux.microsoft.com> Link: https://lore.kernel.org/r/20241105081437.15689-1-olaf@aepfle.de Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20241105081437.15689-1-olaf@aepfle.de>	2024-12-09 18:44:14 +00:00
Easwar Hariharan	67b5e1042d	drivers: hv: Convert open-coded timeouts to secs_to_jiffies() We have several places where timeouts are open-coded as N (seconds) * HZ, but best practice is to use the utility functions from jiffies.h. Convert the timeouts to be compliant. This doesn't fix any bugs, it's a simple code improvement. Signed-off-by: Easwar Hariharan <eahariha@linux.microsoft.com> Reviewed-by: Michael Kelley <mhklinux@outlook.com> Link: https://lore.kernel.org/r/20241030-open-coded-timeouts-v3-2-9ba123facf88@linux.microsoft.com Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20241030-open-coded-timeouts-v3-2-9ba123facf88@linux.microsoft.com>	2024-12-09 18:44:14 +00:00
Olaf Hering	91ae69c7ed	tools: hv: change permissions of NetworkManager configuration file Align permissions of the resulting .nmconnection file, instead of the input file from hv_kvp_daemon. To avoid the tiny time frame where the output file is world-readable, use umask instead of chmod. Fixes: `42999c9046` ("hv/hv_kvp_daemon:Support for keyfile based connection profile") Signed-off-by: Olaf Hering <olaf@aepfle.de> Reviewed-by: Shradha Gupta <shradhagupta@linux.microsoft.com> Link: https://lore.kernel.org/r/20241016143521.3735-1-olaf@aepfle.de Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20241016143521.3735-1-olaf@aepfle.de>	2024-12-09 18:42:52 +00:00
Naman Jain	bcc80dec91	x86/hyperv: Fix hv tsc page based sched_clock for hibernation read_hv_sched_clock_tsc() assumes that the Hyper-V clock counter is bigger than the variable hv_sched_clock_offset, which is cached during early boot, but depending on the timing this assumption may be false when a hibernated VM starts again (the clock counter starts from 0 again) and is resuming back (Note: hv_init_tsc_clocksource() is not called during hibernation/resume); consequently, read_hv_sched_clock_tsc() may return a negative integer (which is interpreted as a huge positive integer since the return type is u64) and new kernel messages are prefixed with huge timestamps before read_hv_sched_clock_tsc() grows big enough (which typically takes several seconds). Fix the issue by saving the Hyper-V clock counter just before the suspend, and using it to correct the hv_sched_clock_offset in resume. This makes hv tsc page based sched_clock continuous and ensures that post resume, it starts from where it left off during suspend. Override x86_platform.save_sched_clock_state and x86_platform.restore_sched_clock_state routines to correct this as soon as possible. Note: if Invariant TSC is available, the issue doesn't happen because 1) we don't register read_hv_sched_clock_tsc() for sched clock: See commit `e5313f1c54` ("clocksource/drivers/hyper-v: Rework clocksource and sched clock setup"); 2) the common x86 code adjusts TSC similarly: see __restore_processor_state() -> tsc_verify_tsc_adjust(true) and x86_platform.restore_sched_clock_state(). 
Cc: stable@vger.kernel.org Fixes: `1349401ff1` ("clocksource/drivers/hyper-v: Suspend/resume Hyper-V clocksource for hibernation") Co-developed-by: Dexuan Cui <decui@microsoft.com> Signed-off-by: Dexuan Cui <decui@microsoft.com> Signed-off-by: Naman Jain <namjain@linux.microsoft.com> Reviewed-by: Michael Kelley <mhklinux@outlook.com> Link: https://lore.kernel.org/r/20240917053917.76787-1-namjain@linux.microsoft.com Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20240917053917.76787-1-namjain@linux.microsoft.com>	2024-12-09 18:42:42 +00:00
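The wraparound and its correction described in `bcc80dec91` can be sketched in user space as follows. This is a simplified model under stated assumptions: the names `sched_clock_offset`, `counter_at_suspend`, `sched_clock_read()`, `sched_clock_save()` and `sched_clock_restore()` are hypothetical, the counter is passed in as a plain argument, and unit conversion is left out.

```c
#include <stdint.h>

/* Hypothetical stand-ins for the cached offset and the value saved at suspend. */
static uint64_t sched_clock_offset;
static uint64_t counter_at_suspend;

/* Unsigned subtraction: wraps to a huge value when counter < offset. */
static uint64_t sched_clock_read(uint64_t counter)
{
    return counter - sched_clock_offset;
}

/* Remember the counter value just before suspend. */
static void sched_clock_save(uint64_t counter)
{
    counter_at_suspend = counter;
}

/* Shift the offset by how far the counter jumped back across hibernation. */
static void sched_clock_restore(uint64_t counter)
{
    sched_clock_offset -= counter_at_suspend - counter;
}
```

With an offset of 1000, a counter of 5000 at suspend and a counter of 10 after resume, the uncorrected read wraps to a value close to 2^64, while the corrected read continues from 4000.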
Dexuan Cui	cb1b78f1c7	tools: hv: Fix a compiler warning in the fcopy uio daemon hv_fcopy_uio_daemon.c:436:53: warning: '%s' directive output may be truncated writing up to 14 bytes into a region of size 10 [-Wformat-truncation=] 436 \| snprintf(uio_dev_path, sizeof(uio_dev_path), "/dev/%s", uio_name); Also added 'static' for the array 'desc[]'. Fixes: `82b0945ce2` ("tools: hv: Add new fcopy application based on uio driver") Cc: stable@vger.kernel.org # 6.10+ Signed-off-by: Dexuan Cui <decui@microsoft.com> Reviewed-by: Saurabh Sengar <ssengar@linux.microsoft.com> Link: https://lore.kernel.org/r/20240910004433.50254-1-decui@microsoft.com Signed-off-by: Wei Liu <wei.liu@kernel.org> Message-ID: <20240910004433.50254-1-decui@microsoft.com>	2024-12-09 18:42:42 +00:00
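The truncation that the warning in `cb1b78f1c7` points at can be shown with a short sketch. It only illustrates detecting truncation through the return value of snprintf(); `build_dev_path()` is a hypothetical helper, and the commit itself may simply have adjusted the buffer size.

```c
#include <stddef.h>
#include <stdio.h>

/*
 * Format "/dev/<name>" into dst. Returns 0 when the whole path fits,
 * -1 when snprintf() failed or had to truncate the output.
 */
static int build_dev_path(char *dst, size_t dst_size, const char *name)
{
    int n = snprintf(dst, dst_size, "/dev/%s", name);

    /* snprintf() returns the length it wanted to write, excluding the NUL. */
    if (n < 0 || (size_t)n >= dst_size)
        return -1;

    return 0;
}
```

With a 10-byte destination, "uio0" fits exactly (9 characters plus the terminator), whereas "uio10" no longer does.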
Arnd Bergmann	c9bc45b346	Merge tag 'scmi-fix-6.13' of https://git.kernel.org/pub/scm/linux/kernel/git/sudeep.holla/linux into arm/fixes Arm SCMI fix for v6.13 Fix for the build issue in the ASoC driver with the SCMI support by enforcing the link-time dependency if IMX_SCMI_MISC_DRV is a loadable module but not if that is disabled. * tag 'scmi-fix-6.13' of https://git.kernel.org/pub/scm/linux/kernel/git/sudeep.holla/linux: firmware: arm_scmi: Fix i.MX build dependency Link: https://lore.kernel.org/r/20241205114348.708618-1-sudeep.holla@arm.com Signed-off-by: Arnd Bergmann <arnd@arndb.de>	2024-12-09 16:05:07 +01:00
Arnd Bergmann	90386e1ba4	Merge tag 'juno-fix-6.13' of https://git.kernel.org/pub/scm/linux/kernel/git/sudeep.holla/linux into arm/fixes Armv8 Juno fix for v6.13 Just a single fix updating the PCIe bus address range to accommodate the full ECAM window of 256MB available on most of the recent versions of RevC FVP models. * tag 'juno-fix-6.13' of https://git.kernel.org/pub/scm/linux/kernel/git/sudeep.holla/linux: arm64: dts: fvp: Update PCIe bus-range property Link: https://lore.kernel.org/r/20241205114302.708433-1-sudeep.holla@arm.com Signed-off-by: Arnd Bergmann <arnd@arndb.de>	2024-12-09 16:04:48 +01:00
Cristian Ciocaltea	9d23e48654	phy: rockchip: samsung-hdptx: Set drvdata before enabling runtime PM In some cases, rk_hdptx_phy_runtime_resume() may be invoked before platform_set_drvdata() is executed in ->probe(), leading to a NULL pointer dereference when using the return of dev_get_drvdata(). Ensure platform_set_drvdata() is called before devm_pm_runtime_enable(). Reported-by: Dmitry Osipenko <dmitry.osipenko@collabora.com> Fixes: `553be2830c` ("phy: rockchip: Add Samsung HDMI/eDP Combo PHY driver") Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Heiko Stuebner <heiko@sntech.de> Link: https://lore.kernel.org/r/20241023-phy-sam-hdptx-rpm-fix-v1-1-87f4c994e346@collabora.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-08 21:42:07 +05:30
Valentina Fernandez	48808b55b0	firmware: microchip: fix UL_IAP lock check in mpfs_auto_update_state() To verify that Auto Update is possible, the mpfs_auto_update_state() function performs a "Query Security Service Request" to the system controller. Previously, the check was performed on the first element of the response message, which was accessed using a 32-bit pointer. This caused the bitwise operation to reference incorrect data, as the response should be inspected at the byte level. Fixed this by casting the response to a u8 * pointer, ensuring the check correctly inspects the appropriate byte of the response message. Additionally, rename "UL_Auto Update" to "UL_IAP" to match the PolarFire Family System Services User Guide. Signed-off-by: Valentina Fernandez <valentina.fernandezalanis@microchip.com> Signed-off-by: Conor Dooley <conor.dooley@microchip.com>	2024-12-05 15:08:51 +00:00
Aneesh Kumar K.V (Arm)	4f776d81bf	arm64: dts: fvp: Update PCIe bus-range property These days, the Fixed Virtual Platforms(FVP) Base RevC model supports more PCI devices. Update the max bus number so that Linux can enumerate them correctly. Without this, the kernel throws the below error while booting with the default hierarchy \| pci_bus 0000:01: busn_res: [bus 01] end is updated to 01 \| pci_bus 0000:02: busn_res: can not insert [bus 02-01] under \| [bus 00-01] (conflicts with (null) [bus 00-01]) \| pci_bus 0000:02: busn_res: [bus 02-01] end is updated to 02 \| pci_bus 0000:02: busn_res: can not insert [bus 02] under \| [bus 00-01] (conflicts with (null) [bus 00-01]) \| pci_bus 0000:03: busn_res: can not insert [bus 03-01] under \| [bus 00-01] (conflicts with (null) [bus 00-01]) \| pci_bus 0000:03: busn_res: [bus 03-01] end is updated to 03 \| pci_bus 0000:03: busn_res: can not insert [bus 03] under \| [bus 00-01] (conflicts with (null) [bus 00-01]) \| pci_bus 0000:04: busn_res: can not insert [bus 04-01] under \| [bus 00-01] (conflicts with (null) [bus 00-01]) \| pci_bus 0000:04: busn_res: [bus 04-01] end is updated to 04 \| pci_bus 0000:04: busn_res: can not insert [bus 04] under \| [bus 00-01] (conflicts with (null) [bus 00-01]) \| pci 0000:00:01.0: BAR 14: assigned [mem 0x50000000-0x500fffff] \| pci-host-generic 40000000.pci: ECAM at [mem 0x40000000-0x4fffffff] \| for [bus 00-01] The change is using 0xff as max bus number because the ECAM window is 256MB in size. 
Below is the lspci output with and without the change: without fix =========== \| 00:00.0 Host bridge: ARM Device 00ba (rev 01) \| 00:01.0 PCI bridge: ARM Device 0def \| 00:02.0 PCI bridge: ARM Device 0def \| 00:03.0 PCI bridge: ARM Device 0def \| 00:04.0 PCI bridge: ARM Device 0def \| 00:1e.0 Unassigned class [ff00]: ARM Device ff80 \| 00:1e.1 Unassigned class [ff00]: ARM Device ff80 \| 00:1f.0 SATA controller: Device 0abc:aced (rev 01) \| 01:00.0 SATA controller: Device 0abc:aced (rev 01) with fix ======== \| 00:00.0 Host bridge: ARM Device 00ba (rev 01) \| 00:01.0 PCI bridge: ARM Device 0def \| 00:02.0 PCI bridge: ARM Device 0def \| 00:03.0 PCI bridge: ARM Device 0def \| 00:04.0 PCI bridge: ARM Device 0def \| 00:1e.0 Unassigned class [ff00]: ARM Device ff80 \| 00:1e.1 Unassigned class [ff00]: ARM Device ff80 \| 00:1f.0 SATA controller: Device 0abc:aced (rev 01) \| 01:00.0 SATA controller: Device 0abc:aced (rev 01) \| 02:00.0 Unassigned class [ff00]: ARM Device ff80 \| 02:00.4 Unassigned class [ff00]: ARM Device ff80 \| 03:00.0 PCI bridge: ARM Device 0def \| 04:00.0 PCI bridge: ARM Device 0def \| 04:01.0 PCI bridge: ARM Device 0def \| 04:02.0 PCI bridge: ARM Device 0def \| 05:00.0 SATA controller: Device 0abc:aced (rev 01) \| 06:00.0 Unassigned class [ff00]: ARM Device ff80 \| 06:00.7 Unassigned class [ff00]: ARM Device ff80 \| 07:00.0 Unassigned class [ff00]: ARM Device ff80 \| 07:00.3 Unassigned class [ff00]: ARM Device ff80 \| 08:00.0 Unassigned class [ff00]: ARM Device ff80 \| 08:00.1 Unassigned class [ff00]: ARM Device ff80 Cc: Sudeep Holla <sudeep.holla@arm.com> Cc: Lorenzo Pieralisi <lpieralisi@kernel.org> Cc: Rob Herring <robh@kernel.org> Cc: Krzysztof Kozlowski <krzk+dt@kernel.org> Cc: Conor Dooley <conor+dt@kernel.org> Reviewed-by: Liviu Dudau <liviu.dudau@arm.com> Signed-off-by: Aneesh Kumar K.V (Arm) <aneesh.kumar@kernel.org> Message-Id: <20241128152543.1821878-1-aneesh.kumar@kernel.org> Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>	2024-12-05 10:28:26 +00:00
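The step from a 256MB ECAM window to a maximum bus number of 0xff in `4f776d81bf` follows from the standard PCIe ECAM layout: 4KB of configuration space per function, up to 8 functions per device and up to 32 devices per bus, i.e. 1MB per bus. A small sketch of that arithmetic, with `ecam_max_bus()` as a hypothetical helper name:

```c
#include <stdint.h>

#define ECAM_BYTES_PER_FUNCTION   4096u
#define ECAM_FUNCTIONS_PER_DEVICE 8u
#define ECAM_DEVICES_PER_BUS      32u

/*
 * Highest bus number addressable through an ECAM window of the given
 * size. Assumes the window covers at least one full bus (1MB).
 */
static uint32_t ecam_max_bus(uint64_t window_bytes)
{
    uint64_t bytes_per_bus = (uint64_t)ECAM_BYTES_PER_FUNCTION *
                             ECAM_FUNCTIONS_PER_DEVICE *
                             ECAM_DEVICES_PER_BUS;

    return (uint32_t)(window_bytes / bytes_per_bus) - 1;
}
```

For the window [mem 0x40000000-0x4fffffff] quoted in the boot log, this gives 256 buses, so the bus range ends at 0xff.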
Roger Quadros	140054a25f	mtd: rawnand: omap2: Fix build warnings with W=1 Add kernel-doc for functions to get rid of below warnings when built with W=1. drivers/mtd/nand/raw/omap2.c:260: warning: Function parameter or struct member 'chip' not described in 'omap_nand_data_in_pref' drivers/mtd/nand/raw/omap2.c:260: warning: Function parameter or struct member 'buf' not described in 'omap_nand_data_in_pref' drivers/mtd/nand/raw/omap2.c:260: warning: Function parameter or struct member 'len' not described in 'omap_nand_data_in_pref' drivers/mtd/nand/raw/omap2.c:260: warning: Function parameter or struct member 'force_8bit' not described in 'omap_nand_data_in_pref' drivers/mtd/nand/raw/omap2.c:304: warning: Function parameter or struct member 'chip' not described in 'omap_nand_data_out_pref' drivers/mtd/nand/raw/omap2.c:304: warning: Function parameter or struct member 'buf' not described in 'omap_nand_data_out_pref' drivers/mtd/nand/raw/omap2.c:304: warning: Function parameter or struct member 'len' not described in 'omap_nand_data_out_pref' drivers/mtd/nand/raw/omap2.c:304: warning: Function parameter or struct member 'force_8bit' not described in 'omap_nand_data_out_pref' drivers/mtd/nand/raw/omap2.c:446: warning: Function parameter or struct member 'chip' not described in 'omap_nand_data_in_dma_pref' drivers/mtd/nand/raw/omap2.c:446: warning: Function parameter or struct member 'buf' not described in 'omap_nand_data_in_dma_pref' drivers/mtd/nand/raw/omap2.c:446: warning: Function parameter or struct member 'len' not described in 'omap_nand_data_in_dma_pref' drivers/mtd/nand/raw/omap2.c:446: warning: Function parameter or struct member 'force_8bit' not described in 'omap_nand_data_in_dma_pref' drivers/mtd/nand/raw/omap2.c:467: warning: Function parameter or struct member 'chip' not described in 'omap_nand_data_out_dma_pref' drivers/mtd/nand/raw/omap2.c:467: warning: Function parameter or struct member 'buf' not described in 'omap_nand_data_out_dma_pref' 
drivers/mtd/nand/raw/omap2.c:467: warning: Function parameter or struct member 'len' not described in 'omap_nand_data_out_dma_pref' drivers/mtd/nand/raw/omap2.c:467: warning: Function parameter or struct member 'force_8bit' not described in 'omap_nand_data_out_dma_pref' Reported-by: kernel test robot <lkp@intel.com> Closes: https://lore.kernel.org/oe-kbuild-all/202412031716.JfNIh1Uu-lkp@intel.com/ Signed-off-by: Roger Quadros <rogerq@kernel.org> Signed-off-by: Miquel Raynal <miquel.raynal@bootlin.com>	2024-12-05 11:15:00 +01:00
Maciej Andrzejewski	11e6831fd8	mtd: rawnand: arasan: Fix missing de-registration of NAND The NAND chip-selects are registered for the Arasan driver during initialization but are not de-registered when the driver is unloaded. As a result, if the driver is loaded again, the chip-selects remain registered and busy, making them unavailable for use. Fixes: `197b88fecc` ("mtd: rawnand: arasan: Add new Arasan NAND controller") Cc: stable@vger.kernel.org Signed-off-by: Maciej Andrzejewski ICEYE <maciej.andrzejewski@m-works.net> Signed-off-by: Miquel Raynal <miquel.raynal@bootlin.com>	2024-12-05 11:13:52 +01:00
Maciej Andrzejewski	b086a46dae	mtd: rawnand: arasan: Fix double assertion of chip-select When two chip-selects are configured in the device tree, and the second is a non-native GPIO, both the GPIO-based chip-select and the first native chip-select may be asserted simultaneously. This double assertion causes incorrect read and write operations. The issue occurs because when nfc->ncs <= 2, nfc->spare_cs is always initialized to 0 due to static initialization. Consequently, when the second chip-select (GPIO-based) is selected in anfc_assert_cs(), it is detected by anfc_is_gpio_cs(), and nfc->native_cs is assigned the value 0. This results in both the GPIO-based chip-select being asserted and the NAND controller register receiving 0, erroneously selecting the native chip-select. This patch resolves the issue, as confirmed by oscilloscope testing with configurations involving two or more chip-selects in the device tree. Fixes: `acbd3d0945` ("mtd: rawnand: arasan: Leverage additional GPIO CS") Cc: stable@vger.kernel.org Signed-off-by: Maciej Andrzejewski <maciej.andrzejewski@m-works.net> Signed-off-by: Miquel Raynal <miquel.raynal@bootlin.com>	2024-12-05 11:13:41 +01:00
Zichen Xie	9b458e8be0	mtd: diskonchip: Cast an operand to prevent potential overflow There may be a potential integer overflow issue in inftl_partscan(). parts[0].size is defined as "uint64_t" while mtd->erasesize and ip->firstUnit are defined as 32-bit unsigned integer. The result of the calculation will be limited to 32 bits without correct casting. Fixes: `1da177e4c3` ("Linux-2.6.12-rc2") Signed-off-by: Zichen Xie <zichenxie0106@gmail.com> Cc: stable@vger.kernel.org Signed-off-by: Miquel Raynal <miquel.raynal@bootlin.com>	2024-12-05 11:09:12 +01:00
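The overflow described in `9b458e8be0` is a general property of C integer arithmetic and can be reproduced outside the kernel. The sketch below uses hypothetical function names and plain `uint32_t` parameters in place of `ip->firstUnit` and `mtd->erasesize`.

```c
#include <stdint.h>

/*
 * Both operands are 32-bit, so the product is computed modulo 2^32 and
 * only then widened to 64 bits for the return value.
 */
static uint64_t size_without_cast(uint32_t first_unit, uint32_t erasesize)
{
    return first_unit * erasesize;
}

/* Casting one operand first makes the multiplication itself 64-bit. */
static uint64_t size_with_cast(uint32_t first_unit, uint32_t erasesize)
{
    return (uint64_t)first_unit * erasesize;
}
```

For operands of 0x10000 each, the uncast product wraps to 0 while the cast version yields 0x100000000.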
Dan Carpenter	d8e4771f99	mtd: rawnand: fix double free in atmel_pmecc_create_user() The "user" pointer was converted from being allocated with kzalloc() to being allocated by devm_kzalloc(). Calling kfree(user) will lead to a double free. Fixes: `6d734f1bfc` ("mtd: rawnand: atmel: Fix possible memory leak") Signed-off-by: Dan Carpenter <dan.carpenter@linaro.org> Signed-off-by: Miquel Raynal <miquel.raynal@bootlin.com>	2024-12-05 11:06:43 +01:00
Arnd Bergmann	2de679ecd7	phy: stm32: work around constant-value overflow assertion FIELD_PREP() checks that a constant fits into the available bitfield, but if one of the two lookup tables in stm32_impedance_tune() does not find a matching entry, the index is out of range, which gcc correctly complains about: In file included from <command-line>: In function 'stm32_impedance_tune', inlined from 'stm32_combophy_pll_init' at drivers/phy/st/phy-stm32-combophy.c:247:9: include/linux/compiler_types.h:517:38: error: call to '__compiletime_assert_447' declared with attribute error: FIELD_PREP: value too large for the field 517 \| _compiletime_assert(condition, msg, __compiletime_assert_, __COUNTER__) \| ^ include/linux/bitfield.h:68:3: note: in expansion of macro 'BUILD_BUG_ON_MSG' 68 \| BUILD_BUG_ON_MSG(__builtin_constant_p(_val) ? \ 115 \| __BF_FIELD_CHECK(_mask, 0ULL, _val, "FIELD_PREP: "); \ \| ^~~~~~~~~~~~~~~~ drivers/phy/st/phy-stm32-combophy.c:162:8: note: in expansion of macro 'FIELD_PREP' 162 \| FIELD_PREP(STM32MP25_PCIEPRG_IMPCTRL_VSWING, vswing_of)); \| ^~~~~~~~~~ Rework this so the field value gets set inside of the loop and otherwise set to zero. Fixes: `47e1bb6b4b` ("phy: stm32: Add support for STM32MP25 COMBOPHY.") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Link: https://lore.kernel.org/r/20241111103712.3520611-1-arnd@kernel.org Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-04 20:04:22 +05:30
Krishna Kurapati	8886fb3240	phy: qcom-qmp: Fix register name in RX Lane config of SC8280XP In RX Lane configuration sequence of SC8280XP, the register V5_RX_UCDR_FO_GAIN is incorrectly spelled as RX_UCDR_SO_GAIN and hence the programming sequence is wrong. Fix the register sequence accordingly to avoid any compliance failures. This has been tested on SA8775P by checking device mode enumeration in SuperSpeed. Cc: stable@vger.kernel.org Fixes: `c0c7769cda` ("phy: qcom-qmp: Add SC8280XP USB3 UNI phy") Signed-off-by: Krishna Kurapati <quic_kriskura@quicinc.com> Reviewed-by: Konrad Dybcio <konrad.dybcio@oss.qualcomm.com> Link: https://lore.kernel.org/r/20241112092831.4110942-1-quic_kriskura@quicinc.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-04 20:02:27 +05:30
Chukun Pan	fbcbffbac9	phy: rockchip: naneng-combphy: fix phy reset Currently, the USB port via combophy on the RK3528/RK3588 SoC is broken. usb usb8-port1: Cannot enable. Maybe the USB cable is bad? This is due to the combphy of RK3528/RK3588 SoC has multiple resets, but only "phy resets" need assert and deassert, "apb resets" don't need. So change the driver to only match the phy resets, which is also what the vendor kernel does. Fixes: `7160820d74` ("phy: rockchip: add naneng combo phy for RK3568") Cc: FUKAUMI Naoki <naoki@radxa.com> Cc: Michael Zimmermann <sigmaepsilon92@gmail.com> Signed-off-by: Chukun Pan <amadeus@jmu.edu.cn> Reviewed-by: Heiko Stuebner <heiko@sntech.de> Tested-by: FUKAUMI Naoki <naoki@radxa.com> Link: https://lore.kernel.org/r/20241122073006.99309-2-amadeus@jmu.edu.cn Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-04 19:49:08 +05:30
Justin Chen	0a92ea87bd	phy: usb: Toggle the PHY power during init When bringing up the PHY, it might be in a bad state if left powered. One case is we lose the PLL lock if the PLL is gated while the PHY is powered. Toggle the PHY power so we can start from a known state. Fixes: `4e5b9c9a73` ("phy: usb: Add support for new Synopsys USB controller on the 7216") Signed-off-by: Justin Chen <justin.chen@broadcom.com> Acked-by: Florian Fainelli <florian.fainelli@broadcom.com> Link: https://lore.kernel.org/r/20241024213540.1059412-1-justin.chen@broadcom.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-04 19:45:35 +05:30
Lizhi Hou	dcbef0798e	dmaengine: amd: qdma: Remove using the private get and set dma_ops APIs The get_dma_ops and set_dma_ops APIs were never for driver to use. Remove these calls from QDMA driver. Instead, pass the DMA device pointer from the qdma_platdata structure. Fixes: `73d5fc92a1` ("dmaengine: amd: qdma: Add AMD QDMA driver") Signed-off-by: Lizhi Hou <lizhi.hou@amd.com> Reviewed-by: Christoph Hellwig <hch@lst.de> Link: https://lore.kernel.org/r/20240918181022.2155715-1-lizhi.hou@amd.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-04 18:27:32 +05:30
Sasha Finkelstein	8d55e8a16f	dmaengine: apple-admac: Avoid accessing registers in probe The ADMAC attached to the AOP has complex power sequencing, and is power gated when the probe callback runs. Move the register reads to other functions, where we can guarantee that the hardware is switched on. Fixes: `568aa6dd64` ("dmaengine: apple-admac: Allocate cache SRAM to channels") Signed-off-by: Sasha Finkelstein <fnkl.kernel@gmail.com> Link: https://lore.kernel.org/r/20241124-admac-power-v1-1-58f2165a4d55@gmail.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-04 17:42:27 +05:30
Randy Dunlap	790fb9956e	linux/dmaengine.h: fix a few kernel-doc warnings The comment block for "Interleaved Transfer Request" should not begin with "/**" since it is not in kernel-doc format. Fix doc name for enum sum_check_flags. Fix all (4) missing struct member warnings. Use "Warning:" for one "Note:" in enum dma_desc_metadata_mode since scripts/kernel-doc does not allow more than one Note: per function or identifier description. This leaves around 49 kernel-doc warnings like: include/linux/dmaengine.h:43: warning: Enum value 'DMA_OUT_OF_ORDER' not described in enum 'dma_status' and another scripts/kernel-doc problem with it not being able to parse some typedefs. Fixes: `b14dab792d` ("DMAEngine: Define interleaved transfer request api") Fixes: `ad283ea4a3` ("async_tx: add sum check flags") Fixes: `272420214d` ("dmaengine: Add DMA_CTRL_REUSE") Fixes: `f067025bc6` ("dmaengine: add support to provide error result from a DMA transation") Fixes: `d38a8c622a` ("dmaengine: prepare for generic 'unmap' data") Fixes: `5878853fc9` ("dmaengine: Add API function dmaengine_prep_peripheral_dma_vec()") Signed-off-by: Randy Dunlap <rdunlap@infradead.org> Cc: Dan Williams <dan.j.williams@intel.com> Cc: Dave Jiang <dave.jiang@intel.com> Cc: Paul Cercueil <paul@crapouillou.net> Cc: Nuno Sa <nuno.sa@analog.com> Cc: Vinod Koul <vkoul@kernel.org> Cc: dmaengine@vger.kernel.org Link: https://lore.kernel.org/r/20241202172004.76020-1-rdunlap@infradead.org Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-04 17:41:25 +05:30
Levi Yun	6fe437cfe2	firmware: arm_ffa: Fix the race around setting ffa_dev->properties Currently, ffa_dev->properties is set after the ffa_device_register() call return in ffa_setup_partitions(). This could potentially result in a race where the partition's properties is accessed while probing struct ffa_device before it is set. Update the ffa_device_register() to receive ffa_partition_info so all the data from the partition information received from the firmware can be updated into the struct ffa_device before the calling device_register() in ffa_device_register(). Fixes: `e781858488` ("firmware: arm_ffa: Add initial FFA bus support for device enumeration") Signed-off-by: Levi Yun <yeoreum.yun@arm.com> Message-Id: <20241203143109.1030514-2-yeoreum.yun@arm.com> Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>	2024-12-04 09:59:54 +00:00
Herve Codina	d7dfa7fde6	of: Fix error path in of_parse_phandle_with_args_map() The current code uses some 'goto put;' to cancel the parsing operation and can lead to a return code value of 0 even on error cases. Indeed, some goto calls are done from a loop without setting the ret value explicitly before the goto call and so the ret value can be set to 0 due to an operation done in a previous loop iteration. For instance match can be set to 0 in the previous loop iteration (leading to a new iteration) but ret can also be set to 0 if the of_property_read_u32() call succeeds. In that case if no match is found or if an error is detected in the new iteration, the return value can be wrongly 0. Avoid those cases by setting the ret value explicitly before the goto calls. Fixes: `bd6f2fd5a1` ("of: Support parsing phandle argument lists through a nexus node") Cc: stable@vger.kernel.org Signed-off-by: Herve Codina <herve.codina@bootlin.com> Link: https://lore.kernel.org/r/20241202165819.158681-1-herve.codina@bootlin.com Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-12-03 11:31:19 -06:00
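The stale return value pattern described in `d7dfa7fde6` can be shown with a reduced example. This is not the kernel function: `read_entry()`, `find_buggy()` and `find_fixed()` are hypothetical, and a negative table value stands in for any error detected inside the loop.

```c
#include <errno.h>
#include <stddef.h>

/* Hypothetical helper: reads one table entry, returns 0 or a negative errno. */
static int read_entry(const int *table, size_t len, size_t idx, int *out)
{
    if (idx >= len)
        return -EINVAL;
    *out = table[idx];
    return 0;
}

/* Buggy shape: error exits reuse the 0 left in ret by read_entry(). */
static int find_buggy(const int *table, size_t len, int key, size_t *pos)
{
    int ret = -EINVAL;
    int val;
    size_t i;

    for (i = 0; i < len; i++) {
        ret = read_entry(table, len, i, &val);
        if (ret)
            goto out;
        if (val < 0)
            goto out;   /* malformed entry, but ret is still 0 */
        if (val == key) {
            *pos = i;
            goto out;
        }
    }
    /* Not found: ret still holds the 0 from the last successful read. */
out:
    return ret;
}

/* Fixed shape: ret is set explicitly before every exit. */
static int find_fixed(const int *table, size_t len, int key, size_t *pos)
{
    int ret;
    int val;
    size_t i;

    for (i = 0; i < len; i++) {
        ret = read_entry(table, len, i, &val);
        if (ret)
            goto out;
        if (val < 0) {
            ret = -EINVAL;
            goto out;
        }
        if (val == key) {
            *pos = i;
            ret = 0;
            goto out;
        }
    }
    ret = -ENOENT;
out:
    return ret;
}
```

The buggy variant reports success both for a malformed entry and for a key that is not in the table; the fixed variant returns a negative errno in both cases.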
Rob Herring (Arm)	239521712b	dt-bindings: mtd: fixed-partitions: Fix "compression" typo The example erroneously has "compress" property rather than the documented "compression" property. Acked-by: Conor Dooley <conor.dooley@microchip.com> Link: https://lore.kernel.org/r/20241113225632.1783241-1-robh@kernel.org Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-12-03 11:31:19 -06:00
Arnd Bergmann	514b2262ad	firmware: arm_scmi: Fix i.MX build dependency The newly added SCMI vendor driver references functions in the protocol driver but needs a Kconfig dependency to ensure it can link, essentially the Kconfig dependency needs to be reversed to match the link time dependency: \| arm-linux-gnueabi-ld: sound/soc/fsl/fsl_mqs.o: in function `fsl_mqs_sm_write': \| fsl_mqs.c:(.text+0x1aa): undefined reference to `scmi_imx_misc_ctrl_set' \| arm-linux-gnueabi-ld: sound/soc/fsl/fsl_mqs.o: in function `fsl_mqs_sm_read': \| fsl_mqs.c:(.text+0x1ee): undefined reference to `scmi_imx_misc_ctrl_get' This however only works after changing the dependency in the SND_SOC_FSL_MQS driver as well, which uses 'select IMX_SCMI_MISC_DRV' to turn on a driver it depends on. This is generally a bad idea, so the best solution is to change that into a dependency. To allow the ASoC driver to keep building with the SCMI support, this needs to be an optional dependency that enforces the link-time dependency if IMX_SCMI_MISC_DRV is a loadable module but not depend on it if that is disabled. Fixes: `61c9f03e22` ("firmware: arm_scmi: Add initial support for i.MX MISC protocol") Fixes: `101c902359` ("ASoC: fsl_mqs: Support accessing registers by scmi interface") Signed-off-by: Arnd Bergmann <arnd@arndb.de> Acked-by: Mark Brown <broonie@kernel.org> Acked-by: Shengjiu Wang <shengjiu.wang@gmail.com> Message-Id: <20241115230555.2435004-1-arnd@kernel.org> Signed-off-by: Sudeep Holla <sudeep.holla@arm.com>	2024-12-03 15:47:11 +00:00
Binbin Zhou	4b65d5322e	dmaengine: loongson2-apb: Change GENMASK to GENMASK_ULL Fix the following smatch static checker warning: drivers/dma/loongson2-apb-dma.c:189 ls2x_dma_write_cmd() warn: was expecting a 64 bit value instead of '~(((0)) + (((~((0))) - (((1)) << (0)) + 1) & (~((0)) >> ((8 * 4) - 1 - (4)))))' The GENMASK macro used "unsigned long", which caused build issues when using a 32-bit toolchain because it would try to access bits > 31. This patch switches GENMASK to GENMASK_ULL, which uses "unsigned long long". Fixes: `71e7d3cb6e` ("dmaengine: ls2x-apb: New driver for the Loongson LS2X APB DMA controller") Reported-by: Dan Carpenter <dan.carpenter@linaro.org> Closes: https://lore.kernel.org/all/87cdc025-7246-4548-85ca-3d36fdc2be2d@stanley.mountain/ Signed-off-by: Binbin Zhou <zhoubinbin@loongson.cn> Link: https://lore.kernel.org/r/20241028093413.1145820-1-zhoubinbin@loongson.cn Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-02 22:48:57 +05:30
Andy Shevchenko	f0e870a0e9	dmaengine: dw: Select only supported masters for ACPI devices The recently submitted fix-commit revealed a problem in the iDMA 32-bit platform code. Even though the controller supported only a single master the dw_dma_acpi_filter() method hard-coded two master interfaces with IDs 0 and 1. As a result the sanity check implemented in the commit `b336268dde` ("dmaengine: dw: Add peripheral bus width verification") got incorrect interface data width and thus prevented the client drivers from configuring the DMA-channel with the EINVAL error returned. E.g., the next error was printed for the PXA2xx SPI controller driver trying to configure the requested channels: > [ 164.525604] pxa2xx_spi_pci 0000:00:07.1: DMA slave config failed > [ 164.536105] pxa2xx_spi_pci 0000:00:07.1: failed to get DMA TX descriptor > [ 164.543213] spidev spi-SPT0001:00: SPI transfer failed: -16 The problem would have been spotted much earlier if the iDMA 32-bit controller supported more than one master interfaces. But since it supports just a single master and the iDMA 32-bit specific code just ignores the master IDs in the CTLLO preparation method, the issue has been gone unnoticed so far. Fix the problem by specifying the default master ID for both memory and peripheral devices in the driver data. Thus the issue noticed for the iDMA 32-bit controllers will be eliminated and the ACPI-probed DW DMA controllers will be configured with the correct master ID by default. 
Cc: stable@vger.kernel.org Fixes: `b336268dde` ("dmaengine: dw: Add peripheral bus width verification") Fixes: `199244d694` ("dmaengine: dw: add support of iDMA 32-bit hardware") Reported-by: Ferry Toth <fntoth@gmail.com> Closes: https://lore.kernel.org/dmaengine/ZuXbCKUs1iOqFu51@black.fi.intel.com/ Reported-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com> Closes: https://lore.kernel.org/dmaengine/ZuXgI-VcHpMgbZ91@black.fi.intel.com/ Tested-by: Ferry Toth <fntoth@gmail.com> Signed-off-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com> Link: https://lore.kernel.org/r/20241104095142.157925-1-andriy.shevchenko@linux.intel.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-02 22:35:45 +05:30
Chen Ridong	c43ec96e8d	dmaengine: at_xdmac: avoid null_prt_deref in at_xdmac_prep_dma_memset The at_xdmac_memset_create_desc may return NULL, which will lead to a null pointer dereference. For example, the len input is error, or the atchan->free_descs_list is empty and memory is exhausted. Therefore, add check to avoid this. Fixes: `b206d9a23a` ("dmaengine: xdmac: Add memset support") Signed-off-by: Chen Ridong <chenridong@huawei.com> Link: https://lore.kernel.org/r/20241029082845.1185380-1-chenridong@huaweicloud.com Signed-off-by: Vinod Koul <vkoul@kernel.org>	2024-12-02 22:14:19 +05:30
Herve Codina	60bc447c85	of: Add #address-cells/#size-cells in the device-tree root empty node On systems where ACPI is enabled or when a device-tree is not passed to the kernel by the bootloader, a device-tree root empty node is created. This device-tree root empty node does not have the #address-cells and the #size-cells properties This leads to the use of the default address cells and size cells values which are defined in the code to 1 for the address cells value and 1 for the size cells value. According to the devicetree specification and the OpenFirmware standard (IEEE 1275-1994) the default value for #address-cells should be 2. Also, according to the devicetree specification, the #address-cells and the #size-cells are required properties in the root node. The device tree compiler already uses 2 as default value for address cells and 1 for size cells. The powerpc PROM code also uses 2 as default value for address cells and 1 for size cells. Modern implementation should have the #address-cells and the #size-cells properties set and should not rely on default values. On x86, this root empty node is used and the code default values are used. In preparation of the support for device-tree overlay on PCI devices feature on x86 (i.e. the creation of the PCI root bus device-tree node), the default value for #address-cells needs to be updated. Indeed, on x86_64, addresses are on 64bits and the upper part of an address is needed for correct address translations. On x86_32 having the default value updated does not lead to issues while the upper part of a 64-bit value is zero. Changing the default value for all architectures may break device-tree compatibility. Indeed, existing dts file without the #address-cells property set in the root node will not be compatible with this modification. Instead of updating default values, add both required #address-cells and #size-cells properties in the device-tree empty node. 
Use 2 for both properties value in order to fully support 64-bit addresses and sizes on systems using this empty root node. Signed-off-by: Herve Codina <herve.codina@bootlin.com> Link: https://lore.kernel.org/r/20241202131522.142268-6-herve.codina@bootlin.com Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-12-02 09:26:33 -06:00
Rob Herring (Arm)	61a6ba233f	dt-bindings: Unify "fsl,liodn" type definitions The type definition of "fsl,liodn" is defined as uint32 in crypto/fsl,sec-v4.0.yaml and uint32-array in soc/fsl/fsl,bman.yaml, soc/fsl/fsl,qman-portal.yaml, and soc/fsl/fsl,qman.yaml. Unify the type to be uint32-array and constraint the single entry cases. Reviewed-by: Conor Dooley <conor.dooley@microchip.com> Link: https://lore.kernel.org/r/20241113225614.1782862-1-robh@kernel.org Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-11-27 09:24:23 -06:00
Andrea della Porta	7f05e20b98	of: address: Preserve the flags portion on 1:1 dma-ranges mapping A missing or empty dma-ranges in a DT node implies a 1:1 mapping for dma translations. In this specific case, the current behaviour is to zero out the entire specifier so that the translation could be carried on as an offset from zero. This includes address specifier that has flags (e.g. PCI ranges). Once the flags portion has been zeroed, the translation chain is broken since the mapping functions will check the upcoming address specifier against mismatching flags, always failing the 1:1 mapping and its entire purpose of always succeeding. Set to zero only the address portion while passing the flags through. Fixes: `dbbdee9473` ("of/address: Merge all of the bus translation code") Cc: stable@vger.kernel.org Signed-off-by: Andrea della Porta <andrea.porta@suse.com> Tested-by: Herve Codina <herve.codina@bootlin.com> Link: https://lore.kernel.org/r/e51ae57874e58a9b349c35e2e877425ebc075d7a.1732441813.git.andrea.porta@suse.com Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-11-27 09:18:04 -06:00
Andrea della Porta	1a75e81baf	of/unittest: Add empty dma-ranges address translation tests Intermediate DT PCI nodes dynamically generated by enabling CONFIG_PCI_DYNAMIC_OF_NODES have empty dma-ranges property. PCI address specifiers have 3 cells and when dma-ranges is missing or empty, of_translate_one() is currently dropping the flag portion of PCI addresses which are subnodes of the aforementioned ones, failing the translation. Add new tests covering this case. With this test, we get 1 new failure which is fixed in subsequent commit: FAIL of_unittest_pci_empty_dma_ranges():1245 for_each_of_pci_range wrong CPU addr (ffffffffffffffff) on node /testcase-data/address-tests2/pcie@d1070000/pci@0,0/dev@0,0/local-bus@0 Signed-off-by: Andrea della Porta <andrea.porta@suse.com> Link: https://lore.kernel.org/r/08f8fee4fdc0379240fda2f4a0e6f11ebf9647a8.1732441813.git.andrea.porta@suse.com Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-11-27 09:18:04 -06:00
Samuel Holland	bc7acc0bd0	of: property: fw_devlink: Do not use interrupt-parent directly commit `7f00be96f1` ("of: property: Add device link support for interrupt-parent, dmas and -gpio(s)") started adding device links for the interrupt-parent property. commit `4104ca776b` ("of: property: Add fw_devlink support for interrupts") and commit `f265f06af1` ("of: property: Fix fw_devlink handling of interrupts/interrupts-extended") later added full support for parsing the interrupts and interrupts-extended properties, which includes looking up the node of the parent domain. This made the handler for the interrupt-parent property redundant. In fact, creating device links based solely on interrupt-parent is problematic, because it can create spurious cycles. A node may have this property without itself being an interrupt controller or consumer. For example, this property is often present in the root node or a /soc bus node to set the default interrupt parent for child nodes. However, it is incorrect for the bus to depend on the interrupt controller, as some of the bus's children may not be interrupt consumers at all or may have a different interrupt parent. Resolving these spurious dependency cycles can cause an incorrect probe order for interrupt controller drivers. This was observed on a RISC-V system with both an APLIC and IMSIC under /soc, where interrupt-parent in /soc points to the APLIC, and the APLIC msi-parent points to the IMSIC. fw_devlink found three dependency cycles and attempted to probe the APLIC before the IMSIC. After applying this patch, there were no dependency cycles and the probe order was correct. 
Acked-by: Marc Zyngier <maz@kernel.org> Cc: stable@vger.kernel.org Fixes: `4104ca776b` ("of: property: Add fw_devlink support for interrupts") Signed-off-by: Samuel Holland <samuel.holland@sifive.com> Link: https://lore.kernel.org/r/20241120233124.3649382-1-samuel.holland@sifive.com Signed-off-by: Rob Herring (Arm) <robh@kernel.org>	2024-11-25 08:24:17 -06:00

350 changed files with 4681 additions and 1943 deletions

1

.mailmap


@@ -735,6 +735,7 @@ Wolfram Sang <wsa@kernel.org> <w.sang@pengutronix.de>
 Wolfram Sang <wsa@kernel.org> <wsa@the-dreams.de>
 Yakir Yang <kuankuan.y@gmail.com> <ykk@rock-chips.com>
 Yanteng Si <si.yanteng@linux.dev> <siyanteng@loongson.cn>
 Ying Huang <huang.ying.caritas@gmail.com> <ying.huang@intel.com>
 Yusuke Goda <goda.yusuke@renesas.com>
 Zack Rusin <zack.rusin@broadcom.com> <zackr@vmware.com>
 Zhu Yanjun <zyjzyj2000@gmail.com> <yanjunz@nvidia.com>

									
										4

Documentation/admin-guide/pm/amd-pstate.rst
									
												
				@@ -251,9 +251,7 @@ performance supported in `AMD CPPC Performance Capability <perf_cap_>`_).

				In some ASICs, the highest CPPC performance is not the one in the ``_CPC``

				table, so we need to expose it to sysfs. If boost is not active, but

				still supported, this maximum frequency will be larger than the one in

				-``cpuinfo``. On systems that support preferred core, the driver will have

				-different values for some cores than others and this will reflect the values

				-advertised by the platform at bootup.

				+``cpuinfo``.

				This attribute is read-only.

				``amd_pstate_lowest_nonlinear_freq``

									
										10

Documentation/devicetree/bindings/crypto/fsl,sec-v4.0.yaml
									
												
				@@ -114,8 +114,9 @@ patternProperties:

				          table that specifies the PPID to LIODN mapping. Needed if the PAMU is

				          used.  Value is a 12 bit value where value is a LIODN ID for this JR.

				          This property is normally set by boot firmware.

				-        $ref: /schemas/types.yaml#/definitions/uint32

				-        maximum: 0xfff

				+        $ref: /schemas/types.yaml#/definitions/uint32-array

				+        items:

				+          - maximum: 0xfff

				  '^rtic@[0-9a-f]+$':

				    type: object

				@@ -186,8 +187,9 @@ patternProperties:

				              Needed if the PAMU is used.  Value is a 12 bit value where value

				              is a LIODN ID for this JR. This property is normally set by boot

				              firmware.

				-            $ref: /schemas/types.yaml#/definitions/uint32

				-            maximum: 0xfff

				+            $ref: /schemas/types.yaml#/definitions/uint32-array

				+            items:

				+              - maximum: 0xfff

				          fsl,rtic-region:

				            description:

									
										2

Documentation/devicetree/bindings/mtd/partitions/fixed-partitions.yaml
									
												
				@@ -82,7 +82,7 @@ examples:

				        uimage@100000 {

				            reg = <0x0100000 0x200000>;

				-            compress = "lzma";

				+            compression = "lzma";

				        };

				    };

									
										2

Documentation/devicetree/bindings/soc/fsl/fsl,qman-portal.yaml
									
												
				@@ -35,6 +35,7 @@ properties:

				  fsl,liodn:

				    $ref: /schemas/types.yaml#/definitions/uint32-array

				    maxItems: 2

				    description: See pamu.txt. Two LIODN(s). DQRR LIODN (DLIODN) and Frame LIODN

				      (FLIODN)

				@@ -69,6 +70,7 @@ patternProperties:

				    type: object

				    properties:

				      fsl,liodn:

				        $ref: /schemas/types.yaml#/definitions/uint32-array

				        description: See pamu.txt, PAMU property used for static LIODN assignment

				      fsl,iommu-parent:

									
										2

Documentation/devicetree/bindings/sound/realtek,rt5645.yaml
									
												
				@@ -51,7 +51,7 @@ properties:

				    description: Power supply for AVDD, providing 1.8V.

				  cpvdd-supply:

				-    description: Power supply for CPVDD, providing 3.5V.

				+    description: Power supply for CPVDD, providing 1.8V.

				  hp-detect-gpios:

				    description:

									
										850

Documentation/mm/process_addrs.rst
									
												
				@@ -3,3 +3,853 @@

				=================

				Process Addresses

				=================

				.. toctree::

				   :maxdepth: 3

				Userland memory ranges are tracked by the kernel via Virtual Memory Areas or

				'VMA's of type :c:struct:`!struct vm_area_struct`.

				Each VMA describes a virtually contiguous memory range with identical

				attributes, each described by a :c:struct:`!struct vm_area_struct`

				object. Userland access outside of VMAs is invalid except in the case where an

				adjacent stack VMA could be extended to contain the accessed address.

				All VMAs are contained within one and only one virtual address space, described

				by a :c:struct:`!struct mm_struct` object which is referenced by all tasks (that is,

				threads) which share the virtual address space. We refer to this as the

				:c:struct:`!mm`.

				Each mm object contains a maple tree data structure which describes all VMAs

				within the virtual address space.

				.. note:: An exception to this is the 'gate' VMA which is provided by

				          architectures which use :c:struct:`!vsyscall` and is a global static

				          object which does not belong to any specific mm.

				-------

				Locking

				-------

				The kernel is designed to be highly scalable against concurrent read operations

				on VMA **metadata** so a complicated set of locks is required to ensure memory

				corruption does not occur.

				.. note:: Locking VMAs for their metadata does not have any impact on the memory

				          they describe nor the page tables that map them.

				Terminology

				-----------

				* **mmap locks** - Each MM has a read/write semaphore :c:member:`!mmap_lock`

				  which locks at a process address space granularity which can be acquired via

				  :c:func:`!mmap_read_lock`, :c:func:`!mmap_write_lock` and variants.

				* **VMA locks** - The VMA lock is at VMA granularity (of course) which behaves

				  as a read/write semaphore in practice. A VMA read lock is obtained via

				  :c:func:`!lock_vma_under_rcu` (and unlocked via :c:func:`!vma_end_read`) and a

				  write lock via :c:func:`!vma_start_write` (all VMA write locks are unlocked

				  automatically when the mmap write lock is released). To take a VMA write lock

				  you **must** have already acquired an :c:func:`!mmap_write_lock`.

				* **rmap locks** - When trying to access VMAs through the reverse mapping via a

				  :c:struct:`!struct address_space` or :c:struct:`!struct anon_vma` object

				  (reachable from a folio via :c:member:`!folio->mapping`). VMAs must be stabilised via

				  :c:func:`!anon_vma_[try]lock_read` or :c:func:`!anon_vma_[try]lock_write` for

				  anonymous memory and :c:func:`!i_mmap_[try]lock_read` or

				  :c:func:`!i_mmap_[try]lock_write` for file-backed memory. We refer to these

				  locks as the reverse mapping locks, or 'rmap locks' for brevity.

				We discuss page table locks separately in the dedicated section below.

				The first thing **any** of these locks achieve is to **stabilise** the VMA

				within the MM tree. That is, guaranteeing that the VMA object will not be

				deleted from under you nor modified (except for some specific fields

				described below).

				Stabilising a VMA also keeps the address space described by it around.

				Lock usage

				----------

				If you want to **read** VMA metadata fields or just keep the VMA stable, you

				must do one of the following:

				* Obtain an mmap read lock at the MM granularity via :c:func:`!mmap_read_lock` (or a

				  suitable variant), unlocking it with a matching :c:func:`!mmap_read_unlock` when

				  you're done with the VMA, *or*

				* Try to obtain a VMA read lock via :c:func:`!lock_vma_under_rcu`. This tries to

				  acquire the lock atomically so might fail, in which case fall-back logic is

				  required to instead obtain an mmap read lock if this returns :c:macro:`!NULL`,

				  *or*

				* Acquire an rmap lock before traversing the locked interval tree (whether

				  anonymous or file-backed) to obtain the required VMA.

				If you want to **write** VMA metadata fields, then things vary depending on the

				field (we explore each VMA field in detail below). For the majority you must:

				* Obtain an mmap write lock at the MM granularity via :c:func:`!mmap_write_lock` (or a

				  suitable variant), unlocking it with a matching :c:func:`!mmap_write_unlock` when

				  you're done with the VMA, *and*

				* Obtain a VMA write lock via :c:func:`!vma_start_write` for each VMA you wish to

				  modify, which will be released automatically when :c:func:`!mmap_write_unlock` is

				  called.

				* If you want to be able to write to **any** field, you must also hide the VMA

				  from the reverse mapping by obtaining an **rmap write lock**.

				VMA locks are special in that you must obtain an mmap **write** lock **first**

				in order to obtain a VMA **write** lock. A VMA **read** lock however can be

				obtained without any other lock (:c:func:`!lock_vma_under_rcu` will acquire then

				release an RCU lock to lookup the VMA for you).

				This constrains the impact of writers on readers, as a writer can interact with

				one VMA while a reader interacts with another simultaneously.
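
The read-side fallback described above can be modelled in userspace. What follows is a minimal single-threaded sketch, not kernel code: ``toy_mm``, ``toy_vma``, ``toy_vma_tryread()`` and ``toy_read_start()`` are hypothetical names, and the locks are reduced to plain counters so that only the control flow is visible. It assumes only what the text states, namely that the VMA read-lock attempt may fail and that the caller then falls back to the mmap read lock. The real fallback also has to look the VMA up again under the mmap lock, which this model omits.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Toy stand-ins for the real objects. Every name below is hypothetical. */
struct toy_mm {
	int mmap_readers;	/* current holders of the modelled mmap read lock */
};

struct toy_vma {
	struct toy_mm *mm;
	unsigned long vm_start;
	bool try_fails;		/* models any reason the VMA read-lock attempt fails */
	int vma_readers;	/* current holders of the modelled VMA read lock */
};

enum toy_path { TOY_VIA_VMA_LOCK, TOY_VIA_MMAP_LOCK };

/* Models a trylock: it never waits, it either succeeds or reports failure. */
static bool toy_vma_tryread(struct toy_vma *vma)
{
	if (vma->try_fails)
		return false;
	vma->vma_readers++;
	return true;
}

/*
 * Read vm_start under whichever lock could be taken and report the path used.
 * In the kernel the fallback would sleep until any mmap writer has finished;
 * this model only counts holders.
 */
static enum toy_path toy_read_start(struct toy_vma *vma, unsigned long *start)
{
	assert(vma != NULL && start != NULL);

	if (toy_vma_tryread(vma)) {
		*start = vma->vm_start;		/* VMA is stable here */
		vma->vma_readers--;		/* drop the VMA read lock */
		return TOY_VIA_VMA_LOCK;
	}

	vma->mm->mmap_readers++;		/* fall back: take the mmap read lock */
	*start = vma->vm_start;			/* VMA is stable here too */
	vma->mm->mmap_readers--;		/* drop the mmap read lock */
	return TOY_VIA_MMAP_LOCK;
}
```

Calling ``toy_read_start()`` on a VMA with ``try_fails`` clear takes the VMA-lock path; setting ``try_fails`` forces the mmap-lock path. Both paths read the same field and leave both counters at zero.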

				.. note:: The primary users of VMA read locks are page fault handlers, which

				          means that without a VMA write lock, page faults will run concurrent with

				          whatever you are doing.

				Examining all valid lock states:

				.. table::

				   ========= ======== ========= ======= ===== =========== ==========

				   mmap lock VMA lock rmap lock Stable? Read? Write most? Write all?

				   ========= ======== ========= ======= ===== =========== ==========

				   \-        \-       \-        N       N     N           N

				   \-        R        \-        Y       Y     N           N

				   \-        \-       R/W       Y       Y     N           N

				   R/W       \-/R     \-/R/W    Y       Y     N           N

				   W         W        \-/R      Y       Y     Y           N

				   W         W        W         Y       Y     Y           Y

				   ========= ======== ========= ======= ===== =========== ==========
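
The table above can also be read as a small decision function. The sketch below is an illustration only: ``enum lock_mode``, ``struct vma_access`` and ``vma_access_for()`` are hypothetical names that do not exist in the kernel. It encodes the rows as listed, plus the rule stated earlier that a VMA write lock requires the mmap write lock. Combinations the table does not list explicitly are treated permissively, which is an assumption.

```c
#include <stdbool.h>

enum lock_mode { LOCK_NONE, LOCK_READ, LOCK_WRITE };

struct vma_access {
	bool valid;		/* combination is permitted by the locking rules */
	bool stable;		/* "Stable?" column */
	bool can_read;		/* "Read?" column */
	bool write_most;	/* "Write most?" column */
	bool write_all;		/* "Write all?" column */
};

static struct vma_access vma_access_for(enum lock_mode mmap_lock,
					 enum lock_mode vma_lock,
					 enum lock_mode rmap_lock)
{
	struct vma_access a = { .valid = true };

	/* A VMA write lock requires the mmap write lock to be held first. */
	if (vma_lock == LOCK_WRITE && mmap_lock != LOCK_WRITE) {
		a.valid = false;
		return a;
	}

	/* Any one of the three locks stabilises the VMA and permits reads. */
	a.stable = mmap_lock != LOCK_NONE || vma_lock != LOCK_NONE ||
		   rmap_lock != LOCK_NONE;
	a.can_read = a.stable;

	/* Most fields need both the mmap write lock and the VMA write lock. */
	a.write_most = mmap_lock == LOCK_WRITE && vma_lock == LOCK_WRITE;

	/* Writing every field additionally needs the rmap write lock. */
	a.write_all = a.write_most && rmap_lock == LOCK_WRITE;

	return a;
}
```

For example, ``vma_access_for(LOCK_WRITE, LOCK_WRITE, LOCK_READ)`` reports that most fields are writable but not all, matching the fifth row.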

				.. warning:: While it's possible to obtain a VMA lock while holding an mmap read lock,

				             attempting to do the reverse is invalid as it can result in deadlock - if

				             another task already holds an mmap write lock and attempts to acquire a VMA

				             write lock that will deadlock on the VMA read lock.

				All of these locks behave as read/write semaphores in practice, so you can

				obtain either a read or a write lock for each of these.

				.. note:: Generally speaking, a read/write semaphore is a class of lock which

				          permits concurrent readers. However a write lock can only be obtained

				          once all readers have left the critical region (and pending readers

				          made to wait).

				          This renders read locks on a read/write semaphore concurrent with other

				          readers and write locks exclusive against all others holding the semaphore.

				VMA fields

				^^^^^^^^^^

				We can subdivide :c:struct:`!struct vm_area_struct` fields by their purpose, which makes it

				easier to explore their locking characteristics:

				.. note:: We exclude VMA lock-specific fields here to avoid confusion, as these

				          are in effect an internal implementation detail.

				.. table:: Virtual layout fields

				   ===================== ======================================== ===========

				   Field                 Description                              Write lock

				   ===================== ======================================== ===========

				   :c:member:`!vm_start` Inclusive start virtual address of range mmap write,

				                         VMA describes.                           VMA write,

				                                                                  rmap write.

				   :c:member:`!vm_end`   Exclusive end virtual address of range   mmap write,

				                         VMA describes.                           VMA write,

				                                                                  rmap write.

				   :c:member:`!vm_pgoff` Describes the page offset into the file, mmap write,

				                         the original page offset within the      VMA write,

				                         virtual address space (prior to any      rmap write.

				                         :c:func:`!mremap`), or PFN if a PFN map

				                         and the architecture does not support

				                         :c:macro:`!CONFIG_ARCH_HAS_PTE_SPECIAL`.

				   ===================== ======================================== ===========

				These fields describe the size, start and end of the VMA, and as such cannot be

				modified without first being hidden from the reverse mapping since these fields

				are used to locate VMAs within the reverse mapping interval trees.
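
To make the inclusive/exclusive convention and the role of the page offset concrete, here is a minimal userspace sketch for the file-backed case. It is an illustration under stated assumptions: ``struct ex_vma`` and the ``ex_*`` helpers are hypothetical, only the three field names come from the table above, and 4 KiB pages are assumed.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define EX_PAGE_SHIFT 12	/* assumption: 4 KiB pages */

/* Hypothetical cut-down VMA holding only the virtual layout fields. */
struct ex_vma {
	uint64_t vm_start;	/* inclusive start address */
	uint64_t vm_end;	/* exclusive end address */
	uint64_t vm_pgoff;	/* offset of vm_start into the file, in pages */
};

/* vm_start is inside the range, vm_end is the first address outside it. */
static bool ex_vma_contains(const struct ex_vma *vma, uint64_t addr)
{
	return addr >= vma->vm_start && addr < vma->vm_end;
}

/* Because vm_end is exclusive, the size is a plain subtraction. */
static uint64_t ex_vma_nr_pages(const struct ex_vma *vma)
{
	return (vma->vm_end - vma->vm_start) >> EX_PAGE_SHIFT;
}

/* Page of the backing file that a given address within the VMA maps. */
static uint64_t ex_vma_file_page(const struct ex_vma *vma, uint64_t addr)
{
	assert(ex_vma_contains(vma, addr));
	return vma->vm_pgoff + ((addr - vma->vm_start) >> EX_PAGE_SHIFT);
}
```

With ``vm_start = 0x10000``, ``vm_end = 0x14000`` and ``vm_pgoff = 5``, the VMA spans four pages, address ``0x14000`` is outside it, and address ``0x12abc`` maps file page 7.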

				.. table:: Core fields

				   ============================ ======================================== =========================

				   Field                        Description                              Write lock

				   ============================ ======================================== =========================

				   :c:member:`!vm_mm`           Containing mm_struct.                    None - written once on

				                                                                         initial map.

				   :c:member:`!vm_page_prot`    Architecture-specific page table         mmap write, VMA write.

				                                protection bits determined from VMA

				                                flags.

				   :c:member:`!vm_flags`        Read-only access to VMA flags describing N/A

				                                attributes of the VMA, in union with

				                                private writable

				                                :c:member:`!__vm_flags`.

				   :c:member:`!__vm_flags`      Private, writable access to VMA flags    mmap write, VMA write.

				                                field, updated by

				                                :c:func:`!vm_flags_*` functions.

				   :c:member:`!vm_file`         If the VMA is file-backed, points to a   None - written once on

				                                struct file object describing the        initial map.

				                                underlying file, if anonymous then

				                                :c:macro:`!NULL`.

				   :c:member:`!vm_ops`          If the VMA is file-backed, then either   None - Written once on

				                                the driver or file-system provides a     initial map by

				                                :c:struct:`!struct vm_operations_struct` :c:func:`!f_ops->mmap()`.

				                                object describing callbacks to be

				                                invoked on VMA lifetime events.

				   :c:member:`!vm_private_data` A :c:member:`!void *` field for          Handled by driver.

				                                driver-specific metadata.

				   ============================ ======================================== =========================

				These are the core fields which describe the MM the VMA belongs to and its attributes.

				.. table:: Config-specific fields

				   ================================= ===================== ======================================== ===============

				   Field                             Configuration option  Description                              Write lock

				   ================================= ===================== ======================================== ===============

				   :c:member:`!anon_name`            CONFIG_ANON_VMA_NAME  A field for storing a                    mmap write,

				                                                           :c:struct:`!struct anon_vma_name`        VMA write.

				                                                           object providing a name for anonymous

				                                                           mappings, or :c:macro:`!NULL` if none

				                                                           is set or the VMA is file-backed. The

				                                                           underlying object is reference counted

				                                                           and can be shared across multiple VMAs

				                                                           for scalability.

				   :c:member:`!swap_readahead_info`  CONFIG_SWAP           Metadata used by the swap mechanism      mmap read,

				                                                           to perform readahead. This field is      swap-specific

				                                                           accessed atomically.                     lock.

				   :c:member:`!vm_policy`            CONFIG_NUMA           :c:type:`!mempolicy` object which        mmap write,

				                                                           describes the NUMA behaviour of the      VMA write.

				                                                           VMA. The underlying object is reference

				                                                           counted.

				   :c:member:`!numab_state`          CONFIG_NUMA_BALANCING :c:type:`!vma_numab_state` object which  mmap read,

				                                                           describes the current state of           numab-specific

				                                                           NUMA balancing in relation to this VMA.  lock.

				                                                           Updated under mmap read lock by

				                                                           :c:func:`!task_numa_work`.

				   :c:member:`!vm_userfaultfd_ctx`   CONFIG_USERFAULTFD    Userfaultfd context wrapper object of    mmap write,

				                                                           type :c:type:`!vm_userfaultfd_ctx`,      VMA write.

				                                                           either of zero size if userfaultfd is

				                                                           disabled, or containing a pointer

				                                                           to an underlying

				                                                           :c:type:`!userfaultfd_ctx` object which

				                                                           describes userfaultfd metadata.

				   ================================= ===================== ======================================== ===============

				These fields are present or not depending on whether the relevant kernel

				configuration option is set.

				.. table:: Reverse mapping fields

				   =================================== ========================================= ============================

				   Field                               Description                               Write lock

				   =================================== ========================================= ============================

				   :c:member:`!shared.rb`              A red/black tree node used, if the        mmap write, VMA write,

				                                       mapping is file-backed, to place the VMA  i_mmap write.

				                                       in the

				                                       :c:member:`!struct address_space->i_mmap`

				                                       red/black interval tree.

				   :c:member:`!shared.rb_subtree_last` Metadata used for management of the       mmap write, VMA write,

				                                       interval tree if the VMA is file-backed.  i_mmap write.

				   :c:member:`!anon_vma_chain`         List of pointers to both forked/CoW’d     mmap read, anon_vma write.

				                                       :c:type:`!anon_vma` objects and

				                                       :c:member:`!vma->anon_vma` if it is

				                                       non-:c:macro:`!NULL`.

				   :c:member:`!anon_vma`               :c:type:`!anon_vma` object used by        When :c:macro:`NULL` and

				                                       anonymous folios mapped exclusively to    setting non-:c:macro:`NULL`:

				                                       this VMA. Initially set by                mmap read, page_table_lock.

				                                       :c:func:`!anon_vma_prepare` serialised

				                                       by the :c:macro:`!page_table_lock`. This  When non-:c:macro:`NULL` and

				                                       is set as soon as any page is faulted in. setting :c:macro:`NULL`:

				                                                                                 mmap write, VMA write,

				                                                                                 anon_vma write.

				   =================================== ========================================= ============================

				These fields are used to both place the VMA within the reverse mapping, and for

				anonymous mappings, to be able to access both related :c:struct:`!struct anon_vma` objects

				and the :c:struct:`!struct anon_vma` in which folios mapped exclusively to this VMA should

				reside.

				.. note:: If a file-backed mapping is mapped with :c:macro:`!MAP_PRIVATE` set

				          then it can be in both the :c:type:`!anon_vma` and :c:type:`!i_mmap`

				          trees at the same time, so all of these fields might be utilised at

				          once.

				Page tables

				-----------

				We won't speak exhaustively on the subject but broadly speaking, page tables map

				virtual addresses to physical ones through a series of page tables, each of

				which contains entries with physical addresses for the next page table level

				(along with flags), and at the leaf level the physical addresses of the

				underlying physical data pages or a special entry such as a swap entry,

				migration entry or other special marker. Offsets into these pages are provided

				by the virtual address itself.

				In Linux these are divided into five levels - PGD, P4D, PUD, PMD and PTE. Huge

				pages might eliminate one or two of these levels, but when this is the case we

				typically refer to the leaf level as the PTE level regardless.
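
As a rough illustration of how the virtual address itself supplies the offsets at each level, the sketch below splits an address into five table indices plus a page offset. It assumes an x86-64 style layout with 4 KiB pages and 9 index bits per level; other architectures and configurations use different widths. The names ``demo_pt_index`` and ``demo_page_offset`` are hypothetical and are not kernel helpers.

```c
#include <assert.h>
#include <stdint.h>

/* Assumed layout: 4 KiB pages, 9 index bits per level, five levels. */
#define DEMO_PAGE_SHIFT   12
#define DEMO_BITS_PER_LVL 9
#define DEMO_LVL_MASK     ((1ULL << DEMO_BITS_PER_LVL) - 1)

/* Levels are numbered from the leaf (PTE) upwards. */
enum demo_pt_level {
	DEMO_LVL_PTE = 0,
	DEMO_LVL_PMD,
	DEMO_LVL_PUD,
	DEMO_LVL_P4D,
	DEMO_LVL_PGD
};

/* Index of the entry within the page table at the given level. */
static inline unsigned int demo_pt_index(uint64_t vaddr, enum demo_pt_level level)
{
	unsigned int shift = DEMO_PAGE_SHIFT + DEMO_BITS_PER_LVL * (unsigned int)level;

	return (unsigned int)((vaddr >> shift) & DEMO_LVL_MASK);
}

/* Byte offset within the final data page. */
static inline uint64_t demo_page_offset(uint64_t vaddr)
{
	return vaddr & ((1ULL << DEMO_PAGE_SHIFT) - 1);
}
```

Under these assumptions, each level consumes the next 9 bits of the address, starting just above the 12-bit page offset.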

				.. note:: In instances where the architecture supports fewer than five page

					  table levels, the kernel cleverly 'folds' page table levels, that is stubbing

					  out functions related to the skipped levels. This allows us to

					  conceptually act as if there were always five levels, even if the

					  compiler might, in practice, eliminate any code relating to missing

					  ones.

				There are four key operations typically performed on page tables:

				1. **Traversing** page tables - Simply reading page tables in order to traverse

				   them. This only requires that the VMA is kept stable, so a lock which

				   establishes this suffices for traversal (there are also lockless variants

				   which eliminate even this requirement, such as :c:func:`!gup_fast`).

				2. **Installing** page table mappings - Whether creating a new mapping or

				   modifying an existing one in such a way as to change its identity. This

				   requires that the VMA is kept stable via an mmap or VMA lock (explicitly not

				   rmap locks).

				3. **Zapping/unmapping** page table entries - This is what the kernel calls

				   clearing page table mappings at the leaf level only, whilst leaving all page

				   tables in place. This is a very common operation in the kernel performed on

				   file truncation, the :c:macro:`!MADV_DONTNEED` operation via

				   :c:func:`!madvise`, and others. This is performed by a number of functions

				   including :c:func:`!unmap_mapping_range` and :c:func:`!unmap_mapping_pages`.

				   The VMA need only be kept stable for this operation.

				4. **Freeing** page tables - When finally the kernel removes page tables from a

				   userland process (typically via :c:func:`!free_pgtables`) extreme care must

				   be taken to ensure this is done safely, as this logic finally frees all page

				   tables in the specified range, ignoring existing leaf entries (it assumes the

				   caller has both zapped the range and prevented any further faults or

				   modifications within it).

				.. note:: Modifying mappings for reclaim or migration is performed under rmap

				          lock as it, like zapping, does not fundamentally modify the identity

				          of what is being mapped.

				**Traversing** and **zapping** ranges can be performed holding any one of the

				locks described in the terminology section above - that is the mmap lock, the

				VMA lock or either of the reverse mapping locks.

				That is - as long as you keep the relevant VMA **stable** - you are good to go

				ahead and perform these operations on page tables (though internally, kernel

				operations that perform writes also acquire internal page table locks to

				serialise - see the page table implementation detail section for more details).

				When **installing** page table entries, the mmap or VMA lock must be held to

				keep the VMA stable. We explore why this is in the page table locking details

				section below.

				.. warning:: Page tables are normally only traversed in regions covered by VMAs.

				             If you want to traverse page tables in areas that might not be

				             covered by VMAs, heavier locking is required.

				             See :c:func:`!walk_page_range_novma` for details.

				**Freeing** page tables is an entirely internal memory management operation and

				has special requirements (see the page freeing section below for more details).

				.. warning:: When **freeing** page tables, it must not be possible for VMAs

				             containing the ranges those page tables map to be accessible via

				             the reverse mapping.

				             The :c:func:`!free_pgtables` function removes the relevant VMAs

				             from the reverse mappings, but no other VMAs can be permitted to be

				             accessible and span the specified range.

				Lock ordering

				-------------

				As we have multiple locks across the kernel which may or may not be taken at the

				same time as explicit mm or VMA locks, we have to be wary of lock inversion, and

				the **order** in which locks are acquired and released becomes very important.

				.. note:: Lock inversion occurs when two threads need to acquire multiple locks,

				   but in doing so inadvertently cause a mutual deadlock.

				   For example, consider thread 1 which holds lock A and tries to acquire lock B,

				   while thread 2 holds lock B and tries to acquire lock A.

				   Both threads are now deadlocked on each other. However, had they attempted to

				   acquire locks in the same order, one would have waited for the other to

				   complete its work and no deadlock would have occurred.

				The opening comment in :c:macro:`!mm/rmap.c` describes in detail the required

				ordering of locks within memory management code:

				.. code-block::

				  inode->i_rwsem        (while writing or truncating, not reading or faulting)

				    mm->mmap_lock

				      mapping->invalidate_lock (in filemap_fault)

				        folio_lock

				          hugetlbfs_i_mmap_rwsem_key (in huge_pmd_share, see hugetlbfs below)

				            vma_start_write

				              mapping->i_mmap_rwsem

				                anon_vma->rwsem

				                  mm->page_table_lock or pte_lock

				                    swap_lock (in swap_duplicate, swap_info_get)

				                      mmlist_lock (in mmput, drain_mmlist and others)

				                      mapping->private_lock (in block_dirty_folio)

				                          i_pages lock (widely used)

				                            lruvec->lru_lock (in folio_lruvec_lock_irq)

				                      inode->i_lock (in set_page_dirty's __mark_inode_dirty)

				                      bdi.wb->list_lock (in set_page_dirty's __mark_inode_dirty)

				                        sb_lock (within inode_lock in fs/fs-writeback.c)

				                        i_pages lock (widely used, in set_page_dirty,

				                                  in arch-dependent flush_dcache_mmap_lock,

				                                  within bdi.wb->list_lock in __sync_single_inode)

				There is also a file-system specific lock ordering comment located at the top of

				:c:macro:`!mm/filemap.c`:

				.. code-block::

				  ->i_mmap_rwsem                        (truncate_pagecache)

				    ->private_lock                      (__free_pte->block_dirty_folio)

				      ->swap_lock                       (exclusive_swap_page, others)

				        ->i_pages lock

				  ->i_rwsem

				    ->invalidate_lock                   (acquired by fs in truncate path)

				      ->i_mmap_rwsem                    (truncate->unmap_mapping_range)

				  ->mmap_lock

				    ->i_mmap_rwsem

				      ->page_table_lock or pte_lock     (various, mainly in memory.c)

				        ->i_pages lock                  (arch-dependent flush_dcache_mmap_lock)

				  ->mmap_lock

				    ->invalidate_lock                   (filemap_fault)

				      ->lock_page                       (filemap_fault, access_process_vm)

				  ->i_rwsem                             (generic_perform_write)

				    ->mmap_lock                         (fault_in_readable->do_page_fault)

				  bdi->wb.list_lock

				    sb_lock                             (fs/fs-writeback.c)

				    ->i_pages lock                      (__sync_single_inode)

				  ->i_mmap_rwsem

				    ->anon_vma.lock                     (vma_merge)

				  ->anon_vma.lock

				    ->page_table_lock or pte_lock       (anon_vma_prepare and various)

				  ->page_table_lock or pte_lock

				    ->swap_lock                         (try_to_unmap_one)

				    ->private_lock                      (try_to_unmap_one)

				    ->i_pages lock                      (try_to_unmap_one)

				    ->lruvec->lru_lock                  (follow_page_mask->mark_page_accessed)

				    ->lruvec->lru_lock                  (check_pte_range->folio_isolate_lru)

				    ->private_lock                      (folio_remove_rmap_pte->set_page_dirty)

				    ->i_pages lock                      (folio_remove_rmap_pte->set_page_dirty)

				    bdi.wb->list_lock                   (folio_remove_rmap_pte->set_page_dirty)

				    ->inode->i_lock                     (folio_remove_rmap_pte->set_page_dirty)

				    bdi.wb->list_lock                   (zap_pte_range->set_page_dirty)

				    ->inode->i_lock                     (zap_pte_range->set_page_dirty)

				    ->private_lock                      (zap_pte_range->block_dirty_folio)

				Please check the current state of these comments which may have changed since

				the time of writing of this document.
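
To make the idea of a fixed acquisition order concrete, here is a minimal userland sketch that assigns each lock a rank and rejects any acquisition sequence that does not strictly descend the hierarchy. The ranks cover only a subset of the ordering quoted from ``mm/rmap.c`` above, and the type and function names are hypothetical. This is not how the kernel enforces ordering; it only shows what "same order everywhere" means.

```c
#include <assert.h>
#include <stdbool.h>

/*
 * Hypothetical ranks for a subset of the hierarchy quoted above.
 * Outer locks have lower ranks and must be taken first.
 */
enum demo_lock_rank {
	DEMO_RANK_I_RWSEM = 0,
	DEMO_RANK_MMAP_LOCK,
	DEMO_RANK_INVALIDATE_LOCK,
	DEMO_RANK_FOLIO_LOCK,
	DEMO_RANK_VMA_WRITE,
	DEMO_RANK_I_MMAP_RWSEM,
	DEMO_RANK_ANON_VMA_RWSEM,
	DEMO_RANK_PTL
};

/*
 * Returns true if the locks in seq[0..n-1], taken in that order,
 * respect the hierarchy, i.e. every lock has a strictly higher rank
 * than the one acquired before it.
 */
static inline bool demo_order_is_valid(const enum demo_lock_rank *seq, int n)
{
	for (int i = 1; i < n; i++) {
		if (seq[i] <= seq[i - 1])
			return false; /* would invert the documented order */
	}
	return true;
}
```

Two code paths that both pass this check can never hold a pair of these locks in opposite orders, which is exactly the property that rules out the deadlock described in the note above.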

				------------------------------

				Locking Implementation Details

				------------------------------

				.. warning:: Locking rules for PTE-level page tables are very different from

				             locking rules for page tables at other levels.

				Page table locking details

				--------------------------

				In addition to the locks described in the terminology section above, we have

				additional locks dedicated to page tables:

				* **Higher level page table locks** - Higher level page tables, that is PGD, P4D

				  and PUD each make use of the process address space granularity

				  :c:member:`!mm->page_table_lock` lock when modified.

				* **Fine-grained page table locks** - PMDs and PTEs each have fine-grained locks

				  either kept within the folios describing the page tables or allocated

				  separately and pointed at by the folios if :c:macro:`!ALLOC_SPLIT_PTLOCKS` is

				  set. The PMD spin lock is obtained via :c:func:`!pmd_lock`, however PTEs are

				  mapped into higher memory (if a 32-bit system) and carefully locked via

				  :c:func:`!pte_offset_map_lock`.

				These locks represent the minimum required to interact with each page table

				level, but there are further requirements.

				Importantly, note that on a **traversal** of page tables, sometimes no such

				locks are taken. However, at the PTE level, at least concurrent page table

				deletion must be prevented (using RCU) and the page table must be mapped into

				high memory, see below.

				Whether care is taken on reading the page table entries depends on the

				architecture, see the section on atomicity below.

				Locking rules

				^^^^^^^^^^^^^

				We establish basic locking rules when interacting with page tables:

				* When changing a page table entry the page table lock for that page table

				  **must** be held, except if you can safely assume nobody can access the page

				  tables concurrently (such as on invocation of :c:func:`!free_pgtables`).

				* Reads from and writes to page table entries must be *appropriately*

				  atomic. See the section on atomicity below for details.

				* Populating previously empty entries requires that the mmap or VMA locks are

				  held (read or write), doing so with only rmap locks would be dangerous (see

				  the warning below).

				* As mentioned previously, zapping can be performed while simply keeping the VMA

				  stable, that is holding any one of the mmap, VMA or rmap locks.

				.. warning:: Populating previously empty entries is dangerous as, when unmapping

				             VMAs, :c:func:`!vms_clear_ptes` has a window of time between

				             zapping (via :c:func:`!unmap_vmas`) and freeing page tables (via

				             :c:func:`!free_pgtables`), where the VMA is still visible in the

				             rmap tree. :c:func:`!free_pgtables` assumes that the zap has

				             already been performed and removes PTEs unconditionally (along with

				             all other page tables in the freed range), so installing new PTE

				             entries could leak memory and also cause other unexpected and

				             dangerous behaviour.

				There are additional rules applicable when moving page tables, which we discuss

				in the section on this topic below.

				PTE-level page tables are different from page tables at other levels, and there

				are extra requirements for accessing them:

				* On 32-bit architectures, they may be in high memory (meaning they need to be

				  mapped into kernel memory to be accessible).

				* When empty, they can be unlinked and RCU-freed while holding an mmap lock or

				  rmap lock for reading in combination with the PTE and PMD page table locks.

				  In particular, this happens in :c:func:`!retract_page_tables` when handling

				  :c:macro:`!MADV_COLLAPSE`.

				  So accessing PTE-level page tables requires at least holding an RCU read lock;

				  but that only suffices for readers that can tolerate racing with concurrent

				  page table updates such that an empty PTE is observed (in a page table that

				  has actually already been detached and marked for RCU freeing) while another

				  new page table has been installed in the same location and filled with

				  entries. Writers normally need to take the PTE lock and revalidate that the

				  PMD entry still refers to the same PTE-level page table.

				To access PTE-level page tables, a helper like :c:func:`!pte_offset_map_lock` or

				:c:func:`!pte_offset_map` can be used depending on stability requirements.

				These map the page table into kernel memory if required, take the RCU lock, and

				depending on variant, may also look up or acquire the PTE lock.

				See the comment on :c:func:`!__pte_offset_map_lock`.

				Atomicity

				^^^^^^^^^

				Regardless of page table locks, the MMU hardware concurrently updates accessed

				and dirty bits (perhaps more, depending on architecture). Additionally, page

				table traversal operations may run in parallel (though holding the VMA stable),

				and functionality like GUP-fast locklessly traverses (that is, reads) page

				tables without keeping the VMA stable at all.

				When performing a page table traversal and keeping the VMA stable, whether a

				read must be performed once and only once or not depends on the architecture

				(for instance x86-64 does not require any special precautions).

				If a write is being performed, or if a read informs whether a write takes place

				(on an installation of a page table entry say, for instance in

				:c:func:`!__pud_install`), special care must always be taken. In these cases we

				can never assume that page table locks give us entirely exclusive access, and

				must retrieve page table entries once and only once.

				If we are reading page table entries, then we need only ensure that the compiler

				does not rearrange our loads. This is achieved via :c:func:`!pXXp_get`

				functions - :c:func:`!pgdp_get`, :c:func:`!p4dp_get`, :c:func:`!pudp_get`,

				:c:func:`!pmdp_get`, and :c:func:`!ptep_get`.

				Each of these uses :c:func:`!READ_ONCE` to guarantee that the compiler reads

				the page table entry only once.

				However, if we wish to manipulate an existing page table entry and care about

				the previously stored data, we must go further and use a hardware atomic

				operation as, for example, in :c:func:`!ptep_get_and_clear`.
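
The difference between a read-once load and an atomic get-and-clear can be sketched in userland with C11 atomics. This is only an analogy: the ``demo_`` names are hypothetical, the bit positions are assumptions chosen for illustration, and the kernel helpers are architecture-specific rather than built on ``<stdatomic.h>``.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical stand-in for a single page table entry slot. */
typedef _Atomic uint64_t demo_pte_t;

/* Assumed flag bits, for illustration only. */
#define DEMO_PTE_PRESENT (1ULL << 0)
#define DEMO_PTE_DIRTY   (1ULL << 6)

/*
 * Read the entry exactly once. Every later decision must use the
 * returned snapshot, never a second load of the slot.
 */
static inline uint64_t demo_ptep_get(demo_pte_t *ptep)
{
	return atomic_load_explicit(ptep, memory_order_relaxed);
}

/*
 * Clear the entry and return what it held in one atomic step, so that
 * a bit set concurrently (for example a dirty bit set by hardware)
 * cannot be lost between a separate read and a separate write.
 */
static inline uint64_t demo_ptep_get_and_clear(demo_pte_t *ptep)
{
	return atomic_exchange(ptep, 0);
}
```

A caller that zaps an entry would inspect the returned value for the dirty bit rather than re-reading the slot, since the slot is already empty at that point.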

				Equally, operations that do not rely on the VMA being held stable, such as

				GUP-fast (see :c:func:`!gup_fast` and its various page table level handlers like

				:c:func:`!gup_fast_pte_range`), must very carefully interact with page table

				entries, using functions such as :c:func:`!ptep_get_lockless` and equivalent for

				higher level page table levels.

				Writes to page table entries must also be appropriately atomic, as established

				by :c:func:`!set_pXX` functions - :c:func:`!set_pgd`, :c:func:`!set_p4d`,

				:c:func:`!set_pud`, :c:func:`!set_pmd`, and :c:func:`!set_pte`.

				Equally functions which clear page table entries must be appropriately atomic,

				as in :c:func:`!pXX_clear` functions - :c:func:`!pgd_clear`,

				:c:func:`!p4d_clear`, :c:func:`!pud_clear`, :c:func:`!pmd_clear`, and

				:c:func:`!pte_clear`.

				Page table installation

				^^^^^^^^^^^^^^^^^^^^^^^

				Page table installation is performed with the VMA held stable explicitly by an

				mmap or VMA lock in read or write mode (see the warning in the locking rules

				section for details as to why).

				When allocating a P4D, PUD or PMD and setting the relevant entry in the above

				PGD, P4D or PUD, the :c:member:`!mm->page_table_lock` must be held. This is

				acquired in :c:func:`!__p4d_alloc`, :c:func:`!__pud_alloc` and

				:c:func:`!__pmd_alloc` respectively.

				.. note:: :c:func:`!__pmd_alloc` actually invokes :c:func:`!pud_lock` and

				   :c:func:`!pud_lockptr` in turn, however at the time of writing it ultimately

				   references the :c:member:`!mm->page_table_lock`.

				Allocating a PTE will either use the :c:member:`!mm->page_table_lock` or, if

				:c:macro:`!USE_SPLIT_PMD_PTLOCKS` is defined, a lock embedded in the PMD

				physical page metadata in the form of a :c:struct:`!struct ptdesc`, acquired by

				:c:func:`!pmd_ptdesc` called from :c:func:`!pmd_lock` and ultimately

				:c:func:`!__pte_alloc`.

				Finally, modifying the contents of the PTE requires special treatment, as the

				PTE page table lock must be acquired whenever we want stable and exclusive

				access to entries contained within a PTE, especially when we wish to modify

				them.

				This is performed via :c:func:`!pte_offset_map_lock` which carefully checks to

				ensure that the PTE hasn't changed from under us, ultimately invoking

				:c:func:`!pte_lockptr` to obtain a spin lock at PTE granularity contained within

				the :c:struct:`!struct ptdesc` associated with the physical PTE page. The lock

				must be released via :c:func:`!pte_unmap_unlock`.

				.. note:: There are some variants on this, such as

				   :c:func:`!pte_offset_map_rw_nolock` when we know we hold the PTE stable but

				   for brevity we do not explore this.  See the comment for

				   :c:func:`!__pte_offset_map_lock` for more details.

				When modifying data in ranges we typically only wish to allocate higher page

				tables as necessary, using these locks to avoid races or overwriting anything,

				and set/clear data at the PTE level as required (for instance when page faulting

				or zapping).

				A typical pattern taken when traversing page table entries to install a new

				mapping is to optimistically determine whether the page table entry in the table

				above is empty, if so, only then acquiring the page table lock and checking

				again to see if it was allocated underneath us.

				This allows for a traversal with page table locks only being taken when

				required. An example of this is :c:func:`!__pud_alloc`.
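
The check, lock, recheck pattern described above can be sketched in userland as follows. This is a simplified model, not the kernel implementation: ``struct demo_mm``, its spin lock built from ``atomic_flag``, and ``demo_table_alloc`` are all hypothetical stand-ins, and the table is allocated with ``calloc`` purely for illustration.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdlib.h>

#define DEMO_ENTRIES 512

struct demo_mm {
	atomic_flag lock;               /* stand-in for a page table lock */
	_Atomic(unsigned long *) slot;  /* stand-in for an upper-level entry */
};

static inline void demo_mm_init(struct demo_mm *mm)
{
	atomic_flag_clear(&mm->lock);
	atomic_init(&mm->slot, NULL);
}

static inline void demo_lock(struct demo_mm *mm)
{
	while (atomic_flag_test_and_set_explicit(&mm->lock, memory_order_acquire))
		; /* spin until the lock is free */
}

static inline void demo_unlock(struct demo_mm *mm)
{
	atomic_flag_clear_explicit(&mm->lock, memory_order_release);
}

/* Returns the table referenced by the slot, or NULL if allocation fails. */
static inline unsigned long *demo_table_alloc(struct demo_mm *mm)
{
	/* Optimistic check: no lock is taken if the table already exists. */
	unsigned long *table = atomic_load(&mm->slot);
	if (table)
		return table;

	/* Allocate outside the lock to keep the critical section short. */
	unsigned long *new_table = calloc(DEMO_ENTRIES, sizeof(*new_table));
	if (!new_table)
		return NULL;

	demo_lock(mm);
	/* Recheck: another thread may have installed a table meanwhile. */
	table = atomic_load(&mm->slot);
	if (!table) {
		atomic_store(&mm->slot, new_table);
		table = new_table;
		new_table = NULL;
	}
	demo_unlock(mm);

	free(new_table); /* no-op unless we lost the race */
	return table;
}
```

The common case, where the table is already present, never touches the lock; only the rare installation path pays for it.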

				At the leaf page table, that is the PTE, we can't entirely rely on this pattern

				as we have separate PMD and PTE locks and a THP collapse for instance might have

				eliminated the PMD entry as well as the PTE from under us.

				This is why :c:func:`!__pte_offset_map_lock` locklessly retrieves the PMD entry

				for the PTE, carefully checking it is as expected, before acquiring the

				PTE-specific lock, and then *again* checking that the PMD entry is as expected.

				If a THP collapse (or similar) were to occur then the lock on both pages would

				be acquired, so we can ensure this is prevented while the PTE lock is held.

				Installing entries this way ensures mutual exclusion on write.

				Page table freeing

				^^^^^^^^^^^^^^^^^^

				Tearing down page tables themselves is something that requires significant

				care. There must be no way that page tables designated for removal can be

				traversed or referenced by concurrent tasks.

				It is insufficient to simply hold an mmap write lock and VMA lock (which will

				prevent racing faults, and rmap operations), as a file-backed mapping can be

				truncated under the :c:struct:`!struct address_space->i_mmap_rwsem` alone.

				As a result, no VMA which can be accessed via the reverse mapping (either

				through the :c:struct:`!struct anon_vma->rb_root` or the :c:member:`!struct

				address_space->i_mmap` interval trees) can have its page tables torn down.

				The operation is typically performed via :c:func:`!free_pgtables`, which assumes

				either the mmap write lock has been taken (as specified by its

				:c:member:`!mm_wr_locked` parameter), or that the VMA is already unreachable.

				It carefully removes the VMA from all reverse mappings. However, it is important

				that no new VMAs overlap the range and that no route remains that permits access

				to addresses within the range whose page tables are being torn down.

				Additionally, it assumes that a zap has already been performed and steps have

				been taken to ensure that no further page table entries can be installed between

				the zap and the invocation of :c:func:`!free_pgtables`.

				Since it is assumed that all such steps have been taken, page table entries are

				cleared without page table locks (in the :c:func:`!pgd_clear`, :c:func:`!p4d_clear`,

				:c:func:`!pud_clear`, and :c:func:`!pmd_clear` functions).

				.. note:: It is possible for leaf page tables to be torn down independent of

				          the page tables above them, as is done by

				          :c:func:`!retract_page_tables`, which is performed under the i_mmap

				          read lock, PMD, and PTE page table locks, without this level of care.

				Page table moving

				^^^^^^^^^^^^^^^^^

				Some functions manipulate page table levels above PMD (that is PUD, P4D and PGD

				page tables). Most notable of these is :c:func:`!mremap`, which is capable of

				moving higher level page tables.

				In these instances, it is required that **all** locks are taken, that is

				the mmap lock, the VMA lock and the relevant rmap locks.

				You can observe this in the :c:func:`!mremap` implementation in the functions

				:c:func:`!take_rmap_locks` and :c:func:`!drop_rmap_locks` which perform the rmap

				side of lock acquisition, invoked ultimately by :c:func:`!move_page_tables`.

				VMA lock internals

				------------------

				Overview

				^^^^^^^^

				VMA read locking is entirely optimistic - if the lock is contended or a competing

				write has started, then we do not obtain a read lock.

				A VMA **read** lock is obtained by :c:func:`!lock_vma_under_rcu`, which first

				calls :c:func:`!rcu_read_lock` to ensure that the VMA is looked up in an RCU

				critical section, then attempts to VMA lock it via :c:func:`!vma_start_read`,

				before releasing the RCU lock via :c:func:`!rcu_read_unlock`.

				VMA read locks hold the read lock on the :c:member:`!vma->vm_lock` semaphore for

				their duration and the caller of :c:func:`!lock_vma_under_rcu` must release it

				via :c:func:`!vma_end_read`.

				VMA **write** locks are acquired via :c:func:`!vma_start_write` in instances where a

				VMA is about to be modified. Unlike :c:func:`!vma_start_read`, the lock is always

				acquired. An mmap write lock **must** be held for the duration of the VMA write

				lock. Releasing or downgrading the mmap write lock also releases the VMA write

				lock, so there is no :c:func:`!vma_end_write` function.

				Note that a semaphore write lock is not held across a VMA lock. Rather, a

				sequence number is used for serialisation, and the write semaphore is only

				acquired at the point of write lock to update this.

				This ensures the semantics we require - VMA write locks provide exclusive write

				access to the VMA.

				Implementation details

				^^^^^^^^^^^^^^^^^^^^^^

				The VMA lock mechanism is designed to be a lightweight means of avoiding the use

				of the heavily contended mmap lock. It is implemented using a combination of a

				read/write semaphore and sequence numbers belonging to the containing

				:c:struct:`!struct mm_struct` and the VMA.

				Read locks are acquired via :c:func:`!vma_start_read`, which is an optimistic

				operation, i.e. it tries to acquire a read lock but returns false if it is

				unable to do so. At the end of the read operation, :c:func:`!vma_end_read` is

				called to release the VMA read lock.

				Invoking :c:func:`!vma_start_read` requires that :c:func:`!rcu_read_lock` has

				been called first, establishing that we are in an RCU critical section upon VMA

				read lock acquisition. Once acquired, the RCU lock can be released as it is only

				required for lookup. This is abstracted by :c:func:`!lock_vma_under_rcu` which

				is the interface a user should use.

				Writing requires the mmap to be write-locked and the VMA lock to be acquired via

				:c:func:`!vma_start_write`, however the write lock is released by the termination or

				downgrade of the mmap write lock so no :c:func:`!vma_end_write` is required.

				All this is achieved by the use of per-mm and per-VMA sequence counts, which are

				used in order to reduce complexity, especially for operations which write-lock

				multiple VMAs at once.

				If the mm sequence count, :c:member:`!mm->mm_lock_seq` is equal to the VMA

				sequence count :c:member:`!vma->vm_lock_seq` then the VMA is write-locked. If

				they differ, then it is not.

				Each time the mmap write lock is released in :c:func:`!mmap_write_unlock` or

				:c:func:`!mmap_write_downgrade`, :c:func:`!vma_end_write_all` is invoked which

				also increments :c:member:`!mm->mm_lock_seq` via

				:c:func:`!mm_lock_seqcount_end`.

				This way, we ensure that, regardless of the VMA's sequence number, a write lock

				is never incorrectly indicated and that when we release an mmap write lock we

				efficiently release **all** VMA write locks contained within the mmap at the

				same time.
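
The sequence count comparison can be modelled in a few lines of userland C. This sketch deliberately omits the read/write semaphore, RCU and all memory ordering, and it assumes a freshly initialised VMA starts with a count that differs from the mm count. The ``demo_`` names are hypothetical and only mirror the kernel function names for readability.

```c
#include <assert.h>
#include <stdbool.h>

struct demo_mm {
	unsigned int mm_lock_seq;
};

struct demo_vma {
	struct demo_mm *mm;
	unsigned int vm_lock_seq;
};

/* Assumption: a new VMA must not appear write-locked. */
static inline void demo_vma_init(struct demo_vma *vma, struct demo_mm *mm)
{
	vma->mm = mm;
	vma->vm_lock_seq = mm->mm_lock_seq - 1u; /* any differing value */
}

/* Equal counts mean the VMA is write-locked. */
static inline bool demo_vma_is_write_locked(const struct demo_vma *vma)
{
	return vma->vm_lock_seq == vma->mm->mm_lock_seq;
}

/* Write-locking copies the mm count into the VMA. */
static inline void demo_vma_start_write(struct demo_vma *vma)
{
	vma->vm_lock_seq = vma->mm->mm_lock_seq;
}

/* Optimistic read lock: fails if the VMA is write-locked. */
static inline bool demo_vma_start_read(const struct demo_vma *vma)
{
	return !demo_vma_is_write_locked(vma);
}

/* Bumping the mm count releases every VMA write lock at once. */
static inline void demo_vma_end_write_all(struct demo_mm *mm)
{
	mm->mm_lock_seq++;
}
```

Note how releasing is a single increment regardless of how many VMAs were write-locked, which is why no per-VMA unlock is needed in this scheme.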

				Since the mmap write lock is exclusive against others who hold it, the automatic

				release of any VMA locks on its release makes sense, as you would never want to

				keep VMAs locked across entirely separate write operations. It also maintains

				correct lock ordering.

				Each time a VMA read lock is acquired, we acquire a read lock on the

				:c:member:`!vma->vm_lock` read/write semaphore and hold it, while checking that

				the sequence count of the VMA does not match that of the mm.

				If it does, the read lock fails. If it does not, we hold the lock, excluding

				writers, but permitting other readers, who will also obtain this lock under RCU.

				Importantly, maple tree operations performed in :c:func:`!lock_vma_under_rcu`

				are also RCU safe, so the whole read lock operation is guaranteed to function

				correctly.

				On the write side, we acquire a write lock on the :c:member:`!vma->vm_lock`

				read/write semaphore, before setting the VMA's sequence number under this lock,

				also simultaneously holding the mmap write lock.

				This way, if any read locks are in effect, :c:func:`!vma_start_write` will sleep

				until these are finished and mutual exclusion is achieved.

				After setting the VMA's sequence number, the lock is released, avoiding

				complexity with a long-term held write lock.

				This clever combination of a read/write semaphore and sequence count allows for

				fast RCU-based per-VMA lock acquisition (especially on page fault, though

				utilised elsewhere) with minimal complexity around lock ordering.

				mmap write lock downgrading

				---------------------------

				When an mmap write lock is held one has exclusive access to resources within the

				mmap (with the usual caveats about requiring VMA write locks to avoid races with

				tasks holding VMA read locks).

				It is then possible to **downgrade** from a write lock to a read lock via

				:c:func:`!mmap_write_downgrade` which, similar to :c:func:`!mmap_write_unlock`,

				implicitly terminates all VMA write locks via :c:func:`!vma_end_write_all`, but

				importantly does not relinquish the mmap lock while downgrading, therefore

				keeping the locked virtual address space stable.

				An interesting consequence of this is that downgraded locks are exclusive

				against any other task possessing a downgraded lock (since a racing task would

				have to acquire a write lock first to downgrade it, and the downgraded lock

				prevents a new write lock from being obtained until the original lock is

				released).

				For clarity, we map read (R)/downgraded write (D)/write (W) locks against one

				another showing which locks exclude the others:

				.. list-table:: Lock exclusivity

				   :widths: 5 5 5 5

				   :header-rows: 1

				   :stub-columns: 1

				   * -

				     - R

				     - D

				     - W

				   * - R

				     - N

				     - N

				     - Y

				   * - D

				     - N

				     - Y

				     - Y

				   * - W

				     - Y

				     - Y

				     - Y

				Here a Y indicates the locks in the matching row/column are mutually exclusive,

				and N indicates that they are not.
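
The same table can be written down as a small lookup, which is occasionally handy when reasoning about which paths may run concurrently. The enum and function names here are hypothetical and exist only for this illustration.

```c
#include <assert.h>
#include <stdbool.h>

/* Read, downgraded write, and write modes of the mmap lock. */
enum demo_mmap_mode { DEMO_R = 0, DEMO_D, DEMO_W };

/* True if holders in modes a and b exclude one another. */
static inline bool demo_modes_exclusive(enum demo_mmap_mode a, enum demo_mmap_mode b)
{
	static const bool excl[3][3] = {
		/*            R      D      W   */
		/* R */ { false, false, true },
		/* D */ { false, true,  true },
		/* W */ { true,  true,  true },
	};

	return excl[a][b];
}
```

The matrix is symmetric, and the only difference between the R and D rows is that a downgraded lock also excludes other downgraded locks.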

				Stack expansion

				---------------

				Stack expansion throws up additional complexities in that we cannot permit there

				to be racing page faults. As a result, we invoke :c:func:`!vma_start_write` to

				prevent this in :c:func:`!expand_downwards` or :c:func:`!expand_upwards`.

6

MAINTAINERS


@@ -7347,7 +7347,7 @@ F:	drivers/gpu/drm/panel/panel-novatek-nt36672a.c
 DRM DRIVER FOR NVIDIA GEFORCE/QUADRO GPUS
 M:	Karol Herbst <kherbst@redhat.com>
 M:	Lyude Paul <lyude@redhat.com>
 M:	Danilo Krummrich <dakr@redhat.com>
 M:	Danilo Krummrich <dakr@kernel.org>
 L:	dri-devel@lists.freedesktop.org
 L:	nouveau@lists.freedesktop.org
 S:	Supported
@@ -8453,7 +8453,7 @@ F:	include/video/s1d13xxxfb.h
 EROFS FILE SYSTEM
 M:	Gao Xiang <xiang@kernel.org>
 M:	Chao Yu <chao@kernel.org>
 R:	Yue Hu <huyue2@coolpad.com>
 R:	Yue Hu <zbestahu@gmail.com>
 R:	Jeffle Xu <jefflexu@linux.alibaba.com>
 R:	Sandeep Dhavale <dhavale@google.com>
 L:	linux-erofs@lists.ozlabs.org
@@ -8924,7 +8924,7 @@ F:	include/linux/arm_ffa.h
 FIRMWARE LOADER (request_firmware)
 M:	Luis Chamberlain <mcgrof@kernel.org>
 M:	Russ Weight <russ.weight@linux.dev>
 M:	Danilo Krummrich <dakr@redhat.com>
 M:	Danilo Krummrich <dakr@kernel.org>
 L:	linux-kernel@vger.kernel.org
 S:	Maintained
 F:	Documentation/firmware_class/

									
										2

Makefile
									
												
				@@ -2,7 +2,7 @@

				VERSION = 6

				PATCHLEVEL = 13

				SUBLEVEL = 0

				EXTRAVERSION = -rc3

				EXTRAVERSION = -rc5

				NAME = Baby Opossum Posse

				# *DOCUMENTATION*

1

arch/arc/Kconfig


@@ -6,6 +6,7 @@
 config ARC
 	def_bool y
 	select ARC_TIMERS
 	select ARCH_HAS_CPU_CACHE_ALIASING
 	select ARCH_HAS_CACHE_LINE_SIZE
 	select ARCH_HAS_DEBUG_VM_PGTABLE
 	select ARCH_HAS_DMA_PREP_COHERENT

									
										8

arch/arc/include/asm/cachetype.h
									
										Normal file
									
												
				@@ -0,0 +1,8 @@

				/* SPDX-License-Identifier: GPL-2.0 */

				#ifndef __ASM_ARC_CACHETYPE_H

				#define __ASM_ARC_CACHETYPE_H

				#define cpu_dcache_is_aliasing()	false

				#define cpu_icache_is_aliasing()	true

				#endif

2

arch/arm64/boot/dts/arm/fvp-base-revc.dts


@@ -233,7 +233,7 @@
 		#interrupt-cells = <0x1>;
 		compatible = "pci-host-ecam-generic";
 		device_type = "pci";
-		bus-range = <0x0 0x1>;
+		bus-range = <0x0 0xff>;
 		reg = <0x0 0x40000000 0x0 0x10000000>;
 		ranges = <0x2000000 0x0 0x50000000 0x0 0x50000000 0x0 0x10000000>;
 		interrupt-map = <0 0 0 1 &gic 0 0 GIC_SPI 168 IRQ_TYPE_LEVEL_HIGH>,

8

arch/arm64/boot/dts/broadcom/bcm2712.dtsi


@@ -67,7 +67,7 @@
 			l2_cache_l0: l2-cache-l0 {
 				compatible = "cache";
 				cache-size = <0x80000>;
-				cache-line-size = <128>;
+				cache-line-size = <64>;
 				cache-sets = <1024>; //512KiB(size)/64(line-size)=8192ways/8-way set
 				cache-level = <2>;
 				cache-unified;
@@ -91,7 +91,7 @@
 			l2_cache_l1: l2-cache-l1 {
 				compatible = "cache";
 				cache-size = <0x80000>;
-				cache-line-size = <128>;
+				cache-line-size = <64>;
 				cache-sets = <1024>; //512KiB(size)/64(line-size)=8192ways/8-way set
 				cache-level = <2>;
 				cache-unified;
@@ -115,7 +115,7 @@
 			l2_cache_l2: l2-cache-l2 {
 				compatible = "cache";
 				cache-size = <0x80000>;
-				cache-line-size = <128>;
+				cache-line-size = <64>;
 				cache-sets = <1024>; //512KiB(size)/64(line-size)=8192ways/8-way set
 				cache-level = <2>;
 				cache-unified;
@@ -139,7 +139,7 @@
 			l2_cache_l3: l2-cache-l3 {
 				compatible = "cache";
 				cache-size = <0x80000>;
-				cache-line-size = <128>;
+				cache-line-size = <64>;
 				cache-sets = <1024>; //512KiB(size)/64(line-size)=8192ways/8-way set
 				cache-level = <2>;
 				cache-unified;
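
Reviewer note: the unchanged `cache-sets` value in these hunks can be checked against the size, line size, and associativity named in the inline comment. Below is a minimal sketch of that arithmetic, assuming the 8-way associativity the comment mentions. `cache_sets` is a hypothetical helper for illustration, not a kernel function.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical helper (not from the kernel tree): number of sets in a
 * set-associative cache, given total size, line size, and associativity.
 */
static uint32_t cache_sets(uint32_t size_bytes, uint32_t line_bytes, uint32_t ways)
{
	assert(line_bytes != 0 && ways != 0);

	/* total lines = size / line size; each set holds `ways` lines */
	return size_bytes / line_bytes / ways;
}
```

With the values from the hunk, `cache_sets(0x80000, 64, 8)` gives 1024, which matches `cache-sets = <1024>`. The old 128-byte line size would give 512 under the same assumptions, so the 64-byte value is the one consistent with the set count.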

									
										35

arch/arm64/kernel/signal.c
									
												
				@@ -36,15 +36,8 @@

				#include <asm/traps.h>

				#include <asm/vdso.h>

				#ifdef CONFIG_ARM64_GCS

				#define GCS_SIGNAL_CAP(addr) (((unsigned long)addr) & GCS_CAP_ADDR_MASK)

				static bool gcs_signal_cap_valid(u64 addr, u64 val)

				{

					return val == GCS_SIGNAL_CAP(addr);

				}

				#endif

				/*

				 * Do a signal return; undo the signal stack. These are aligned to 128-bit.

				 */

				@@ -1062,8 +1055,7 @@ static int restore_sigframe(struct pt_regs *regs,

				#ifdef CONFIG_ARM64_GCS

				static int gcs_restore_signal(void)

				{

					unsigned long __user *gcspr_el0;

					u64 cap;

					u64 gcspr_el0, cap;

					int ret;

					if (!system_supports_gcs())

				@@ -1072,7 +1064,7 @@ static int gcs_restore_signal(void)

					if (!(current->thread.gcs_el0_mode & PR_SHADOW_STACK_ENABLE))

						return 0;

					gcspr_el0 = (unsigned long __user *)read_sysreg_s(SYS_GCSPR_EL0);

					gcspr_el0 = read_sysreg_s(SYS_GCSPR_EL0);

					/*

					 * Ensure that any changes to the GCS done via GCS operations

				@@ -1087,22 +1079,23 @@ static int gcs_restore_signal(void)

					 * then faults will be generated on GCS operations - the main

					 * concern is to protect GCS pages.

					 */

					ret = copy_from_user(&cap, gcspr_el0, sizeof(cap));

					ret = copy_from_user(&cap, (unsigned long __user *)gcspr_el0,

							     sizeof(cap));

					if (ret)

						return -EFAULT;

					/*

					 * Check that the cap is the actual GCS before replacing it.

					 */

					if (!gcs_signal_cap_valid((u64)gcspr_el0, cap))

					if (cap != GCS_SIGNAL_CAP(gcspr_el0))

						return -EINVAL;

					/* Invalidate the token to prevent reuse */

					put_user_gcs(0, (__user void*)gcspr_el0, &ret);

					put_user_gcs(0, (unsigned long __user *)gcspr_el0, &ret);

					if (ret != 0)

						return -EFAULT;

					write_sysreg_s(gcspr_el0 + 1, SYS_GCSPR_EL0);

					write_sysreg_s(gcspr_el0 + 8, SYS_GCSPR_EL0);

					return 0;

				}

				@@ -1421,7 +1414,7 @@ static int get_sigframe(struct rt_sigframe_user_layout *user,

				static int gcs_signal_entry(__sigrestore_t sigtramp, struct ksignal *ksig)

				{

					unsigned long __user *gcspr_el0;

					u64 gcspr_el0;

					int ret = 0;

					if (!system_supports_gcs())

				@@ -1434,18 +1427,20 @@ static int gcs_signal_entry(__sigrestore_t sigtramp, struct ksignal *ksig)

					 * We are entering a signal handler, current register state is

					 * active.

					 */

					gcspr_el0 = (unsigned long __user *)read_sysreg_s(SYS_GCSPR_EL0);

					gcspr_el0 = read_sysreg_s(SYS_GCSPR_EL0);

					/*

					 * Push a cap and the GCS entry for the trampoline onto the GCS.

					 */

					put_user_gcs((unsigned long)sigtramp, gcspr_el0 - 2, &ret);

					put_user_gcs(GCS_SIGNAL_CAP(gcspr_el0 - 1), gcspr_el0 - 1, &ret);

					put_user_gcs((unsigned long)sigtramp,

						     (unsigned long __user *)(gcspr_el0 - 16), &ret);

					put_user_gcs(GCS_SIGNAL_CAP(gcspr_el0 - 8),

						     (unsigned long __user *)(gcspr_el0 - 8), &ret);

					if (ret != 0)

						return ret;

					gcspr_el0 -= 2;

					write_sysreg_s((unsigned long)gcspr_el0, SYS_GCSPR_EL0);

					gcspr_el0 -= 16;

					write_sysreg_s(gcspr_el0, SYS_GCSPR_EL0);

					return 0;

				}
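
Reviewer note: these hunks change `gcspr_el0` from `unsigned long __user *` to `u64`, which is why the offsets change from entry counts (`+ 1`, `- 1`, `- 2`) to byte counts (`+ 8`, `- 8`, `- 16`). The sketch below illustrates that equivalence in plain userspace C, assuming 8-byte entries. The function name and the `slots` array are hypothetical and only stand in for a few GCS entries.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Illustration only: returns 1 when stepping a typed pointer by N entries
 * lands on the same address as subtracting N * 8 bytes from its integer value.
 */
static int pointer_and_byte_offsets_agree(void)
{
	uint64_t slots[4] = { 0 };
	uint64_t *top = &slots[4];	/* one past the end: a valid pointer to form */
	uintptr_t top_addr = (uintptr_t)top;

	assert(sizeof(uint64_t) == 8);

	/* the compiler scales pointer arithmetic by sizeof(uint64_t) */
	return (uintptr_t)(top - 1) == top_addr - 8 &&
	       (uintptr_t)(top - 2) == top_addr - 16;
}
```

Keeping the value as an integer and casting only at the user-access call sites, as the new lines do, makes the byte offsets explicit at each use.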

									
										6

arch/hexagon/Makefile
									
												
				@@ -32,3 +32,9 @@ KBUILD_LDFLAGS += $(ldflags-y)

				TIR_NAME := r19

				KBUILD_CFLAGS += -ffixed-$(TIR_NAME) -DTHREADINFO_REG=$(TIR_NAME) -D__linux__

				KBUILD_AFLAGS += -DTHREADINFO_REG=$(TIR_NAME)

				# Disable HexagonConstExtenders pass for LLVM versions prior to 19.1.0

				# https://github.com/llvm/llvm-project/issues/99714

				ifneq ($(call clang-min-version, 190100),y)

				KBUILD_CFLAGS += -mllvm -hexagon-cext=false

				endif

1

arch/powerpc/configs/pmac32_defconfig


@@ -208,6 +208,7 @@ CONFIG_FB_ATY=y
 CONFIG_FB_ATY_CT=y
 CONFIG_FB_ATY_GX=y
 CONFIG_FB_3DFX=y
 CONFIG_BACKLIGHT_CLASS_DEVICE=y
 # CONFIG_VGA_CONSOLE is not set
 CONFIG_FRAMEBUFFER_CONSOLE=y
 CONFIG_LOGO=y

1

arch/powerpc/configs/ppc6xx_defconfig


@@ -716,6 +716,7 @@ CONFIG_FB_TRIDENT=m
 CONFIG_FB_SM501=m
 CONFIG_FB_IBM_GXT4500=y
 CONFIG_LCD_PLATFORM=m
 CONFIG_BACKLIGHT_CLASS_DEVICE=y
 CONFIG_FRAMEBUFFER_CONSOLE=y
 CONFIG_FRAMEBUFFER_CONSOLE_ROTATION=y
 CONFIG_LOGO=y

									
										36

arch/powerpc/platforms/book3s/vas-api.c
									
												
				@@ -464,7 +464,43 @@ static vm_fault_t vas_mmap_fault(struct vm_fault *vmf)

					return VM_FAULT_SIGBUS;

				}

				/*

				 * During mmap() paste address, mapping VMA is saved in VAS window

				 * struct which is used to unmap during migration if the window is

				 * still open. But the user space can remove this mapping with

				 * munmap() before closing the window and the VMA address will

				 * be invalid. Set VAS window VMA to NULL in this function which

				 * is called before VMA free.

				 */

				static void vas_mmap_close(struct vm_area_struct *vma)

				{

					struct file *fp = vma->vm_file;

					struct coproc_instance *cp_inst = fp->private_data;

					struct vas_window *txwin;

					/* Should not happen */

					if (!cp_inst || !cp_inst->txwin) {

						pr_err("No attached VAS window for the paste address mmap\n");

						return;

					}

					txwin = cp_inst->txwin;

					/*

					 * task_ref.vma is set in coproc_mmap() during mmap paste

					 * address. So it has to be the same VMA that is getting freed.

					 */

					if (WARN_ON(txwin->task_ref.vma != vma)) {

						pr_err("Invalid paste address mmaping\n");

						return;

					}

					mutex_lock(&txwin->task_ref.mmap_mutex);

					txwin->task_ref.vma = NULL;

					mutex_unlock(&txwin->task_ref.mmap_mutex);

				}

				static const struct vm_operations_struct vas_vm_ops = {

					.close = vas_mmap_close,

					.fault = vas_mmap_fault,

				};

									
										2

arch/s390/boot/startup.c
									
												
				@@ -234,6 +234,8 @@ static unsigned long get_vmem_size(unsigned long identity_size,

					vsize = round_up(SZ_2G + max_mappable, rte_size) +

						round_up(vmemmap_size, rte_size) +

						FIXMAP_SIZE + MODULES_LEN + KASLR_LEN;

					if (IS_ENABLED(CONFIG_KMSAN))

						vsize += MODULES_LEN * 2;

					return size_add(vsize, vmalloc_size);

				}

									
										6

arch/s390/boot/vmem.c
									
												
				@@ -306,7 +306,7 @@ static void pgtable_pte_populate(pmd_t *pmd, unsigned long addr, unsigned long e

							pages++;

						}

					}

					if (mode == POPULATE_DIRECT)

					if (mode == POPULATE_IDENTITY)

						update_page_count(PG_DIRECT_MAP_4K, pages);

				}

				@@ -339,7 +339,7 @@ static void pgtable_pmd_populate(pud_t *pud, unsigned long addr, unsigned long e

						}

						pgtable_pte_populate(pmd, addr, next, mode);

					}

					if (mode == POPULATE_DIRECT)

					if (mode == POPULATE_IDENTITY)

						update_page_count(PG_DIRECT_MAP_1M, pages);

				}

				@@ -372,7 +372,7 @@ static void pgtable_pud_populate(p4d_t *p4d, unsigned long addr, unsigned long e

						}

						pgtable_pmd_populate(pud, addr, next, mode);

					}

					if (mode == POPULATE_DIRECT)

					if (mode == POPULATE_IDENTITY)

						update_page_count(PG_DIRECT_MAP_2G, pages);

				}

									
										2

arch/s390/kernel/ipl.c
									
												
				@@ -270,7 +270,7 @@ static ssize_t sys_##_prefix##_##_name##_store(struct kobject *kobj,	\

					if (len >= sizeof(_value))					\

						return -E2BIG;						\

					len = strscpy(_value, buf, sizeof(_value));			\

					if (len < 0)							\

					if ((ssize_t)len < 0)						\

						return len;						\

					strim(_value);							\

					return len;							\
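
Reviewer note: the cast in this hunk matters if `len` is an unsigned `size_t`, which the surrounding `len >= sizeof(_value)` comparison suggests but the hunk does not show. In that case a negative error code stored in `len` becomes a very large positive value, and a plain `len < 0` test can never be true. The sketch below illustrates this. `copy_result`, `as_unsigned_len`, and `is_error` are hypothetical stand-ins, not kernel functions.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <sys/types.h>	/* ssize_t (POSIX) */

/* Hypothetical stand-in for a copy routine that returns a negative errno on failure. */
static ssize_t copy_result(int fail)
{
	return fail ? -E2BIG : 3;
}

/* Store the signed result in an unsigned length, as the store macro appears to do. */
static size_t as_unsigned_len(int fail)
{
	return (size_t)copy_result(fail);
}

/* The check from the new line: reinterpret the length as signed before testing. */
static int is_error(size_t len)
{
	return (ssize_t)len < 0;
}

/* Small self-check showing both outcomes. */
static void self_check(void)
{
	assert(!is_error(as_unsigned_len(0)));
	assert(is_error(as_unsigned_len(1)));
}
```

An alternative design would be to keep the result in a separate signed variable; the one-line cast is the smaller change.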

									
										12

arch/x86/events/intel/core.c
									
												
				@@ -429,6 +429,16 @@ static struct event_constraint intel_lnc_event_constraints[] = {

					EVENT_CONSTRAINT_END

				};

				static struct extra_reg intel_lnc_extra_regs[] __read_mostly = {

					INTEL_UEVENT_EXTRA_REG(0x012a, MSR_OFFCORE_RSP_0, 0xfffffffffffull, RSP_0),

					INTEL_UEVENT_EXTRA_REG(0x012b, MSR_OFFCORE_RSP_1, 0xfffffffffffull, RSP_1),

					INTEL_UEVENT_PEBS_LDLAT_EXTRA_REG(0x01cd),

					INTEL_UEVENT_EXTRA_REG(0x02c6, MSR_PEBS_FRONTEND, 0x9, FE),

					INTEL_UEVENT_EXTRA_REG(0x03c6, MSR_PEBS_FRONTEND, 0x7fff1f, FE),

					INTEL_UEVENT_EXTRA_REG(0x40ad, MSR_PEBS_FRONTEND, 0xf, FE),

					INTEL_UEVENT_EXTRA_REG(0x04c2, MSR_PEBS_FRONTEND, 0x8, FE),

					EVENT_EXTRA_END

				};

				EVENT_ATTR_STR(mem-loads,	mem_ld_nhm,	"event=0x0b,umask=0x10,ldlat=3");

				EVENT_ATTR_STR(mem-loads,	mem_ld_snb,	"event=0xcd,umask=0x1,ldlat=3");

				@@ -6422,7 +6432,7 @@ static __always_inline void intel_pmu_init_lnc(struct pmu *pmu)

					intel_pmu_init_glc(pmu);

					hybrid(pmu, event_constraints) = intel_lnc_event_constraints;

					hybrid(pmu, pebs_constraints) = intel_lnc_pebs_event_constraints;

					hybrid(pmu, extra_regs) = intel_rwc_extra_regs;

					hybrid(pmu, extra_regs) = intel_lnc_extra_regs;

				}

				static __always_inline void intel_pmu_init_skt(struct pmu *pmu)

									
										1

arch/x86/events/intel/ds.c
									
												
				@@ -2517,6 +2517,7 @@ void __init intel_ds_init(void)

							x86_pmu.large_pebs_flags |= PERF_SAMPLE_TIME;

							break;

						case 6:

						case 5:

							x86_pmu.pebs_ept = 1;

							fallthrough;

									
										1

arch/x86/events/intel/uncore.c
									
												
				@@ -1910,6 +1910,7 @@ static const struct x86_cpu_id intel_uncore_match[] __initconst = {

					X86_MATCH_VFM(INTEL_ATOM_GRACEMONT,	&adl_uncore_init),

					X86_MATCH_VFM(INTEL_ATOM_CRESTMONT_X,	&gnr_uncore_init),

					X86_MATCH_VFM(INTEL_ATOM_CRESTMONT,	&gnr_uncore_init),

					X86_MATCH_VFM(INTEL_ATOM_DARKMONT_X,	&gnr_uncore_init),

					{},

				};

				MODULE_DEVICE_TABLE(x86cpu, intel_uncore_match);

									
										1

arch/x86/include/asm/cpufeatures.h
									
												
				@@ -452,6 +452,7 @@

				#define X86_FEATURE_SME_COHERENT	(19*32+10) /* AMD hardware-enforced cache coherency */

				#define X86_FEATURE_DEBUG_SWAP		(19*32+14) /* "debug_swap" AMD SEV-ES full debug state swap support */

				#define X86_FEATURE_SVSM		(19*32+28) /* "svsm" SVSM present */

				#define X86_FEATURE_HV_INUSE_WR_ALLOWED	(19*32+30) /* Allow Write to in-use hypervisor-owned pages */

				/* AMD-defined Extended Feature 2 EAX, CPUID level 0x80000021 (EAX), word 20 */

				#define X86_FEATURE_NO_NESTED_DATA_BP	(20*32+ 0) /* No Nested Data Breakpoints */

									
										2

arch/x86/include/asm/processor.h
									
												
				@@ -230,6 +230,8 @@ static inline unsigned long long l1tf_pfn_limit(void)

					return BIT_ULL(boot_cpu_data.x86_cache_bits - 1 - PAGE_SHIFT);

				}

				void init_cpu_devs(void);

				void get_cpu_vendor(struct cpuinfo_x86 *c);

				extern void early_cpu_init(void);

				extern void identify_secondary_cpu(struct cpuinfo_x86 *);

				extern void print_cpu_info(struct cpuinfo_x86 *);

									
										15

arch/x86/include/asm/static_call.h
									
												
				@@ -65,4 +65,19 @@

				extern bool __static_call_fixup(void *tramp, u8 op, void *dest);

				extern void __static_call_update_early(void *tramp, void *func);

				#define static_call_update_early(name, _func)				\

				({									\

					typeof(&STATIC_CALL_TRAMP(name)) __F = (_func);			\

					if (static_call_initialized) {					\

						__static_call_update(&STATIC_CALL_KEY(name),		\

								     STATIC_CALL_TRAMP_ADDR(name), __F);\

					} else {							\

						WRITE_ONCE(STATIC_CALL_KEY(name).func, _func);		\

						__static_call_update_early(STATIC_CALL_TRAMP_ADDR(name),\

									   __F);			\

					}								\

				})

				#endif /* _ASM_STATIC_CALL_H */

									
										6

arch/x86/include/asm/sync_core.h
									
												
				@@ -8,7 +8,7 @@

				#include <asm/special_insns.h>

				#ifdef CONFIG_X86_32

				static inline void iret_to_self(void)

				static __always_inline void iret_to_self(void)

				{

					asm volatile (

						"pushfl\n\t"

				@@ -19,7 +19,7 @@ static inline void iret_to_self(void)

						: ASM_CALL_CONSTRAINT : : "memory");

				}

				#else

				static inline void iret_to_self(void)

				static __always_inline void iret_to_self(void)

				{

					unsigned int tmp;

				@@ -55,7 +55,7 @@ static inline void iret_to_self(void)

				 * Like all of Linux's memory ordering operations, this is a

				 * compiler barrier as well.

				 */

				static inline void sync_core(void)

				static __always_inline void sync_core(void)

				{

					/*

					 * The SERIALIZE instruction is the most straightforward way to

									
										36

arch/x86/include/asm/xen/hypercall.h
									
												
				@@ -39,9 +39,11 @@

				#include <linux/string.h>

				#include <linux/types.h>

				#include <linux/pgtable.h>

				#include <linux/instrumentation.h>

				#include <trace/events/xen.h>

				#include <asm/alternative.h>

				#include <asm/page.h>

				#include <asm/smap.h>

				#include <asm/nospec-branch.h>

				@@ -86,11 +88,20 @@ struct xen_dm_op_buf;

				 * there aren't more than 5 arguments...)

				 */

				extern struct { char _entry[32]; } hypercall_page[];

				void xen_hypercall_func(void);

				DECLARE_STATIC_CALL(xen_hypercall, xen_hypercall_func);

				#define __HYPERCALL		"call hypercall_page+%c[offset]"

				#define __HYPERCALL_ENTRY(x)						\

					[offset] "i" (__HYPERVISOR_##x * sizeof(hypercall_page[0]))

				#ifdef MODULE

				#define __ADDRESSABLE_xen_hypercall

				#else

				#define __ADDRESSABLE_xen_hypercall __ADDRESSABLE_ASM_STR(__SCK__xen_hypercall)

				#endif

				#define __HYPERCALL					\

					__ADDRESSABLE_xen_hypercall			\

					"call __SCT__xen_hypercall"

				#define __HYPERCALL_ENTRY(x)	"a" (x)

				#ifdef CONFIG_X86_32

				#define __HYPERCALL_RETREG	"eax"

				@@ -148,7 +159,7 @@ extern struct { char _entry[32]; } hypercall_page[];

					__HYPERCALL_0ARG();						\

					asm volatile (__HYPERCALL					\

						      : __HYPERCALL_0PARAM				\

						      : __HYPERCALL_ENTRY(name)				\

						      : __HYPERCALL_ENTRY(__HYPERVISOR_ ## name)	\

						      : __HYPERCALL_CLOBBER0);				\

					(type)__res;							\

				})

				@@ -159,7 +170,7 @@ extern struct { char _entry[32]; } hypercall_page[];

					__HYPERCALL_1ARG(a1);						\

					asm volatile (__HYPERCALL					\

						      : __HYPERCALL_1PARAM				\

						      : __HYPERCALL_ENTRY(name)				\

						      : __HYPERCALL_ENTRY(__HYPERVISOR_ ## name)	\

						      : __HYPERCALL_CLOBBER1);				\

					(type)__res;							\

				})

				@@ -170,7 +181,7 @@ extern struct { char _entry[32]; } hypercall_page[];

					__HYPERCALL_2ARG(a1, a2);					\

					asm volatile (__HYPERCALL					\

						      : __HYPERCALL_2PARAM				\

						      : __HYPERCALL_ENTRY(name)				\

						      : __HYPERCALL_ENTRY(__HYPERVISOR_ ## name)	\

						      : __HYPERCALL_CLOBBER2);				\

					(type)__res;							\

				})

				@@ -181,7 +192,7 @@ extern struct { char _entry[32]; } hypercall_page[];

					__HYPERCALL_3ARG(a1, a2, a3);					\

					asm volatile (__HYPERCALL					\

						      : __HYPERCALL_3PARAM				\

						      : __HYPERCALL_ENTRY(name)				\

						      : __HYPERCALL_ENTRY(__HYPERVISOR_ ## name)	\

						      : __HYPERCALL_CLOBBER3);				\

					(type)__res;							\

				})

				@@ -192,7 +203,7 @@ extern struct { char _entry[32]; } hypercall_page[];

					__HYPERCALL_4ARG(a1, a2, a3, a4);				\

					asm volatile (__HYPERCALL					\

						      : __HYPERCALL_4PARAM				\

						      : __HYPERCALL_ENTRY(name)				\

						      : __HYPERCALL_ENTRY(__HYPERVISOR_ ## name)	\

						      : __HYPERCALL_CLOBBER4);				\

					(type)__res;							\

				})

				@@ -206,12 +217,9 @@ xen_single_call(unsigned int call,

					__HYPERCALL_DECLS;

					__HYPERCALL_5ARG(a1, a2, a3, a4, a5);

					if (call >= PAGE_SIZE / sizeof(hypercall_page[0]))

						return -EINVAL;

					asm volatile(CALL_NOSPEC

					asm volatile(__HYPERCALL

						     : __HYPERCALL_5PARAM

						     : [thunk_target] "a" (&hypercall_page[call])

						     : __HYPERCALL_ENTRY(call)

						     : __HYPERCALL_CLOBBER5);

					return (long)__res;

									
										5

arch/x86/kernel/callthunks.c
									
												
				@@ -142,11 +142,6 @@ static bool skip_addr(void *dest)

					if (dest >= (void *)relocate_kernel &&

					    dest < (void*)relocate_kernel + KEXEC_CONTROL_CODE_MAX_SIZE)

						return true;

				#endif

				#ifdef CONFIG_XEN

					if (dest >= (void *)hypercall_page &&

					    dest < (void*)hypercall_page + PAGE_SIZE)

						return true;

				#endif

					return false;

				}

									
										30

arch/x86/kernel/cet.c
									
												
				@@ -81,6 +81,34 @@ static void do_user_cp_fault(struct pt_regs *regs, unsigned long error_code)

				static __ro_after_init bool ibt_fatal = true;

				/*

				 * By definition, all missing-ENDBRANCH #CPs are a result of WFE && !ENDBR.

				 *

				 * For the kernel IBT no ENDBR selftest where #CPs are deliberately triggered,

				 * the WFE state of the interrupted context needs to be cleared to let execution

				 * continue.  Otherwise when the CPU resumes from the instruction that just

				 * caused the previous #CP, another missing-ENDBRANCH #CP is raised and the CPU

				 * enters a dead loop.

				 *

				 * This is not a problem with IDT because it doesn't preserve WFE and IRET doesn't

				 * set WFE.  But FRED provides space on the entry stack (in an expanded CS area)

				 * to save and restore the WFE state, thus the WFE state is no longer clobbered,

				 * so software must clear it.

				 */

				static void ibt_clear_fred_wfe(struct pt_regs *regs)

				{

					/*

					 * No need to do any FRED checks.

					 *

					 * For IDT event delivery, the high-order 48 bits of CS are pushed

					 * as 0s into the stack, and later IRET ignores these bits.

					 *

					 * For FRED, a test to check if fred_cs.wfe is set would be dropped

					 * by compilers.

					 */

					regs->fred_cs.wfe = 0;

				}

				static void do_kernel_cp_fault(struct pt_regs *regs, unsigned long error_code)

				{

					if ((error_code & CP_EC) != CP_ENDBR) {

				@@ -90,6 +118,7 @@ static void do_kernel_cp_fault(struct pt_regs *regs, unsigned long error_code)

					if (unlikely(regs->ip == (unsigned long)&ibt_selftest_noendbr)) {

						regs->ax = 0;

						ibt_clear_fred_wfe(regs);

						return;

					}

				@@ -97,6 +126,7 @@ static void do_kernel_cp_fault(struct pt_regs *regs, unsigned long error_code)

					if (!ibt_fatal) {

						printk(KERN_DEFAULT CUT_HERE);

						__warn(__FILE__, __LINE__, (void *)regs->ip, TAINT_WARN, regs, NULL);

						ibt_clear_fred_wfe(regs);

						return;

					}

					BUG();

									
										38

arch/x86/kernel/cpu/common.c
									
												
				@@ -867,7 +867,7 @@ static void cpu_detect_tlb(struct cpuinfo_x86 *c)

						tlb_lld_4m[ENTRIES], tlb_lld_1g[ENTRIES]);

				}

				static void get_cpu_vendor(struct cpuinfo_x86 *c)

				void get_cpu_vendor(struct cpuinfo_x86 *c)

				{

					char *v = c->x86_vendor_id;

					int i;

				@@ -1649,15 +1649,11 @@ static void __init early_identify_cpu(struct cpuinfo_x86 *c)

					detect_nopl();

				}

				void __init early_cpu_init(void)

				void __init init_cpu_devs(void)

				{

					const struct cpu_dev *const *cdev;

					int count = 0;

				#ifdef CONFIG_PROCESSOR_SELECT

					pr_info("KERNEL supported cpus:\n");

				#endif

					for (cdev = __x86_cpu_dev_start; cdev < __x86_cpu_dev_end; cdev++) {

						const struct cpu_dev *cpudev = *cdev;

				@@ -1665,20 +1661,30 @@ void __init early_cpu_init(void)

							break;

						cpu_devs[count] = cpudev;

						count++;

					}

				}

				void __init early_cpu_init(void)

				{

				#ifdef CONFIG_PROCESSOR_SELECT

					unsigned int i, j;

					pr_info("KERNEL supported cpus:\n");

				#endif

					init_cpu_devs();

				#ifdef CONFIG_PROCESSOR_SELECT

						{

							unsigned int j;

							for (j = 0; j < 2; j++) {

								if (!cpudev->c_ident[j])

									continue;

								pr_info("  %s %s\n", cpudev->c_vendor,

									cpudev->c_ident[j]);

							}

					for (i = 0; i < X86_VENDOR_NUM && cpu_devs[i]; i++) {

						for (j = 0; j < 2; j++) {

							if (!cpu_devs[i]->c_ident[j])

								continue;

							pr_info("  %s %s\n", cpu_devs[i]->c_vendor,

								cpu_devs[i]->c_ident[j]);

						}

				#endif

					}

				#endif

					early_identify_cpu(&boot_cpu_data);

				}

									
										58

arch/x86/kernel/cpu/mshyperv.c
									
												
				@@ -223,6 +223,63 @@ static void hv_machine_crash_shutdown(struct pt_regs *regs)

					hyperv_cleanup();

				}

				#endif /* CONFIG_CRASH_DUMP */

				static u64 hv_ref_counter_at_suspend;

				static void (*old_save_sched_clock_state)(void);

				static void (*old_restore_sched_clock_state)(void);

				/*

				 * Hyper-V clock counter resets during hibernation. Save and restore clock

				 * offset during suspend/resume, while also considering the time passed

				 * before suspend. This is to make sure that sched_clock using hv tsc page

				 * based clocksource, proceeds from where it left off during suspend and

				 * it shows correct time for the timestamps of kernel messages after resume.

				 */

				static void save_hv_clock_tsc_state(void)

				{

					hv_ref_counter_at_suspend = hv_read_reference_counter();

				}

				static void restore_hv_clock_tsc_state(void)

				{

					/*

					 * Adjust the offsets used by hv tsc clocksource to

					 * account for the time spent before hibernation.

					 * adjusted value = reference counter (time) at suspend

					 *                - reference counter (time) now.

					 */

					hv_adj_sched_clock_offset(hv_ref_counter_at_suspend - hv_read_reference_counter());

				}

				/*

				 * Functions to override save_sched_clock_state and restore_sched_clock_state

				 * functions of x86_platform. The Hyper-V clock counter is reset during

				 * suspend-resume and the offset used to measure time needs to be

				 * corrected, post resume.

				 */

				static void hv_save_sched_clock_state(void)

				{

					old_save_sched_clock_state();

					save_hv_clock_tsc_state();

				}

				static void hv_restore_sched_clock_state(void)

				{

					restore_hv_clock_tsc_state();

					old_restore_sched_clock_state();

				}

				static void __init x86_setup_ops_for_tsc_pg_clock(void)

				{

					if (!(ms_hyperv.features & HV_MSR_REFERENCE_TSC_AVAILABLE))

						return;

					old_save_sched_clock_state = x86_platform.save_sched_clock_state;

					x86_platform.save_sched_clock_state = hv_save_sched_clock_state;

					old_restore_sched_clock_state = x86_platform.restore_sched_clock_state;

					x86_platform.restore_sched_clock_state = hv_restore_sched_clock_state;

				}

				#endif /* CONFIG_HYPERV */

				static uint32_t  __init ms_hyperv_platform(void)

				@@ -579,6 +636,7 @@ static void __init ms_hyperv_init_platform(void)

					/* Register Hyper-V specific clocksource */

					hv_init_clocksource();

					x86_setup_ops_for_tsc_pg_clock();

					hv_vtl_init_platform();

				#endif

					/*
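
Reviewer note: the save/restore pair added above keeps `sched_clock` continuous across hibernation by adjusting an offset by "counter at suspend minus counter now". The sketch below models that idea in userspace. It assumes the clocksource subtracts an offset from each raw reading and that `hv_adj_sched_clock_offset()` subtracts its argument from that offset; that helper's body is not part of this diff. The `ref_clock` type and all function names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical model of a clocksource whose raw counter restarts after hibernation. */
struct ref_clock {
	uint64_t offset;		/* subtracted from every raw reading */
	uint64_t counter_at_suspend;	/* raw reading captured just before suspend */
};

static uint64_t clock_read(const struct ref_clock *c, uint64_t raw)
{
	return raw - c->offset;
}

static void clock_suspend(struct ref_clock *c, uint64_t raw)
{
	c->counter_at_suspend = raw;
}

static void clock_resume(struct ref_clock *c, uint64_t raw)
{
	uint64_t before = clock_read(c, c->counter_at_suspend);

	/* adjustment = counter at suspend - counter now; unsigned wrap-around is intended */
	c->offset -= c->counter_at_suspend - raw;

	/* the first reading after resume continues from the value at suspend */
	assert(clock_read(c, raw) == before);
}
```

For example, with an initial offset of 100, a suspend at raw value 5000, and a resume at raw value 20, the clock reads 4900 both immediately before suspend and immediately after resume.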

									
										9

arch/x86/kernel/static_call.c
									
												
				@@ -172,6 +172,15 @@ void arch_static_call_transform(void *site, void *tramp, void *func, bool tail)

				}

				EXPORT_SYMBOL_GPL(arch_static_call_transform);

				noinstr void __static_call_update_early(void *tramp, void *func)

				{

					BUG_ON(system_state != SYSTEM_BOOTING);

					BUG_ON(!early_boot_irqs_disabled);

					BUG_ON(static_call_initialized);

					__text_gen_insn(tramp, JMP32_INSN_OPCODE, tramp, func, JMP32_INSN_SIZE);

					sync_core();

				}

				#ifdef CONFIG_MITIGATION_RETHUNK

				/*

				 * This is called by apply_returns() to fix up static call trampolines,

									
										4

arch/x86/kernel/vmlinux.lds.S
									
												
				@@ -519,14 +519,10 @@ INIT_PER_CPU(irq_stack_backing_store);

				 * linker will never mark as relocatable. (Using just ABSOLUTE() is not

				 * sufficient for that).

				 */

				#ifdef CONFIG_XEN

				#ifdef CONFIG_XEN_PV

				xen_elfnote_entry_value =

					ABSOLUTE(xen_elfnote_entry) + ABSOLUTE(startup_xen);

				#endif

				xen_elfnote_hypercall_page_value =

					ABSOLUTE(xen_elfnote_hypercall_page) + ABSOLUTE(hypercall_page);

				#endif

				#ifdef CONFIG_PVH

				xen_elfnote_phys32_entry_value =

					ABSOLUTE(xen_elfnote_phys32_entry) + ABSOLUTE(pvh_start_xen - LOAD_OFFSET);

									
										12

arch/x86/kvm/mmu/mmu.c
									
												
				@@ -3364,18 +3364,6 @@ static bool fast_pf_fix_direct_spte(struct kvm_vcpu *vcpu,

					return true;

				}

				static bool is_access_allowed(struct kvm_page_fault *fault, u64 spte)

				{

					if (fault->exec)

						return is_executable_pte(spte);

					if (fault->write)

						return is_writable_pte(spte);

					/* Fault was on Read access */

					return spte & PT_PRESENT_MASK;

				}

				/*

				 * Returns the last level spte pointer of the shadow page walk for the given

				 * gpa, and sets *spte to the spte value. This spte may be non-preset. If no

									
										17

arch/x86/kvm/mmu/spte.h
									
												
				@@ -461,6 +461,23 @@ static inline bool is_mmu_writable_spte(u64 spte)

					return spte & shadow_mmu_writable_mask;

				}

				/*

				 * Returns true if the access indicated by @fault is allowed by the existing

				 * SPTE protections.  Note, the caller is responsible for checking that the

				 * SPTE is a shadow-present, leaf SPTE (either before or after).

				 */

				static inline bool is_access_allowed(struct kvm_page_fault *fault, u64 spte)

				{

					if (fault->exec)

						return is_executable_pte(spte);

					if (fault->write)

						return is_writable_pte(spte);

					/* Fault was on Read access */

					return spte & PT_PRESENT_MASK;

				}

				/*

				 * If the MMU-writable flag is cleared, i.e. the SPTE is write-protected for

				 * write-tracking, remote TLBs must be flushed, even if the SPTE was read-only,

									
										5

arch/x86/kvm/mmu/tdp_mmu.c
									
												
				@@ -985,6 +985,11 @@ static int tdp_mmu_map_handle_target_level(struct kvm_vcpu *vcpu,

					if (fault->prefetch && is_shadow_present_pte(iter->old_spte))

						return RET_PF_SPURIOUS;

					if (is_shadow_present_pte(iter->old_spte) &&

					    is_access_allowed(fault, iter->old_spte) &&

					    is_last_spte(iter->old_spte, iter->level))

						return RET_PF_SPURIOUS;

					if (unlikely(!fault->slot))

						new_spte = make_mmio_spte(vcpu, iter->gfn, ACC_ALL);

					else

									
										6

arch/x86/kvm/svm/avic.c
									
												
				@@ -1199,6 +1199,12 @@ bool avic_hardware_setup(void)

						return false;

					}

					if (cc_platform_has(CC_ATTR_HOST_SEV_SNP) &&

					    !boot_cpu_has(X86_FEATURE_HV_INUSE_WR_ALLOWED)) {

						pr_warn("AVIC disabled: missing HvInUseWrAllowed on SNP-enabled system\n");

						return false;

					}

					if (boot_cpu_has(X86_FEATURE_AVIC)) {

						pr_info("AVIC enabled\n");

					} else if (force_avic) {

									
										9

arch/x86/kvm/svm/svm.c
									
												
				@@ -3201,15 +3201,6 @@ static int svm_set_msr(struct kvm_vcpu *vcpu, struct msr_data *msr)

						if (data & ~supported_de_cfg)

							return 1;

						/*

						 * Don't let the guest change the host-programmed value.  The

						 * MSR is very model specific, i.e. contains multiple bits that

						 * are completely unknown to KVM, and the one bit known to KVM

						 * is simply a reflection of hardware capabilities.

						 */

						if (!msr->host_initiated && data != svm->msr_decfg)

							return 1;

						svm->msr_decfg = data;

						break;

					}

									
										2

arch/x86/kvm/vmx/posted_intr.h
									
												
				@@ -2,7 +2,7 @@

				#ifndef __KVM_X86_VMX_POSTED_INTR_H

				#define __KVM_X86_VMX_POSTED_INTR_H

				#include <linux/find.h>

				#include <linux/bitmap.h>

				#include <asm/posted_intr.h>

				void vmx_vcpu_pi_load(struct kvm_vcpu *vcpu, int cpu);

									
										9

arch/x86/kvm/x86.c
									
												
				@@ -9976,7 +9976,7 @@ static int complete_hypercall_exit(struct kvm_vcpu *vcpu)

				{

					u64 ret = vcpu->run->hypercall.ret;

					if (!is_64_bit_mode(vcpu))

					if (!is_64_bit_hypercall(vcpu))

						ret = (u32)ret;

					kvm_rax_write(vcpu, ret);

					++vcpu->stat.hypercalls;

				@@ -12724,6 +12724,13 @@ int kvm_arch_init_vm(struct kvm *kvm, unsigned long type)

					kvm_hv_init_vm(kvm);

					kvm_xen_init_vm(kvm);

					if (ignore_msrs && !report_ignored_msrs) {

						pr_warn_once("Running KVM with ignore_msrs=1 and report_ignored_msrs=0 is not a\n"

							     "a supported configuration.  Lying to the guest about the existence of MSRs\n"

							     "may cause the guest operating system to hang or produce errors.  If a guest\n"

							     "does not run without ignore_msrs=1, please report it to kvm@vger.kernel.org.\n");

					}

					return 0;

				out_uninit_mmu:

									
										65

arch/x86/xen/enlighten.c
									
												
				@@ -2,6 +2,7 @@

				#include <linux/console.h>

				#include <linux/cpu.h>

				#include <linux/instrumentation.h>

				#include <linux/kexec.h>

				#include <linux/memblock.h>

				#include <linux/slab.h>

				@@ -21,7 +22,8 @@

				#include "xen-ops.h"

				EXPORT_SYMBOL_GPL(hypercall_page);

				DEFINE_STATIC_CALL(xen_hypercall, xen_hypercall_hvm);

				EXPORT_STATIC_CALL_TRAMP(xen_hypercall);

				/*

				 * Pointer to the xen_vcpu_info structure or

				@@ -68,6 +70,67 @@ EXPORT_SYMBOL(xen_start_flags);

				 */

				struct shared_info *HYPERVISOR_shared_info = &xen_dummy_shared_info;

				static __ref void xen_get_vendor(void)

				{

					init_cpu_devs();

					cpu_detect(&boot_cpu_data);

					get_cpu_vendor(&boot_cpu_data);

				}

				void xen_hypercall_setfunc(void)

				{

					if (static_call_query(xen_hypercall) != xen_hypercall_hvm)

						return;

					if ((boot_cpu_data.x86_vendor == X86_VENDOR_AMD ||

					     boot_cpu_data.x86_vendor == X86_VENDOR_HYGON))

						static_call_update(xen_hypercall, xen_hypercall_amd);

					else

						static_call_update(xen_hypercall, xen_hypercall_intel);

				}

				/*

				 * Evaluate processor vendor in order to select the correct hypercall

				 * function for HVM/PVH guests.

				 * Might be called very early in boot before vendor has been set by

				 * early_cpu_init().

				 */

				noinstr void *__xen_hypercall_setfunc(void)

				{

					void (*func)(void);

					/*

					 * Xen is supported only on CPUs with CPUID, so testing for

					 * X86_FEATURE_CPUID is a test for early_cpu_init() having been

					 * run.

					 *

					 * Note that __xen_hypercall_setfunc() is noinstr only due to a nasty

					 * dependency chain: it is being called via the xen_hypercall static

					 * call when running as a PVH or HVM guest. Hypercalls need to be

					 * noinstr due to PV guests using hypercalls in noinstr code. So we

					 * can safely tag the function body as "instrumentation ok", since

					 * the PV guest requirement is not of interest here (xen_get_vendor()

					 * calls noinstr functions, and static_call_update_early() might do

					 * so, too).

					 */

					instrumentation_begin();

					if (!boot_cpu_has(X86_FEATURE_CPUID))

						xen_get_vendor();

					if ((boot_cpu_data.x86_vendor == X86_VENDOR_AMD ||

					     boot_cpu_data.x86_vendor == X86_VENDOR_HYGON))

						func = xen_hypercall_amd;

					else

						func = xen_hypercall_intel;

					static_call_update_early(xen_hypercall, func);

					instrumentation_end();

					return func;

				}

				static int xen_cpu_up_online(unsigned int cpu)

				{

					xen_init_lock_cpu(cpu);

									
										13

arch/x86/xen/enlighten_hvm.c
									
												
				@@ -106,15 +106,8 @@ static void __init init_hvm_pv_info(void)

					/* PVH set up hypercall page in xen_prepare_pvh(). */

					if (xen_pvh_domain())

						pv_info.name = "Xen PVH";

					else {

						u64 pfn;

						uint32_t msr;

					else

						pv_info.name = "Xen HVM";

						msr = cpuid_ebx(base + 2);

						pfn = __pa(hypercall_page);

						wrmsr_safe(msr, (u32)pfn, (u32)(pfn >> 32));

					}

					xen_setup_features();

				@@ -300,6 +293,10 @@ static uint32_t __init xen_platform_hvm(void)

					if (xen_pv_domain())

						return 0;

					/* Set correct hypercall function. */

					if (xen_domain)

						xen_hypercall_setfunc();

					if (xen_pvh_domain() && nopv) {

						/* Guest booting via the Xen-PVH boot entry goes here */

						pr_info("\"nopv\" parameter is ignored in PVH guest\n");

									
										4

arch/x86/xen/enlighten_pv.c
									
												
				@@ -1341,6 +1341,9 @@ asmlinkage __visible void __init xen_start_kernel(struct start_info *si)

					xen_domain_type = XEN_PV_DOMAIN;

					xen_start_flags = xen_start_info->flags;

					/* Interrupts are guaranteed to be off initially. */

					early_boot_irqs_disabled = true;

					static_call_update_early(xen_hypercall, xen_hypercall_pv);

					xen_setup_features();

				@@ -1431,7 +1434,6 @@ asmlinkage __visible void __init xen_start_kernel(struct start_info *si)

					WARN_ON(xen_cpuhp_setup(xen_cpu_up_prepare_pv, xen_cpu_dead_pv));

					local_irq_disable();

					early_boot_irqs_disabled = true;

					xen_raw_console_write("mapping kernel into physical memory\n");

					xen_setup_kernel_pagetable((pgd_t *)xen_start_info->pt_base,

									
										7

arch/x86/xen/enlighten_pvh.c
									
												
				@@ -129,17 +129,10 @@ static void __init pvh_arch_setup(void)

				void __init xen_pvh_init(struct boot_params *boot_params)

				{

					u32 msr;

					u64 pfn;

					xen_pvh = 1;

					xen_domain_type = XEN_HVM_DOMAIN;

					xen_start_flags = pvh_start_info.flags;

					msr = cpuid_ebx(xen_cpuid_base() + 2);

					pfn = __pa(hypercall_page);

					wrmsr_safe(msr, (u32)pfn, (u32)(pfn >> 32));

					x86_init.oem.arch_setup = pvh_arch_setup;

					x86_init.oem.banner = xen_banner;

									
										50

arch/x86/xen/xen-asm.S
									
												
				@@ -20,9 +20,32 @@

				#include <linux/init.h>

				#include <linux/linkage.h>

				#include <linux/objtool.h>

				#include <../entry/calling.h>

				.pushsection .noinstr.text, "ax"

				/*

				 * PV hypercall interface to the hypervisor.

				 *

				 * Called via inline asm(), so better preserve %rcx and %r11.

				 *

				 * Input:

				 *	%eax: hypercall number

				 *	%rdi, %rsi, %rdx, %r10, %r8: args 1..5 for the hypercall

				 * Output: %rax

				 */

				SYM_FUNC_START(xen_hypercall_pv)

					ANNOTATE_NOENDBR

					push %rcx

					push %r11

					UNWIND_HINT_SAVE

					syscall

					UNWIND_HINT_RESTORE

					pop %r11

					pop %rcx

					RET

				SYM_FUNC_END(xen_hypercall_pv)

				/*

				 * Disabling events is simply a matter of making the event mask

				 * non-zero.

				@@ -176,7 +199,6 @@ SYM_CODE_START(xen_early_idt_handler_array)

				SYM_CODE_END(xen_early_idt_handler_array)

					__FINIT

				hypercall_iret = hypercall_page + __HYPERVISOR_iret * 32

				/*

				 * Xen64 iret frame:

				 *

				@@ -186,17 +208,28 @@ hypercall_iret = hypercall_page + __HYPERVISOR_iret * 32

				 *	cs

				 *	rip		<-- standard iret frame

				 *

				 *	flags

				 *	flags		<-- xen_iret must push from here on

				 *

				 *	rcx		}

				 *	r11		}<-- pushed by hypercall page

				 * rsp->rax		}

				 *	rcx

				 *	r11

				 * rsp->rax

				 */

				.macro xen_hypercall_iret

					pushq $0	/* Flags */

					push %rcx

					push %r11

					push %rax

					mov  $__HYPERVISOR_iret, %eax

					syscall		/* Do the IRET. */

				#ifdef CONFIG_MITIGATION_SLS

					int3

				#endif

				.endm

				SYM_CODE_START(xen_iret)

					UNWIND_HINT_UNDEFINED

					ANNOTATE_NOENDBR

					pushq $0

					jmp hypercall_iret

					xen_hypercall_iret

				SYM_CODE_END(xen_iret)

				/*

				@@ -301,8 +334,7 @@ SYM_CODE_START(xen_entry_SYSENTER_compat)

					ENDBR

					lea 16(%rsp), %rsp	/* strip %rcx, %r11 */

					mov $-ENOSYS, %rax

					pushq $0

					jmp hypercall_iret

					xen_hypercall_iret

				SYM_CODE_END(xen_entry_SYSENTER_compat)

				SYM_CODE_END(xen_entry_SYSCALL_compat)

									
										107

arch/x86/xen/xen-head.S
									
												
				@@ -6,9 +6,11 @@

				#include <linux/elfnote.h>

				#include <linux/init.h>

				#include <linux/instrumentation.h>

				#include <asm/boot.h>

				#include <asm/asm.h>

				#include <asm/frame.h>

				#include <asm/msr.h>

				#include <asm/page_types.h>

				#include <asm/percpu.h>

				@@ -20,28 +22,6 @@

				#include <xen/interface/xen-mca.h>

				#include <asm/xen/interface.h>

				.pushsection .noinstr.text, "ax"

					.balign PAGE_SIZE

				SYM_CODE_START(hypercall_page)

					.rept (PAGE_SIZE / 32)

						UNWIND_HINT_FUNC

						ANNOTATE_NOENDBR

						ANNOTATE_UNRET_SAFE

						ret

						/*

						 * Xen will write the hypercall page, and sort out ENDBR.

						 */

						.skip 31, 0xcc

					.endr

				#define HYPERCALL(n) \

					.equ xen_hypercall_##n, hypercall_page + __HYPERVISOR_##n * 32; \

					.type xen_hypercall_##n, @function; .size xen_hypercall_##n, 32

				#include <asm/xen-hypercalls.h>

				#undef HYPERCALL

				SYM_CODE_END(hypercall_page)

				.popsection

				#ifdef CONFIG_XEN_PV

					__INIT

				SYM_CODE_START(startup_xen)

				@@ -87,6 +67,87 @@ SYM_CODE_END(xen_cpu_bringup_again)

				#endif

				#endif

					.pushsection .noinstr.text, "ax"

				/*

				 * Xen hypercall interface to the hypervisor.

				 *

				 * Input:

				 *     %eax: hypercall number

				 *   32-bit:

				 *     %ebx, %ecx, %edx, %esi, %edi: args 1..5 for the hypercall

				 *   64-bit:

				 *     %rdi, %rsi, %rdx, %r10, %r8: args 1..5 for the hypercall

				 * Output: %[er]ax

				 */

				SYM_FUNC_START(xen_hypercall_hvm)

					ENDBR

					FRAME_BEGIN

					/* Save all relevant registers (caller save and arguments). */

				#ifdef CONFIG_X86_32

					push %eax

					push %ebx

					push %ecx

					push %edx

					push %esi

					push %edi

				#else

					push %rax

					push %rcx

					push %rdx

					push %rdi

					push %rsi

					push %r11

					push %r10

					push %r9

					push %r8

				#ifdef CONFIG_FRAME_POINTER

					pushq $0	/* Dummy push for stack alignment. */

				#endif

				#endif

					/* Set the vendor specific function. */

					call __xen_hypercall_setfunc

					/* Set ZF = 1 if AMD, Restore saved registers. */

				#ifdef CONFIG_X86_32

					lea xen_hypercall_amd, %ebx

					cmp %eax, %ebx

					pop %edi

					pop %esi

					pop %edx

					pop %ecx

					pop %ebx

					pop %eax

				#else

					lea xen_hypercall_amd(%rip), %rbx

					cmp %rax, %rbx

				#ifdef CONFIG_FRAME_POINTER

					pop %rax	/* Dummy pop. */

				#endif

					pop %r8

					pop %r9

					pop %r10

					pop %r11

					pop %rsi

					pop %rdi

					pop %rdx

					pop %rcx

					pop %rax

				#endif

					/* Use correct hypercall function. */

					jz xen_hypercall_amd

					jmp xen_hypercall_intel

				SYM_FUNC_END(xen_hypercall_hvm)

				SYM_FUNC_START(xen_hypercall_amd)

					vmmcall

					RET

				SYM_FUNC_END(xen_hypercall_amd)

				SYM_FUNC_START(xen_hypercall_intel)

					vmcall

					RET

				SYM_FUNC_END(xen_hypercall_intel)

					.popsection

					ELFNOTE(Xen, XEN_ELFNOTE_GUEST_OS,       .asciz "linux")

					ELFNOTE(Xen, XEN_ELFNOTE_GUEST_VERSION,  .asciz "2.6")

					ELFNOTE(Xen, XEN_ELFNOTE_XEN_VERSION,    .asciz "xen-3.0")

				@@ -116,8 +177,6 @@ SYM_CODE_END(xen_cpu_bringup_again)

				#else

				# define FEATURES_DOM0 0

				#endif

					ELFNOTE(Xen, XEN_ELFNOTE_HYPERCALL_PAGE, .globl xen_elfnote_hypercall_page;

						xen_elfnote_hypercall_page: _ASM_PTR xen_elfnote_hypercall_page_value - .)

					ELFNOTE(Xen, XEN_ELFNOTE_SUPPORTED_FEATURES,

						.long FEATURES_PV | FEATURES_PVH | FEATURES_DOM0)

					ELFNOTE(Xen, XEN_ELFNOTE_LOADER,         .asciz "generic")

									
										9

arch/x86/xen/xen-ops.h
									
												
				@@ -326,4 +326,13 @@ static inline void xen_smp_intr_free_pv(unsigned int cpu) {}

				static inline void xen_smp_count_cpus(void) { }

				#endif /* CONFIG_SMP */

				#ifdef CONFIG_XEN_PV

				void xen_hypercall_pv(void);

				#endif

				void xen_hypercall_hvm(void);

				void xen_hypercall_amd(void);

				void xen_hypercall_intel(void);

				void xen_hypercall_setfunc(void);

				void *__xen_hypercall_setfunc(void);

				#endif /* XEN_OPS_H */

									
										3

block/bdev.c
									
												
				@@ -155,8 +155,7 @@ int set_blocksize(struct file *file, int size)

					struct inode *inode = file->f_mapping->host;

					struct block_device *bdev = I_BDEV(inode);

					/* Size must be a power of two, and between 512 and PAGE_SIZE */

					if (size > PAGE_SIZE || size < 512 || !is_power_of_2(size))

					if (blk_validate_block_size(size))

						return -EINVAL;

					/* Size cannot be smaller than the size supported by the device */
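
The set_blocksize() hunk above swaps an open-coded range check for a call to blk_validate_block_size(). As a minimal userspace sketch of the rule stated in the removed comment (a power of two, between 512 and PAGE_SIZE), the condition can be written as below. The names sketch_validate_block_size, sketch_is_power_of_2 and SKETCH_PAGE_SIZE are hypothetical, a 4 KiB page is assumed, and the sketch mirrors the removed condition only; it does not claim to reproduce the body of the kernel helper.

```c
#include <stdbool.h>

/* Assumption: 4 KiB pages; the kernel uses the architecture's PAGE_SIZE. */
#define SKETCH_PAGE_SIZE   4096ul
#define SKETCH_SECTOR_SIZE 512ul

/* True when exactly one bit of n is set. */
static bool sketch_is_power_of_2(unsigned long n)
{
	return n != 0 && (n & (n - 1)) == 0;
}

/*
 * Hypothetical stand-in for the check: returns 0 for an acceptable block
 * size and -1 otherwise (the kernel code in the hunk returns -EINVAL).
 */
static int sketch_validate_block_size(unsigned long size)
{
	if (size < SKETCH_SECTOR_SIZE || size > SKETCH_PAGE_SIZE ||
	    !sketch_is_power_of_2(size))
		return -1;
	return 0;
}
```

With these assumptions, 512, 1024 and 4096 are accepted, while 256, 1536 and 8192 are rejected.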

									
										16

block/blk-mq-sysfs.c
									
												
				@@ -275,13 +275,15 @@ void blk_mq_sysfs_unregister_hctxs(struct request_queue *q)

					struct blk_mq_hw_ctx *hctx;

					unsigned long i;

					lockdep_assert_held(&q->sysfs_dir_lock);

					mutex_lock(&q->sysfs_dir_lock);

					if (!q->mq_sysfs_init_done)

						return;

						goto unlock;

					queue_for_each_hw_ctx(q, hctx, i)

						blk_mq_unregister_hctx(hctx);

				unlock:

					mutex_unlock(&q->sysfs_dir_lock);

				}

				int blk_mq_sysfs_register_hctxs(struct request_queue *q)

				@@ -290,10 +292,9 @@ int blk_mq_sysfs_register_hctxs(struct request_queue *q)

					unsigned long i;

					int ret = 0;

					lockdep_assert_held(&q->sysfs_dir_lock);

					mutex_lock(&q->sysfs_dir_lock);

					if (!q->mq_sysfs_init_done)

						return ret;

						goto unlock;

					queue_for_each_hw_ctx(q, hctx, i) {

						ret = blk_mq_register_hctx(hctx);

				@@ -301,5 +302,8 @@ int blk_mq_sysfs_register_hctxs(struct request_queue *q)

							break;

					}

				unlock:

					mutex_unlock(&q->sysfs_dir_lock);

					return ret;

				}

									
										40

block/blk-mq.c
									
												
				@@ -4412,6 +4412,15 @@ struct gendisk *blk_mq_alloc_disk_for_queue(struct request_queue *q,

				}

				EXPORT_SYMBOL(blk_mq_alloc_disk_for_queue);

				/*

				 * Only hctx removed from cpuhp list can be reused

				 */

				static bool blk_mq_hctx_is_reusable(struct blk_mq_hw_ctx *hctx)

				{

					return hlist_unhashed(&hctx->cpuhp_online) &&

						hlist_unhashed(&hctx->cpuhp_dead);

				}

				static struct blk_mq_hw_ctx *blk_mq_alloc_and_init_hctx(

						struct blk_mq_tag_set *set, struct request_queue *q,

						int hctx_idx, int node)

				@@ -4421,7 +4430,7 @@ static struct blk_mq_hw_ctx *blk_mq_alloc_and_init_hctx(

					/* reuse dead hctx first */

					spin_lock(&q->unused_hctx_lock);

					list_for_each_entry(tmp, &q->unused_hctx_list, hctx_list) {

						if (tmp->numa_node == node) {

						if (tmp->numa_node == node && blk_mq_hctx_is_reusable(tmp)) {

							hctx = tmp;

							break;

						}

				@@ -4453,8 +4462,7 @@ static void blk_mq_realloc_hw_ctxs(struct blk_mq_tag_set *set,

					unsigned long i, j;

					/* protect against switching io scheduler  */

					lockdep_assert_held(&q->sysfs_lock);

					mutex_lock(&q->sysfs_lock);

					for (i = 0; i < set->nr_hw_queues; i++) {

						int old_node;

						int node = blk_mq_get_hctx_node(set, i);

				@@ -4487,6 +4495,7 @@ static void blk_mq_realloc_hw_ctxs(struct blk_mq_tag_set *set,

					xa_for_each_start(&q->hctx_table, j, hctx, j)

						blk_mq_exit_hctx(q, set, hctx, j);

					mutex_unlock(&q->sysfs_lock);

					/* unregister cpuhp callbacks for exited hctxs */

					blk_mq_remove_hw_queues_cpuhp(q);

				@@ -4518,14 +4527,10 @@ int blk_mq_init_allocated_queue(struct blk_mq_tag_set *set,

					xa_init(&q->hctx_table);

					mutex_lock(&q->sysfs_lock);

					blk_mq_realloc_hw_ctxs(set, q);

					if (!q->nr_hw_queues)

						goto err_hctxs;

					mutex_unlock(&q->sysfs_lock);

					INIT_WORK(&q->timeout_work, blk_mq_timeout_work);

					blk_queue_rq_timeout(q, set->timeout ? set->timeout : 30 * HZ);

				@@ -4544,7 +4549,6 @@ int blk_mq_init_allocated_queue(struct blk_mq_tag_set *set,

					return 0;

				err_hctxs:

					mutex_unlock(&q->sysfs_lock);

					blk_mq_release(q);

				err_exit:

					q->mq_ops = NULL;

				@@ -4925,12 +4929,12 @@ static bool blk_mq_elv_switch_none(struct list_head *head,

						return false;

					/* q->elevator needs protection from ->sysfs_lock */

					lockdep_assert_held(&q->sysfs_lock);

					mutex_lock(&q->sysfs_lock);

					/* the check has to be done with holding sysfs_lock */

					if (!q->elevator) {

						kfree(qe);

						goto out;

						goto unlock;

					}

					INIT_LIST_HEAD(&qe->node);

				@@ -4940,7 +4944,9 @@ static bool blk_mq_elv_switch_none(struct list_head *head,

					__elevator_get(qe->type);

					list_add(&qe->node, head);

					elevator_disable(q);

				out:

				unlock:

					mutex_unlock(&q->sysfs_lock);

					return true;

				}

				@@ -4969,9 +4975,11 @@ static void blk_mq_elv_switch_back(struct list_head *head,

					list_del(&qe->node);

					kfree(qe);

					mutex_lock(&q->sysfs_lock);

					elevator_switch(q, t);

					/* drop the reference acquired in blk_mq_elv_switch_none */

					elevator_put(t);

					mutex_unlock(&q->sysfs_lock);

				}

				static void __blk_mq_update_nr_hw_queues(struct blk_mq_tag_set *set,

				@@ -4991,11 +4999,8 @@ static void __blk_mq_update_nr_hw_queues(struct blk_mq_tag_set *set,

					if (set->nr_maps == 1 && nr_hw_queues == set->nr_hw_queues)

						return;

					list_for_each_entry(q, &set->tag_list, tag_set_list) {

						mutex_lock(&q->sysfs_dir_lock);

						mutex_lock(&q->sysfs_lock);

					list_for_each_entry(q, &set->tag_list, tag_set_list)

						blk_mq_freeze_queue(q);

					}

					/*

					 * Switch IO scheduler to 'none', cleaning up the data associated

					 * with the previous scheduler. We will switch back once we are done

				@@ -5051,11 +5056,8 @@ switch_back:

					list_for_each_entry(q, &set->tag_list, tag_set_list)

						blk_mq_elv_switch_back(&head, q);

					list_for_each_entry(q, &set->tag_list, tag_set_list) {

					list_for_each_entry(q, &set->tag_list, tag_set_list)

						blk_mq_unfreeze_queue(q);

						mutex_unlock(&q->sysfs_lock);

						mutex_unlock(&q->sysfs_dir_lock);

					}

					/* Free the excess tags when nr_hw_queues shrink. */

					for (i = set->nr_hw_queues; i < prev_nr_hw_queues; i++)

									
										4

block/blk-sysfs.c
									
												
				@@ -706,11 +706,11 @@ queue_attr_store(struct kobject *kobj, struct attribute *attr,

					if (entry->load_module)

						entry->load_module(disk, page, length);

					mutex_lock(&q->sysfs_lock);

					blk_mq_freeze_queue(q);

					mutex_lock(&q->sysfs_lock);

					res = entry->store(disk, page, length);

					blk_mq_unfreeze_queue(q);

					mutex_unlock(&q->sysfs_lock);

					blk_mq_unfreeze_queue(q);

					return res;

				}

									
										2

drivers/accel/ivpu/ivpu_gem.c
									
												
				@@ -409,7 +409,7 @@ static void ivpu_bo_print_info(struct ivpu_bo *bo, struct drm_printer *p)

					mutex_lock(&bo->lock);

					drm_printf(p, "%-9p %-3u 0x%-12llx %-10lu 0x%-8x %-4u",

						   bo, bo->ctx->id, bo->vpu_addr, bo->base.base.size,

						   bo, bo->ctx ? bo->ctx->id : 0, bo->vpu_addr, bo->base.base.size,

						   bo->flags, kref_read(&bo->base.base.refcount));

					if (bo->base.pages)

									
										10

drivers/accel/ivpu/ivpu_mmu_context.c
									
												
				@@ -612,18 +612,22 @@ int ivpu_mmu_reserved_context_init(struct ivpu_device *vdev)

					if (!ivpu_mmu_ensure_pgd(vdev, &vdev->rctx.pgtable)) {

						ivpu_err(vdev, "Failed to allocate root page table for reserved context\n");

						ret = -ENOMEM;

						goto unlock;

						goto err_ctx_fini;

					}

					ret = ivpu_mmu_cd_set(vdev, vdev->rctx.id, &vdev->rctx.pgtable);

					if (ret) {

						ivpu_err(vdev, "Failed to set context descriptor for reserved context\n");

						goto unlock;

						goto err_ctx_fini;

					}

				unlock:

					mutex_unlock(&vdev->rctx.lock);

					return ret;

				err_ctx_fini:

					mutex_unlock(&vdev->rctx.lock);

					ivpu_mmu_context_fini(vdev, &vdev->rctx);

					return ret;

				}

				void ivpu_mmu_reserved_context_fini(struct ivpu_device *vdev)

									
										2

drivers/accel/ivpu/ivpu_pm.c
									
												
				@@ -378,6 +378,7 @@ void ivpu_pm_init(struct ivpu_device *vdev)

					pm_runtime_use_autosuspend(dev);

					pm_runtime_set_autosuspend_delay(dev, delay);

					pm_runtime_set_active(dev);

					ivpu_dbg(vdev, PM, "Autosuspend delay = %d\n", delay);

				}

				@@ -392,7 +393,6 @@ void ivpu_pm_enable(struct ivpu_device *vdev)

				{

					struct device *dev = vdev->drm.dev;

					pm_runtime_set_active(dev);

					pm_runtime_allow(dev);

					pm_runtime_mark_last_busy(dev);

					pm_runtime_put_autosuspend(dev);

4

drivers/acpi/Kconfig


@@ -135,10 +135,10 @@ config ACPI_REV_OVERRIDE_POSSIBLE
 config ACPI_EC
 	bool "Embedded Controller"
 	depends on HAS_IOPORT
 	default X86
 	default X86 || LOONGARCH
 	help
 	  This driver handles communication with the microcontroller
 	  on many x86 laptops and other machines.
 	  on many x86/LoongArch laptops and other machines.
 config ACPI_EC_DEBUGFS
 	tristate "EC read/write access through /sys/kernel/debug/ec"

2

drivers/auxdisplay/Kconfig


@@ -489,7 +489,7 @@ config IMG_ASCII_LCD
 config HT16K33
 	tristate "Holtek Ht16K33 LED controller with keyscan"
 	depends on FB && I2C && INPUT
 	depends on FB && I2C && INPUT && BACKLIGHT_CLASS_DEVICE
 	select FB_SYSMEM_HELPERS
 	select INPUT_MATRIXKMAP
 	select FB_BACKLIGHT

									
										26

drivers/block/ublk_drv.c
									
												
				@@ -1618,6 +1618,21 @@ static void ublk_unquiesce_dev(struct ublk_device *ub)

					blk_mq_kick_requeue_list(ub->ub_disk->queue);

				}

				static struct gendisk *ublk_detach_disk(struct ublk_device *ub)

				{

					struct gendisk *disk;

					/* Sync with ublk_abort_queue() by holding the lock */

					spin_lock(&ub->lock);

					disk = ub->ub_disk;

					ub->dev_info.state = UBLK_S_DEV_DEAD;

					ub->dev_info.ublksrv_pid = -1;

					ub->ub_disk = NULL;

					spin_unlock(&ub->lock);

					return disk;

				}

				static void ublk_stop_dev(struct ublk_device *ub)

				{

					struct gendisk *disk;

				@@ -1631,14 +1646,7 @@ static void ublk_stop_dev(struct ublk_device *ub)

						ublk_unquiesce_dev(ub);

					}

					del_gendisk(ub->ub_disk);

					/* Sync with ublk_abort_queue() by holding the lock */

					spin_lock(&ub->lock);

					disk = ub->ub_disk;

					ub->dev_info.state = UBLK_S_DEV_DEAD;

					ub->dev_info.ublksrv_pid = -1;

					ub->ub_disk = NULL;

					spin_unlock(&ub->lock);

					disk = ublk_detach_disk(ub);

					put_disk(disk);

				 unlock:

					mutex_unlock(&ub->mutex);

				@@ -2336,7 +2344,7 @@ static int ublk_ctrl_start_dev(struct ublk_device *ub, struct io_uring_cmd *cmd)

				out_put_cdev:

					if (ret) {

						ub->dev_info.state = UBLK_S_DEV_DEAD;

						ublk_detach_disk(ub);

						ublk_put_device(ub);

					}

					if (ret)

									
										15

drivers/block/zram/zram_drv.c
									
												
				@@ -614,6 +614,12 @@ static ssize_t backing_dev_store(struct device *dev,

					}

					nr_pages = i_size_read(inode) >> PAGE_SHIFT;

					/* Refuse to use zero sized device (also prevents self reference) */

					if (!nr_pages) {

						err = -EINVAL;

						goto out;

					}

					bitmap_sz = BITS_TO_LONGS(nr_pages) * sizeof(long);

					bitmap = kvzalloc(bitmap_sz, GFP_KERNEL);

					if (!bitmap) {

				@@ -1438,12 +1444,16 @@ static void zram_meta_free(struct zram *zram, u64 disksize)

					size_t num_pages = disksize >> PAGE_SHIFT;

					size_t index;

					if (!zram->table)

						return;

					/* Free all pages that are still in this zram device */

					for (index = 0; index < num_pages; index++)

						zram_free_page(zram, index);

					zs_destroy_pool(zram->mem_pool);

					vfree(zram->table);

					zram->table = NULL;

				}

				static bool zram_meta_alloc(struct zram *zram, u64 disksize)

				@@ -2320,11 +2330,6 @@ static void zram_reset_device(struct zram *zram)

					zram->limit_pages = 0;

					if (!init_done(zram)) {

						up_write(&zram->init_lock);

						return;

					}

					set_capacity_and_notify(zram->disk, 0);

					part_stat_set_all(zram->disk->part0, 0);

									
										14

drivers/clocksource/hyperv_timer.c
									
												
				@@ -27,7 +27,8 @@

				#include <asm/mshyperv.h>

				static struct clock_event_device __percpu *hv_clock_event;

				static u64 hv_sched_clock_offset __ro_after_init;

				/* Note: offset can hold negative values after hibernation. */

				static u64 hv_sched_clock_offset __read_mostly;

				/*

				 * If false, we're using the old mechanism for stimer0 interrupts

				@@ -470,6 +471,17 @@ static void resume_hv_clock_tsc(struct clocksource *arg)

					hv_set_msr(HV_MSR_REFERENCE_TSC, tsc_msr.as_uint64);

				}

				/*

				 * Called during resume from hibernation, from overridden

				 * x86_platform.restore_sched_clock_state routine. This is to adjust offsets

				 * used to calculate time for hv tsc page based sched_clock, to account for

				 * time spent before hibernation.

				 */

				void hv_adj_sched_clock_offset(u64 offset)

				{

					hv_sched_clock_offset -= offset;

				}

				#ifdef HAVE_VDSO_CLOCKMODE_HVCLOCK

				static int hv_cs_enable(struct clocksource *cs)

				{
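
The hyperv_timer.c hunks above note that the u64 offset "can hold negative values after hibernation". A minimal sketch of why that works with unsigned arithmetic follows. All names are hypothetical, and the reader function assumes that the reported time is the raw reference count minus the offset, which is not shown in the diff itself.

```c
#include <stdint.h>

/* Hypothetical stand-in for the driver's static offset variable. */
static uint64_t sketch_offset;

/* Mirrors the adjustment in the hunk: subtract the pre-hibernation time. */
static void sketch_adj_offset(uint64_t delta)
{
	sketch_offset -= delta;
}

/*
 * Assumed reader: raw count minus offset. Unsigned subtraction is done
 * modulo 2^64, so an offset that has wrapped "below zero" still yields
 * raw + |offset| here.
 */
static uint64_t sketch_sched_clock(uint64_t raw)
{
	return raw - sketch_offset;
}
```

For instance, starting from an offset of 100 and adjusting by 1000 leaves the offset at the two's-complement encoding of -900, and a raw count of 50 then reads back as 950.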

									
										50

drivers/cpufreq/amd-pstate.c
									
												
				@@ -374,15 +374,19 @@ static inline int amd_pstate_cppc_enable(bool enable)

				static int msr_init_perf(struct amd_cpudata *cpudata)

				{

					u64 cap1;

					u64 cap1, numerator;

					int ret = rdmsrl_safe_on_cpu(cpudata->cpu, MSR_AMD_CPPC_CAP1,

								     &cap1);

					if (ret)

						return ret;

					WRITE_ONCE(cpudata->highest_perf, AMD_CPPC_HIGHEST_PERF(cap1));

					WRITE_ONCE(cpudata->max_limit_perf, AMD_CPPC_HIGHEST_PERF(cap1));

					ret = amd_get_boost_ratio_numerator(cpudata->cpu, &numerator);

					if (ret)

						return ret;

					WRITE_ONCE(cpudata->highest_perf, numerator);

					WRITE_ONCE(cpudata->max_limit_perf, numerator);

					WRITE_ONCE(cpudata->nominal_perf, AMD_CPPC_NOMINAL_PERF(cap1));

					WRITE_ONCE(cpudata->lowest_nonlinear_perf, AMD_CPPC_LOWNONLIN_PERF(cap1));

					WRITE_ONCE(cpudata->lowest_perf, AMD_CPPC_LOWEST_PERF(cap1));

				@@ -394,13 +398,18 @@ static int msr_init_perf(struct amd_cpudata *cpudata)

				static int shmem_init_perf(struct amd_cpudata *cpudata)

				{

					struct cppc_perf_caps cppc_perf;

					u64 numerator;

					int ret = cppc_get_perf_caps(cpudata->cpu, &cppc_perf);

					if (ret)

						return ret;

					WRITE_ONCE(cpudata->highest_perf, cppc_perf.highest_perf);

					WRITE_ONCE(cpudata->max_limit_perf, cppc_perf.highest_perf);

					ret = amd_get_boost_ratio_numerator(cpudata->cpu, &numerator);

					if (ret)

						return ret;

					WRITE_ONCE(cpudata->highest_perf, numerator);

					WRITE_ONCE(cpudata->max_limit_perf, numerator);

					WRITE_ONCE(cpudata->nominal_perf, cppc_perf.nominal_perf);

					WRITE_ONCE(cpudata->lowest_nonlinear_perf,

						   cppc_perf.lowest_nonlinear_perf);

				@@ -561,16 +570,13 @@ static int amd_pstate_verify(struct cpufreq_policy_data *policy_data)

				static int amd_pstate_update_min_max_limit(struct cpufreq_policy *policy)

				{

					u32 max_limit_perf, min_limit_perf, lowest_perf, max_perf;

					u32 max_limit_perf, min_limit_perf, lowest_perf, max_perf, max_freq;

					struct amd_cpudata *cpudata = policy->driver_data;

					if (cpudata->boost_supported && !policy->boost_enabled)

						max_perf = READ_ONCE(cpudata->nominal_perf);

					else

						max_perf = READ_ONCE(cpudata->highest_perf);

					max_limit_perf = div_u64(policy->max * max_perf, policy->cpuinfo.max_freq);

					min_limit_perf = div_u64(policy->min * max_perf, policy->cpuinfo.max_freq);

					max_perf = READ_ONCE(cpudata->highest_perf);

					max_freq = READ_ONCE(cpudata->max_freq);

					max_limit_perf = div_u64(policy->max * max_perf, max_freq);

					min_limit_perf = div_u64(policy->min * max_perf, max_freq);

					lowest_perf = READ_ONCE(cpudata->lowest_perf);

					if (min_limit_perf < lowest_perf)

				@@ -889,7 +895,6 @@ static int amd_pstate_init_freq(struct amd_cpudata *cpudata)

				{

					int ret;

					u32 min_freq, max_freq;

					u64 numerator;

					u32 nominal_perf, nominal_freq;

					u32 lowest_nonlinear_perf, lowest_nonlinear_freq;

					u32 boost_ratio, lowest_nonlinear_ratio;

				@@ -911,10 +916,7 @@ static int amd_pstate_init_freq(struct amd_cpudata *cpudata)

					nominal_perf = READ_ONCE(cpudata->nominal_perf);

					ret = amd_get_boost_ratio_numerator(cpudata->cpu, &numerator);

					if (ret)

						return ret;

					boost_ratio = div_u64(numerator << SCHED_CAPACITY_SHIFT, nominal_perf);

					boost_ratio = div_u64(cpudata->highest_perf << SCHED_CAPACITY_SHIFT, nominal_perf);

					max_freq = (nominal_freq * boost_ratio >> SCHED_CAPACITY_SHIFT) * 1000;

					lowest_nonlinear_perf = READ_ONCE(cpudata->lowest_nonlinear_perf);

				@@ -1869,18 +1871,18 @@ static int __init amd_pstate_init(void)

						static_call_update(amd_pstate_update_perf, shmem_update_perf);

					}

					ret = amd_pstate_register_driver(cppc_state);

					if (ret) {

						pr_err("failed to register with return %d\n", ret);

						return ret;

					}

					if (amd_pstate_prefcore) {

						ret = amd_detect_prefcore(&amd_pstate_prefcore);

						if (ret)

							return ret;

					}

					ret = amd_pstate_register_driver(cppc_state);

					if (ret) {

						pr_err("failed to register with return %d\n", ret);

						return ret;

					}

					dev_root = bus_get_dev_root(&cpu_subsys);

					if (dev_root) {

						ret = sysfs_create_group(&dev_root->kobj, &amd_pstate_global_attr_group);
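
The amd_pstate_init_freq() hunk above derives max_freq from a fixed-point boost ratio. The arithmetic can be sketched in userspace as below. The function name and the unit suffixes are assumptions (the diff does not state units), SKETCH_CAPACITY_SHIFT is assumed to equal the kernel's SCHED_CAPACITY_SHIFT of 10, and the multiplication is widened to 64 bits here to keep the sketch free of overflow.

```c
#include <stdint.h>

/* Assumption: same value as the kernel's SCHED_CAPACITY_SHIFT. */
#define SKETCH_CAPACITY_SHIFT 10

/*
 * boost_ratio is highest_perf / nominal_perf in 10-bit fixed point;
 * the result scales the nominal frequency by that ratio and multiplies
 * by 1000. The caller must pass a non-zero nominal_perf.
 */
static uint32_t sketch_max_freq(uint64_t highest_perf, uint32_t nominal_perf,
				uint32_t nominal_freq)
{
	uint32_t boost_ratio =
		(uint32_t)((highest_perf << SKETCH_CAPACITY_SHIFT) / nominal_perf);

	return (uint32_t)((((uint64_t)nominal_freq * boost_ratio) >>
			   SKETCH_CAPACITY_SHIFT) * 1000);
}
```

With highest_perf = 196, nominal_perf = 140 and nominal_freq = 3000, the ratio truncates to 1433 and the result is 4198000; the truncation in both integer steps is why the value is slightly below 3000 * 1.4 * 1000.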

									
										25

drivers/cxl/core/region.c
									
												
				@@ -1295,6 +1295,7 @@ static int cxl_port_setup_targets(struct cxl_port *port,

					struct cxl_region_params *p = &cxlr->params;

					struct cxl_decoder *cxld = cxl_rr->decoder;

					struct cxl_switch_decoder *cxlsd;

					struct cxl_port *iter = port;

					u16 eig, peig;

					u8 eiw, peiw;

				@@ -1311,16 +1312,26 @@ static int cxl_port_setup_targets(struct cxl_port *port,

					cxlsd = to_cxl_switch_decoder(&cxld->dev);

					if (cxl_rr->nr_targets_set) {

						int i, distance;

						int i, distance = 1;

						struct cxl_region_ref *cxl_rr_iter;

						/*

						 * Passthrough decoders impose no distance requirements between

						 * peers

						 * The "distance" between peer downstream ports represents which

						 * endpoint positions in the region interleave a given port can

						 * host.

						 *

						 * For example, at the root of a hierarchy the distance is

						 * always 1 as every index targets a different host-bridge. At

						 * each subsequent switch level those ports map every Nth region

						 * position where N is the width of the switch == distance.

						 */

						if (cxl_rr->nr_targets == 1)

							distance = 0;

						else

							distance = p->nr_targets / cxl_rr->nr_targets;

						do {

							cxl_rr_iter = cxl_rr_load(iter, cxlr);

							distance *= cxl_rr_iter->nr_targets;

							iter = to_cxl_port(iter->dev.parent);

						} while (!is_cxl_root(iter));

						distance *= cxlrd->cxlsd.cxld.interleave_ways;

						for (i = 0; i < cxl_rr->nr_targets_set; i++)

							if (ep->dport == cxlsd->target[i]) {

								rc = check_last_peer(cxled, ep, cxl_rr,
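
The cxl_port_setup_targets() hunk above replaces a single division with a walk up the port hierarchy that multiplies the per-level target counts. A rough sketch of that product, using a flat array in place of the kernel's port and region-reference structures, might look as follows. The array model and every name in it are hypothetical; only the multiply-per-level loop and the final scaling by the root decoder's interleave ways are taken from the diff.

```c
#include <stddef.h>

/*
 * Hypothetical model: widths[0] is the target count at the port being
 * configured, widths[1] at its parent, and so on up to (but excluding)
 * the root. root_ways stands in for the root decoder's interleave ways.
 */
static int sketch_peer_distance(const int *widths, size_t levels, int root_ways)
{
	int distance = 1;

	/* One multiplication per level, as in the do/while loop of the hunk. */
	for (size_t i = 0; i < levels; i++)
		distance *= widths[i];

	return distance * root_ways;
}
```

Under this model, two stacked x2 switches below a x1 root give a distance of 4, and a single x4 switch below a x2 root gives 8. Note that the kernel loop always runs at least once, so levels is expected to be at least 1.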

									
										6

drivers/cxl/pci.c
									
												
				@@ -836,6 +836,9 @@ static ssize_t rcd_pcie_cap_emit(struct device *dev, u16 offset, char *buf, size

					if (!root_dev)

						return -ENXIO;

					if (!dport->regs.rcd_pcie_cap)

						return -ENXIO;

					guard(device)(root_dev);

					if (!root_dev->driver)

						return -ENXIO;

				@@ -1032,8 +1035,7 @@ static int cxl_pci_probe(struct pci_dev *pdev, const struct pci_device_id *id)

					if (rc)

						return rc;

					rc = cxl_pci_ras_unmask(pdev);

					if (rc)

					if (cxl_pci_ras_unmask(pdev))

						dev_dbg(&pdev->dev, "No RAS reporting unmasked\n");

					pci_save_state(pdev);

									
										2

drivers/dma-buf/dma-buf.c
									
												
				@@ -60,7 +60,7 @@ static void __dma_buf_debugfs_list_add(struct dma_buf *dmabuf)

				{

				}

				static void __dma_buf_debugfs_list_del(struct file *file)

				static void __dma_buf_debugfs_list_del(struct dma_buf *dmabuf)

				{

				}

				#endif

									
										43

drivers/dma-buf/udmabuf.c
									
												
				@@ -297,7 +297,7 @@ static const struct dma_buf_ops udmabuf_ops = {

				};

				#define SEALS_WANTED (F_SEAL_SHRINK)

				#define SEALS_DENIED (F_SEAL_WRITE)

				#define SEALS_DENIED (F_SEAL_WRITE|F_SEAL_FUTURE_WRITE)

				static int check_memfd_seals(struct file *memfd)

				{

				@@ -317,12 +317,10 @@ static int check_memfd_seals(struct file *memfd)

					return 0;

				}

				static int export_udmabuf(struct udmabuf *ubuf,

							  struct miscdevice *device,

							  u32 flags)

				static struct dma_buf *export_udmabuf(struct udmabuf *ubuf,

								      struct miscdevice *device)

				{

					DEFINE_DMA_BUF_EXPORT_INFO(exp_info);

					struct dma_buf *buf;

					ubuf->device = device;

					exp_info.ops  = &udmabuf_ops;

				@@ -330,11 +328,7 @@ static int export_udmabuf(struct udmabuf *ubuf,

					exp_info.priv = ubuf;

					exp_info.flags = O_RDWR;

					buf = dma_buf_export(&exp_info);

					if (IS_ERR(buf))

						return PTR_ERR(buf);

					return dma_buf_fd(buf, flags);

					return dma_buf_export(&exp_info);

				}

				static long udmabuf_pin_folios(struct udmabuf *ubuf, struct file *memfd,

				@@ -391,6 +385,7 @@ static long udmabuf_create(struct miscdevice *device,

					struct folio **folios = NULL;

					pgoff_t pgcnt = 0, pglimit;

					struct udmabuf *ubuf;

					struct dma_buf *dmabuf;

					long ret = -EINVAL;

					u32 i, flags;

				@@ -436,23 +431,39 @@ static long udmabuf_create(struct miscdevice *device,

							goto err;

						}

						/*

						 * Take the inode lock to protect against concurrent

						 * memfd_add_seals(), which takes this lock in write mode.

						 */

						inode_lock_shared(file_inode(memfd));

						ret = check_memfd_seals(memfd);

						if (ret < 0) {

							fput(memfd);

							goto err;

						}

						if (ret)

							goto out_unlock;

						ret = udmabuf_pin_folios(ubuf, memfd, list[i].offset,

									 list[i].size, folios);

				out_unlock:

						inode_unlock_shared(file_inode(memfd));

						fput(memfd);

						if (ret)

							goto err;

					}

					flags = head->flags & UDMABUF_FLAGS_CLOEXEC ? O_CLOEXEC : 0;

					ret = export_udmabuf(ubuf, device, flags);

					if (ret < 0)

					dmabuf = export_udmabuf(ubuf, device);

					if (IS_ERR(dmabuf)) {

						ret = PTR_ERR(dmabuf);

						goto err;

					}

					/*

					 * Ownership of ubuf is held by the dmabuf from here.

					 * If the following dma_buf_fd() fails, dma_buf_put() cleans up both the

					 * dmabuf and the ubuf (through udmabuf_ops.release).

					 */

					ret = dma_buf_fd(dmabuf, flags);

					if (ret < 0)

						dma_buf_put(dmabuf);

					kvfree(folios);

					return ret;

									
										28

drivers/dma/amd/qdma/qdma.c
									
												
				@@ -7,9 +7,9 @@

				#include <linux/bitfield.h>

				#include <linux/bitops.h>

				#include <linux/dmaengine.h>

				#include <linux/dma-mapping.h>

				#include <linux/module.h>

				#include <linux/mod_devicetable.h>

				#include <linux/dma-map-ops.h>

				#include <linux/platform_device.h>

				#include <linux/platform_data/amd_qdma.h>

				#include <linux/regmap.h>

				@@ -492,18 +492,9 @@ static int qdma_device_verify(struct qdma_device *qdev)

				static int qdma_device_setup(struct qdma_device *qdev)

				{

					struct device *dev = &qdev->pdev->dev;

					u32 ring_sz = QDMA_DEFAULT_RING_SIZE;

					int ret = 0;

					while (dev && get_dma_ops(dev))

						dev = dev->parent;

					if (!dev) {

						qdma_err(qdev, "dma device not found");

						return -EINVAL;

					}

					set_dma_ops(&qdev->pdev->dev, get_dma_ops(dev));

					ret = qdma_setup_fmap_context(qdev);

					if (ret) {

						qdma_err(qdev, "Failed setup fmap context");

				@@ -548,11 +539,12 @@ static void qdma_free_queue_resources(struct dma_chan *chan)

				{

					struct qdma_queue *queue = to_qdma_queue(chan);

					struct qdma_device *qdev = queue->qdev;

					struct device *dev = qdev->dma_dev.dev;

					struct qdma_platdata *pdata;

					qdma_clear_queue_context(queue);

					vchan_free_chan_resources(&queue->vchan);

					dma_free_coherent(dev, queue->ring_size * QDMA_MM_DESC_SIZE,

					pdata = dev_get_platdata(&qdev->pdev->dev);

					dma_free_coherent(pdata->dma_dev, queue->ring_size * QDMA_MM_DESC_SIZE,

							  queue->desc_base, queue->dma_desc_base);

				}

				@@ -565,6 +557,7 @@ static int qdma_alloc_queue_resources(struct dma_chan *chan)

					struct qdma_queue *queue = to_qdma_queue(chan);

					struct qdma_device *qdev = queue->qdev;

					struct qdma_ctxt_sw_desc desc;

					struct qdma_platdata *pdata;

					size_t size;

					int ret;

				@@ -572,8 +565,9 @@ static int qdma_alloc_queue_resources(struct dma_chan *chan)

					if (ret)

						return ret;

					pdata = dev_get_platdata(&qdev->pdev->dev);

					size = queue->ring_size * QDMA_MM_DESC_SIZE;

					queue->desc_base = dma_alloc_coherent(qdev->dma_dev.dev, size,

					queue->desc_base = dma_alloc_coherent(pdata->dma_dev, size,

									      &queue->dma_desc_base,

									      GFP_KERNEL);

					if (!queue->desc_base) {

				@@ -588,7 +582,7 @@ static int qdma_alloc_queue_resources(struct dma_chan *chan)

					if (ret) {

						qdma_err(qdev, "Failed to setup SW desc ctxt for %s",

							 chan->name);

						dma_free_coherent(qdev->dma_dev.dev, size, queue->desc_base,

						dma_free_coherent(pdata->dma_dev, size, queue->desc_base,

								  queue->dma_desc_base);

						return ret;

					}

				@@ -948,8 +942,9 @@ static int qdma_init_error_irq(struct qdma_device *qdev)

				static int qdmam_alloc_qintr_rings(struct qdma_device *qdev)

				{

					u32 ctxt[QDMA_CTXT_REGMAP_LEN];

					struct qdma_platdata *pdata = dev_get_platdata(&qdev->pdev->dev);

					struct device *dev = &qdev->pdev->dev;

					u32 ctxt[QDMA_CTXT_REGMAP_LEN];

					struct qdma_intr_ring *ring;

					struct qdma_ctxt_intr intr_ctxt;

					u32 vector;

				@@ -969,7 +964,8 @@ static int qdmam_alloc_qintr_rings(struct qdma_device *qdev)

						ring->msix_id = qdev->err_irq_idx + i + 1;

						ring->ridx = i;

						ring->color = 1;

						ring->base = dmam_alloc_coherent(dev, QDMA_INTR_RING_SIZE,

						ring->base = dmam_alloc_coherent(pdata->dma_dev,

										 QDMA_INTR_RING_SIZE,

										 &ring->dev_base, GFP_KERNEL);

						if (!ring->base) {

							qdma_err(qdev, "Failed to alloc intr ring %d", i);

									
										7

drivers/dma/apple-admac.c
									
												
				@@ -153,6 +153,8 @@ static int admac_alloc_sram_carveout(struct admac_data *ad,

				{

					struct admac_sram *sram;

					int i, ret = 0, nblocks;

					ad->txcache.size = readl_relaxed(ad->base + REG_TX_SRAM_SIZE);

					ad->rxcache.size = readl_relaxed(ad->base + REG_RX_SRAM_SIZE);

					if (dir == DMA_MEM_TO_DEV)

						sram = &ad->txcache;

				@@ -912,12 +914,7 @@ static int admac_probe(struct platform_device *pdev)

						goto free_irq;

					}

					ad->txcache.size = readl_relaxed(ad->base + REG_TX_SRAM_SIZE);

					ad->rxcache.size = readl_relaxed(ad->base + REG_RX_SRAM_SIZE);

					dev_info(&pdev->dev, "Audio DMA Controller\n");

					dev_info(&pdev->dev, "imprint %x TX cache %u RX cache %u\n",

						 readl_relaxed(ad->base + REG_IMPRINT), ad->txcache.size, ad->rxcache.size);

					return 0;

									
										2

drivers/dma/at_xdmac.c
									
												
				@@ -1363,6 +1363,8 @@ at_xdmac_prep_dma_memset(struct dma_chan *chan, dma_addr_t dest, int value,

						return NULL;

					desc = at_xdmac_memset_create_desc(chan, atchan, dest, len, value);

					if (!desc)

						return NULL;

					list_add_tail(&desc->desc_node, &desc->descs_list);

					desc->tx_dma_desc.cookie = -EBUSY;

									
										6

drivers/dma/dw/acpi.c
									
												
				@@ -8,13 +8,15 @@

				static bool dw_dma_acpi_filter(struct dma_chan *chan, void *param)

				{

					struct dw_dma *dw = to_dw_dma(chan->device);

					struct dw_dma_chip_pdata *data = dev_get_drvdata(dw->dma.dev);

					struct acpi_dma_spec *dma_spec = param;

					struct dw_dma_slave slave = {

						.dma_dev = dma_spec->dev,

						.src_id = dma_spec->slave_id,

						.dst_id = dma_spec->slave_id,

						.m_master = 0,

						.p_master = 1,

						.m_master = data->m_master,

						.p_master = data->p_master,

					};

					return dw_dma_filter(chan, &slave);

									
										8

drivers/dma/dw/internal.h
									
												
				@@ -51,11 +51,15 @@ struct dw_dma_chip_pdata {

					int (*probe)(struct dw_dma_chip *chip);

					int (*remove)(struct dw_dma_chip *chip);

					struct dw_dma_chip *chip;

					u8 m_master;

					u8 p_master;

				};

				static __maybe_unused const struct dw_dma_chip_pdata dw_dma_chip_pdata = {

					.probe = dw_dma_probe,

					.remove = dw_dma_remove,

					.m_master = 0,

					.p_master = 1,

				};

				static const struct dw_dma_platform_data idma32_pdata = {

				@@ -72,6 +76,8 @@ static __maybe_unused const struct dw_dma_chip_pdata idma32_chip_pdata = {

					.pdata = &idma32_pdata,

					.probe = idma32_dma_probe,

					.remove = idma32_dma_remove,

					.m_master = 0,

					.p_master = 0,

				};

				static const struct dw_dma_platform_data xbar_pdata = {

				@@ -88,6 +94,8 @@ static __maybe_unused const struct dw_dma_chip_pdata xbar_chip_pdata = {

					.pdata = &xbar_pdata,

					.probe = idma32_dma_probe,

					.remove = idma32_dma_remove,

					.m_master = 0,

					.p_master = 0,

				};

				#endif /* _DMA_DW_INTERNAL_H */

									
										4

drivers/dma/dw/pci.c
									
												
				@@ -56,10 +56,10 @@ static int dw_pci_probe(struct pci_dev *pdev, const struct pci_device_id *pid)

					if (ret)

						return ret;

					dw_dma_acpi_controller_register(chip->dw);

					pci_set_drvdata(pdev, data);

					dw_dma_acpi_controller_register(chip->dw);

					return 0;

				}

									
										1

drivers/dma/fsl-edma-common.h
									
												
				@@ -166,6 +166,7 @@ struct fsl_edma_chan {

					struct work_struct		issue_worker;

					struct platform_device		*pdev;

					struct device			*pd_dev;

					struct device_link		*pd_dev_link;

					u32				srcid;

					struct clk			*clk;

					int                             priority;

									
										41

drivers/dma/fsl-edma-main.c
									
												
				@@ -417,10 +417,33 @@ static const struct of_device_id fsl_edma_dt_ids[] = {

				};

				MODULE_DEVICE_TABLE(of, fsl_edma_dt_ids);

				static void fsl_edma3_detach_pd(struct fsl_edma_engine *fsl_edma)

				{

					struct fsl_edma_chan *fsl_chan;

					int i;

					for (i = 0; i < fsl_edma->n_chans; i++) {

						if (fsl_edma->chan_masked & BIT(i))

							continue;

						fsl_chan = &fsl_edma->chans[i];

						if (fsl_chan->pd_dev_link)

							device_link_del(fsl_chan->pd_dev_link);

						if (fsl_chan->pd_dev) {

							dev_pm_domain_detach(fsl_chan->pd_dev, false);

							pm_runtime_dont_use_autosuspend(fsl_chan->pd_dev);

							pm_runtime_set_suspended(fsl_chan->pd_dev);

						}

					}

				}

				static void devm_fsl_edma3_detach_pd(void *data)

				{

					fsl_edma3_detach_pd(data);

				}

				static int fsl_edma3_attach_pd(struct platform_device *pdev, struct fsl_edma_engine *fsl_edma)

				{

					struct fsl_edma_chan *fsl_chan;

					struct device_link *link;

					struct device *pd_chan;

					struct device *dev;

					int i;

				@@ -436,15 +459,16 @@ static int fsl_edma3_attach_pd(struct platform_device *pdev, struct fsl_edma_eng

						pd_chan = dev_pm_domain_attach_by_id(dev, i);

						if (IS_ERR_OR_NULL(pd_chan)) {

							dev_err(dev, "Failed attach pd %d\n", i);

							return -EINVAL;

							goto detach;

						}

						link = device_link_add(dev, pd_chan, DL_FLAG_STATELESS |

						fsl_chan->pd_dev_link = device_link_add(dev, pd_chan, DL_FLAG_STATELESS |

									     DL_FLAG_PM_RUNTIME |

									     DL_FLAG_RPM_ACTIVE);

						if (!link) {

						if (!fsl_chan->pd_dev_link) {

							dev_err(dev, "Failed to add device_link to %d\n", i);

							return -EINVAL;

							dev_pm_domain_detach(pd_chan, false);

							goto detach;

						}

						fsl_chan->pd_dev = pd_chan;

				@@ -455,6 +479,10 @@ static int fsl_edma3_attach_pd(struct platform_device *pdev, struct fsl_edma_eng

					}

					return 0;

				detach:

					fsl_edma3_detach_pd(fsl_edma);

					return -EINVAL;

				}

				static int fsl_edma_probe(struct platform_device *pdev)

				@@ -544,6 +572,9 @@ static int fsl_edma_probe(struct platform_device *pdev)

						ret = fsl_edma3_attach_pd(pdev, fsl_edma);

						if (ret)

							return ret;

						ret = devm_add_action_or_reset(&pdev->dev, devm_fsl_edma3_detach_pd, fsl_edma);

						if (ret)

							return ret;

					}

					if (drvdata->flags & FSL_EDMA_DRV_TCD64)

									
										2

drivers/dma/loongson2-apb-dma.c
									
												
				@@ -31,7 +31,7 @@

				#define LDMA_ASK_VALID		BIT(2)

				#define LDMA_START		BIT(3) /* DMA start operation */

				#define LDMA_STOP		BIT(4) /* DMA stop operation */

				#define LDMA_CONFIG_MASK	GENMASK(4, 0) /* DMA controller config bits mask */

				#define LDMA_CONFIG_MASK	GENMASK_ULL(4, 0) /* DMA controller config bits mask */

				/* Bitfields in ndesc_addr field of HW descriptor */

				#define LDMA_DESC_EN		BIT(0) /*1: The next descriptor is valid */
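Review note: the diff does not say why `LDMA_CONFIG_MASK` moves from `GENMASK()` to `GENMASK_ULL()`. One plausible motivation, assumed here rather than taken from the source, is that a mask of type `unsigned long` is only 32 bits wide on 32-bit builds, so inverting it and applying it to a 64-bit value also clears the upper half. The sketch below reproduces that effect in plain C with fixed-width types; the `SKETCH_*` macros and both helpers are hypothetical and not part of the driver.

```c
#include <stdint.h>

/* Hypothetical stand-ins: the same bits 4..0, once as a 32-bit and once as a 64-bit mask. */
#define SKETCH_MASK_32 ((uint32_t)0x1f)
#define SKETCH_MASK_64 ((uint64_t)0x1f)

/* The inverted 32-bit mask is zero-extended, so bits 63..32 of addr are lost. */
static uint64_t clear_config_narrow(uint64_t addr)
{
	return addr & ~SKETCH_MASK_32;
}

/* The inverted 64-bit mask keeps every bit above bit 4. */
static uint64_t clear_config_wide(uint64_t addr)
{
	return addr & ~SKETCH_MASK_64;
}
```

With a descriptor address above 4 GiB, only the wide variant preserves the high bits.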

									
										2

drivers/dma/mv_xor.c
									
												
				@@ -1388,6 +1388,7 @@ static int mv_xor_probe(struct platform_device *pdev)

							irq = irq_of_parse_and_map(np, 0);

							if (!irq) {

								ret = -ENODEV;

								of_node_put(np);

								goto err_channel_add;

							}

				@@ -1396,6 +1397,7 @@ static int mv_xor_probe(struct platform_device *pdev)

							if (IS_ERR(chan)) {

								ret = PTR_ERR(chan);

								irq_dispose_mapping(irq);

								of_node_put(np);

								goto err_channel_add;

							}

									
										10

drivers/dma/tegra186-gpc-dma.c
									
												
				@@ -231,6 +231,7 @@ struct tegra_dma_channel {

					bool config_init;

					char name[30];

					enum dma_transfer_direction sid_dir;

					enum dma_status status;

					int id;

					int irq;

					int slave_id;

				@@ -393,6 +394,8 @@ static int tegra_dma_pause(struct tegra_dma_channel *tdc)

						tegra_dma_dump_chan_regs(tdc);

					}

					tdc->status = DMA_PAUSED;

					return ret;

				}

				@@ -419,6 +422,8 @@ static void tegra_dma_resume(struct tegra_dma_channel *tdc)

					val = tdc_read(tdc, TEGRA_GPCDMA_CHAN_CSRE);

					val &= ~TEGRA_GPCDMA_CHAN_CSRE_PAUSE;

					tdc_write(tdc, TEGRA_GPCDMA_CHAN_CSRE, val);

					tdc->status = DMA_IN_PROGRESS;

				}

				static int tegra_dma_device_resume(struct dma_chan *dc)

				@@ -544,6 +549,7 @@ static void tegra_dma_xfer_complete(struct tegra_dma_channel *tdc)

					tegra_dma_sid_free(tdc);

					tdc->dma_desc = NULL;

					tdc->status = DMA_COMPLETE;

				}

				static void tegra_dma_chan_decode_error(struct tegra_dma_channel *tdc,

				@@ -716,6 +722,7 @@ static int tegra_dma_terminate_all(struct dma_chan *dc)

						tdc->dma_desc = NULL;

					}

					tdc->status = DMA_COMPLETE;

					tegra_dma_sid_free(tdc);

					vchan_get_all_descriptors(&tdc->vc, &head);

					spin_unlock_irqrestore(&tdc->vc.lock, flags);

				@@ -769,6 +776,9 @@ static enum dma_status tegra_dma_tx_status(struct dma_chan *dc,

					if (ret == DMA_COMPLETE)

						return ret;

					if (tdc->status == DMA_PAUSED)

						ret = DMA_PAUSED;

					spin_lock_irqsave(&tdc->vc.lock, flags);

					vd = vchan_find_desc(&tdc->vc, cookie);

					if (vd) {

									
										15

drivers/firmware/arm_ffa/bus.c
									
												
				@@ -187,13 +187,18 @@ bool ffa_device_is_valid(struct ffa_device *ffa_dev)

					return valid;

				}

				struct ffa_device *ffa_device_register(const uuid_t *uuid, int vm_id,

								       const struct ffa_ops *ops)

				struct ffa_device *

				ffa_device_register(const struct ffa_partition_info *part_info,

						    const struct ffa_ops *ops)

				{

					int id, ret;

					uuid_t uuid;

					struct device *dev;

					struct ffa_device *ffa_dev;

					if (!part_info)

						return NULL;

					id = ida_alloc_min(&ffa_bus_id, 1, GFP_KERNEL);

					if (id < 0)

						return NULL;

				@@ -210,9 +215,11 @@ struct ffa_device *ffa_device_register(const uuid_t *uuid, int vm_id,

					dev_set_name(&ffa_dev->dev, "arm-ffa-%d", id);

					ffa_dev->id = id;

					ffa_dev->vm_id = vm_id;

					ffa_dev->vm_id = part_info->id;

					ffa_dev->properties = part_info->properties;

					ffa_dev->ops = ops;

					uuid_copy(&ffa_dev->uuid, uuid);

					import_uuid(&uuid, (u8 *)part_info->uuid);

					uuid_copy(&ffa_dev->uuid, &uuid);

					ret = device_register(&ffa_dev->dev);

					if (ret) {

									
										7

drivers/firmware/arm_ffa/driver.c
									
												
				@@ -1387,7 +1387,6 @@ static struct notifier_block ffa_bus_nb = {

				static int ffa_setup_partitions(void)

				{

					int count, idx, ret;

					uuid_t uuid;

					struct ffa_device *ffa_dev;

					struct ffa_dev_part_info *info;

					struct ffa_partition_info *pbuf, *tpbuf;

				@@ -1406,23 +1405,19 @@ static int ffa_setup_partitions(void)

					xa_init(&drv_info->partition_info);

					for (idx = 0, tpbuf = pbuf; idx < count; idx++, tpbuf++) {

						import_uuid(&uuid, (u8 *)tpbuf->uuid);

						/* Note that if the UUID will be uuid_null, that will require

						 * ffa_bus_notifier() to find the UUID of this partition id

						 * with help of ffa_device_match_uuid(). FF-A v1.1 and above

						 * provides UUID here for each partition as part of the

						 * discovery API and the same is passed.

						 */

						ffa_dev = ffa_device_register(&uuid, tpbuf->id, &ffa_drv_ops);

						ffa_dev = ffa_device_register(tpbuf, &ffa_drv_ops);

						if (!ffa_dev) {

							pr_err("%s: failed to register partition ID 0x%x\n",

							       __func__, tpbuf->id);

							continue;

						}

						ffa_dev->properties = tpbuf->properties;

						if (drv_info->version > FFA_VERSION_1_0 &&

						    !(tpbuf->properties & FFA_PARTITION_AARCH64_EXEC))

							ffa_mode_32bit_set(ffa_dev);

1

drivers/firmware/arm_scmi/vendors/imx/Kconfig


@@ -15,6 +15,7 @@ config IMX_SCMI_BBM_EXT
 config IMX_SCMI_MISC_EXT
 	tristate "i.MX SCMI MISC EXTENSION"
 	depends on ARM_SCMI_PROTOCOL || (COMPILE_TEST && OF)
 	depends on IMX_SCMI_MISC_DRV
 	default y if ARCH_MXC
 	help
 	  This enables i.MX System MISC control logic such as gpio expander

1

drivers/firmware/imx/Kconfig


@@ -25,7 +25,6 @@ config IMX_SCU
 config IMX_SCMI_MISC_DRV
 	tristate "IMX SCMI MISC Protocol driver"
 	depends on IMX_SCMI_MISC_EXT || COMPILE_TEST
 	default y if ARCH_MXC
 	help
 	  The System Controller Management Interface firmware (SCMI FW) is

									
										4

drivers/firmware/microchip/mpfs-auto-update.c
									
												
				@@ -402,10 +402,10 @@ static int mpfs_auto_update_available(struct mpfs_auto_update_priv *priv)

						return -EIO;

					/*

					 * Bit 5 of byte 1 is "UL_Auto Update" & if it is set, Auto Update is

					 * Bit 5 of byte 1 is "UL_IAP" & if it is set, Auto Update is

					 * not possible.

					 */

					if (response_msg[1] & AUTO_UPDATE_FEATURE_ENABLED)

					if ((((u8 *)response_msg)[1] & AUTO_UPDATE_FEATURE_ENABLED))

						return -EPERM;

					return 0;
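Review note: the comment in this hunk refers to bit 5 of *byte* 1, and the new condition casts `response_msg` to `u8 *` before indexing. Assuming `response_msg` points to 32-bit words (its type is not visible in this hunk), the old expression would have tested word 1, that is bytes 4 to 7, instead. A minimal sketch of the difference, with hypothetical names throughout:

```c
#include <stdint.h>

/* Hypothetical stand-in for AUTO_UPDATE_FEATURE_ENABLED, taken as bit 5 per the comment. */
#define SKETCH_FEATURE_BIT (1u << 5)

/* Tests bit 5 of byte 1 of the buffer, matching the comment. */
static int byte1_bit5_set(const uint32_t *words)
{
	return (((const uint8_t *)words)[1] & SKETCH_FEATURE_BIT) != 0;
}

/* Tests bit 5 of 32-bit word 1, i.e. of the byte range 4..7. */
static int word1_bit5_set(const uint32_t *words)
{
	return (words[1] & SKETCH_FEATURE_BIT) != 0;
}
```

For a buffer whose byte 1 is `0x20` and whose other bytes are zero, only the byte-indexed check reports the flag, independent of endianness.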

4

drivers/gpu/drm/Kconfig


@@ -99,6 +99,7 @@ config DRM_KUNIT_TEST
 config DRM_KMS_HELPER
 	tristate
 	depends on DRM
 	select FB_CORE if DRM_FBDEV_EMULATION
 	help
 	  CRTC helpers for KMS drivers.
@@ -358,6 +359,7 @@ config DRM_TTM_HELPER
 	tristate
 	depends on DRM
 	select DRM_TTM
 	select FB_CORE if DRM_FBDEV_EMULATION
 	select FB_SYSMEM_HELPERS_DEFERRED if DRM_FBDEV_EMULATION
 	help
 	  Helpers for ttm-based gem objects
@@ -365,6 +367,7 @@ config DRM_TTM_HELPER
 config DRM_GEM_DMA_HELPER
 	tristate
 	depends on DRM
 	select FB_CORE if DRM_FBDEV_EMULATION
 	select FB_DMAMEM_HELPERS_DEFERRED if DRM_FBDEV_EMULATION
 	help
 	  Choose this if you need the GEM DMA helper functions
@@ -372,6 +375,7 @@ config DRM_GEM_DMA_HELPER
 config DRM_GEM_SHMEM_HELPER
 	tristate
 	depends on DRM && MMU
 	select FB_CORE if DRM_FBDEV_EMULATION
 	select FB_SYSMEM_HELPERS_DEFERRED if DRM_FBDEV_EMULATION
 	help
 	  Choose this if you need the GEM shmem helper functions

									
										5

drivers/gpu/drm/amd/amdgpu/amdgpu_dev_coredump.c
									
												
				@@ -343,11 +343,10 @@ void amdgpu_coredump(struct amdgpu_device *adev, bool skip_vram_check,

					coredump->skip_vram_check = skip_vram_check;

					coredump->reset_vram_lost = vram_lost;

					if (job && job->vm) {

						struct amdgpu_vm *vm = job->vm;

					if (job && job->pasid) {

						struct amdgpu_task_info *ti;

						ti = amdgpu_vm_get_task_info_vm(vm);

						ti = amdgpu_vm_get_task_info_pasid(adev, job->pasid);

						if (ti) {

							coredump->reset_task_info = *ti;

							amdgpu_vm_put_task_info(ti);

									
										3

drivers/gpu/drm/amd/amdgpu/amdgpu_device.c
									
												
				@@ -417,6 +417,9 @@ bool amdgpu_device_supports_boco(struct drm_device *dev)

				{

					struct amdgpu_device *adev = drm_to_adev(dev);

					if (!IS_ENABLED(CONFIG_HOTPLUG_PCI_PCIE))

						return false;

					if (adev->has_pr3 ||

					    ((adev->flags & AMD_IS_PX) && amdgpu_is_atpx_hybrid()))

						return true;

									
										3

drivers/gpu/drm/amd/amdgpu/amdgpu_job.c
									
												
				@@ -255,7 +255,6 @@ void amdgpu_job_set_resources(struct amdgpu_job *job, struct amdgpu_bo *gds,

				void amdgpu_job_free_resources(struct amdgpu_job *job)

				{

					struct amdgpu_ring *ring = to_amdgpu_ring(job->base.sched);

					struct dma_fence *f;

					unsigned i;

				@@ -268,7 +267,7 @@ void amdgpu_job_free_resources(struct amdgpu_job *job)

						f = NULL;

					for (i = 0; i < job->num_ibs; ++i)

						amdgpu_ib_free(ring->adev, &job->ibs[i], f);

						amdgpu_ib_free(NULL, &job->ibs[i], f);

				}

				static void amdgpu_job_free_cb(struct drm_sched_job *s_job)

									
										7

drivers/gpu/drm/amd/amdgpu/amdgpu_vm.c
									
												
				@@ -1266,10 +1266,9 @@ int amdgpu_vm_bo_update(struct amdgpu_device *adev, struct amdgpu_bo_va *bo_va,

					 * next command submission.

					 */

					if (amdgpu_vm_is_bo_always_valid(vm, bo)) {

						uint32_t mem_type = bo->tbo.resource->mem_type;

						if (!(bo->preferred_domains &

						      amdgpu_mem_type_to_domain(mem_type)))

						if (bo->tbo.resource &&

						    !(bo->preferred_domains &

						      amdgpu_mem_type_to_domain(bo->tbo.resource->mem_type)))

							amdgpu_vm_bo_evicted(&bo_va->base);

						else

							amdgpu_vm_bo_idle(&bo_va->base);

									
										2

drivers/gpu/drm/amd/amdgpu/gfx_v12_0.c
									
												
				@@ -4123,7 +4123,7 @@ static int gfx_v12_0_set_clockgating_state(void *handle,

					if (amdgpu_sriov_vf(adev))

						return 0;

					switch (adev->ip_versions[GC_HWIP][0]) {

					switch (amdgpu_ip_version(adev, GC_HWIP, 0)) {

					case IP_VERSION(12, 0, 0):

					case IP_VERSION(12, 0, 1):

						gfx_v12_0_update_gfx_clock_gating(adev,

									
										2

drivers/gpu/drm/amd/amdgpu/mmhub_v4_1_0.c
									
												
				@@ -108,7 +108,7 @@ mmhub_v4_1_0_print_l2_protection_fault_status(struct amdgpu_device *adev,

					dev_err(adev->dev,

						"MMVM_L2_PROTECTION_FAULT_STATUS_LO32:0x%08X\n",

						status);

					switch (adev->ip_versions[MMHUB_HWIP][0]) {

					switch (amdgpu_ip_version(adev, MMHUB_HWIP, 0)) {

					case IP_VERSION(4, 1, 0):

						mmhub_cid = mmhub_client_ids_v4_1_0[cid][rw];

						break;

									
										11

drivers/gpu/drm/amd/amdgpu/nbio_v7_0.c
									
												
				@@ -271,8 +271,19 @@ const struct nbio_hdp_flush_reg nbio_v7_0_hdp_flush_reg = {

					.ref_and_mask_sdma1 = GPU_HDP_FLUSH_DONE__SDMA1_MASK,

				};

				#define regRCC_DEV0_EPF6_STRAP4                                                                         0xd304

				#define regRCC_DEV0_EPF6_STRAP4_BASE_IDX                                                                5

				static void nbio_v7_0_init_registers(struct amdgpu_device *adev)

				{

					uint32_t data;

					switch (amdgpu_ip_version(adev, NBIO_HWIP, 0)) {

					case IP_VERSION(2, 5, 0):

						data = RREG32_SOC15(NBIO, 0, regRCC_DEV0_EPF6_STRAP4) & ~BIT(23);

						WREG32_SOC15(NBIO, 0, regRCC_DEV0_EPF6_STRAP4, data);

						break;

					}

				}

				#define MMIO_REG_HOLE_OFFSET (0x80000 - PAGE_SIZE)

									
										2

drivers/gpu/drm/amd/amdgpu/nbio_v7_11.c
									
												
				@@ -275,7 +275,7 @@ static void nbio_v7_11_init_registers(struct amdgpu_device *adev)

					if (def != data)

						WREG32_SOC15(NBIO, 0, regBIF_BIF256_CI256_RC3X4_USB4_PCIE_MST_CTRL_3, data);

					switch (adev->ip_versions[NBIO_HWIP][0]) {

					switch (amdgpu_ip_version(adev, NBIO_HWIP, 0)) {

					case IP_VERSION(7, 11, 0):

					case IP_VERSION(7, 11, 1):

					case IP_VERSION(7, 11, 2):

									
										2

drivers/gpu/drm/amd/amdgpu/nbio_v7_7.c
									
												
				@@ -247,7 +247,7 @@ static void nbio_v7_7_init_registers(struct amdgpu_device *adev)

					if (def != data)

						WREG32_SOC15(NBIO, 0, regBIF0_PCIE_MST_CTRL_3, data);

					switch (adev->ip_versions[NBIO_HWIP][0]) {

					switch (amdgpu_ip_version(adev, NBIO_HWIP, 0)) {

					case IP_VERSION(7, 7, 0):

						data = RREG32_SOC15(NBIO, 0, regRCC_DEV0_EPF5_STRAP4) & ~BIT(23);

						WREG32_SOC15(NBIO, 0, regRCC_DEV0_EPF5_STRAP4, data);

									
										2

drivers/gpu/drm/amd/pm/swsmu/smu14/smu_v14_0_2_ppt.c
									
												
				@@ -2096,7 +2096,7 @@ static int smu_v14_0_2_enable_gfx_features(struct smu_context *smu)

				{

					struct amdgpu_device *adev = smu->adev;

					if (adev->ip_versions[MP1_HWIP][0] == IP_VERSION(14, 0, 2))

					if (amdgpu_ip_version(adev, MP1_HWIP, 0) == IP_VERSION(14, 0, 2))

						return smu_cmn_send_smc_msg_with_param(smu, SMU_MSG_EnableAllSmuFeatures,

														   FEATURE_PWR_GFX, NULL);

					else

									
										10

drivers/gpu/drm/display/drm_dp_tunnel.c
									
												
				@@ -1896,8 +1896,8 @@ static void destroy_mgr(struct drm_dp_tunnel_mgr *mgr)

				 *

				 * Creates a DP tunnel manager for @dev.

				 *

				 * Returns a pointer to the tunnel manager if created successfully or NULL in

				 * case of an error.

				 * Returns a pointer to the tunnel manager if created successfully or error

				 * pointer in case of failure.

				 */

				struct drm_dp_tunnel_mgr *

				drm_dp_tunnel_mgr_create(struct drm_device *dev, int max_group_count)

				@@ -1907,7 +1907,7 @@ drm_dp_tunnel_mgr_create(struct drm_device *dev, int max_group_count)

					mgr = kzalloc(sizeof(*mgr), GFP_KERNEL);

					if (!mgr)

						return NULL;

						return ERR_PTR(-ENOMEM);

					mgr->dev = dev;

					init_waitqueue_head(&mgr->bw_req_queue);

				@@ -1916,7 +1916,7 @@ drm_dp_tunnel_mgr_create(struct drm_device *dev, int max_group_count)

					if (!mgr->groups) {

						kfree(mgr);

						return NULL;

						return ERR_PTR(-ENOMEM);

					}

				#ifdef CONFIG_DRM_DISPLAY_DP_TUNNEL_STATE_DEBUG

				@@ -1927,7 +1927,7 @@ drm_dp_tunnel_mgr_create(struct drm_device *dev, int max_group_count)

						if (!init_group(mgr, &mgr->groups[i])) {

							destroy_mgr(mgr);

							return NULL;

							return ERR_PTR(-ENOMEM);

						}

						mgr->group_count++;

									
										11

drivers/gpu/drm/drm_modes.c
									
												
				@@ -1287,14 +1287,11 @@ EXPORT_SYMBOL(drm_mode_set_name);

				 */

				int drm_mode_vrefresh(const struct drm_display_mode *mode)

				{

					unsigned int num, den;

					unsigned int num = 1, den = 1;

					if (mode->htotal == 0 || mode->vtotal == 0)

						return 0;

					num = mode->clock;

					den = mode->htotal * mode->vtotal;

					if (mode->flags & DRM_MODE_FLAG_INTERLACE)

						num *= 2;

					if (mode->flags & DRM_MODE_FLAG_DBLSCAN)

				@@ -1302,6 +1299,12 @@ int drm_mode_vrefresh(const struct drm_display_mode *mode)

					if (mode->vscan > 1)

						den *= mode->vscan;

					if (check_mul_overflow(mode->clock, num, &num))

						return 0;

					if (check_mul_overflow(mode->htotal * mode->vtotal, den, &den))

						return 0;

					return DIV_ROUND_CLOSEST_ULL(mul_u32_u32(num, 1000), den);

				}

				EXPORT_SYMBOL(drm_mode_vrefresh);
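Review note: the reworked `drm_mode_vrefresh()` accumulates the scaling factors first and then applies them to the clock and to `htotal * vtotal` through `check_mul_overflow()`, returning 0 if either product would wrap. The userspace sketch below mirrors that arithmetic under a few assumptions: `struct mode_sketch` is a hypothetical stand-in for the relevant `struct drm_display_mode` fields, the compiler builtin `__builtin_mul_overflow()` (GCC/Clang) replaces the kernel helper, and the double-scan branch, which is elided between the hunks, is assumed to double the denominator.

```c
#include <stdint.h>

/* Hypothetical stand-in for the fields of struct drm_display_mode used here. */
struct mode_sketch {
	uint32_t clock_khz;          /* pixel clock in kHz */
	uint16_t htotal, vtotal, vscan;
	int interlace, dblscan;      /* stand-ins for the DRM_MODE_FLAG_* bits */
};

/* Returns the refresh rate in Hz, or 0 on a zero-sized mode or on overflow. */
static int vrefresh_sketch(const struct mode_sketch *m)
{
	unsigned int num = 1, den = 1;

	if (m->htotal == 0 || m->vtotal == 0)
		return 0;

	if (m->interlace)
		num *= 2;
	if (m->dblscan)              /* assumption: the elided line doubles den */
		den *= 2;
	if (m->vscan > 1)
		den *= m->vscan;

	/* Apply the factors with overflow detection instead of plain multiplies. */
	if (__builtin_mul_overflow(m->clock_khz, num, &num))
		return 0;
	if (__builtin_mul_overflow((unsigned int)m->htotal * m->vtotal, den, &den))
		return 0;

	/* Round to the closest integer using a 64-bit intermediate. */
	return (int)(((uint64_t)num * 1000u + den / 2) / den);
}
```

A 148500 kHz mode with totals of 2200 x 1125 yields 60, while an interlaced mode with a clock near the 32-bit limit yields 0 rather than a wrapped value.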

									
										5

drivers/gpu/drm/i915/gt/intel_engine_types.h
									
												
				@@ -343,6 +343,11 @@ struct intel_engine_guc_stats {

					 * @start_gt_clk: GT clock time of last idle to active transition.

					 */

					u64 start_gt_clk;

					/**

					 * @total: The last value of total returned

					 */

					u64 total;

				};

				union intel_engine_tlb_inv_reg {

									
										41

drivers/gpu/drm/i915/gt/uc/intel_guc_submission.c
									
												
				@@ -1243,6 +1243,21 @@ static void __get_engine_usage_record(struct intel_engine_cs *engine,

					} while (++i < 6);

				}

				static void __set_engine_usage_record(struct intel_engine_cs *engine,

								      u32 last_in, u32 id, u32 total)

				{

					struct iosys_map rec_map = intel_guc_engine_usage_record_map(engine);

				#define record_write(map_, field_, val_) \

					iosys_map_wr_field(map_, 0, struct guc_engine_usage_record, field_, val_)

					record_write(&rec_map, last_switch_in_stamp, last_in);

					record_write(&rec_map, current_context_index, id);

					record_write(&rec_map, total_runtime, total);

				#undef record_write

				}

				static void guc_update_engine_gt_clks(struct intel_engine_cs *engine)

				{

					struct intel_engine_guc_stats *stats = &engine->stats.guc;

				@@ -1363,9 +1378,12 @@ static ktime_t guc_engine_busyness(struct intel_engine_cs *engine, ktime_t *now)

						total += intel_gt_clock_interval_to_ns(gt, clk);

					}

					if (total > stats->total)

						stats->total = total;

					spin_unlock_irqrestore(&guc->timestamp.lock, flags);

					return ns_to_ktime(total);

					return ns_to_ktime(stats->total);

				}

				static void guc_enable_busyness_worker(struct intel_guc *guc)

				@@ -1431,8 +1449,21 @@ static void __reset_guc_busyness_stats(struct intel_guc *guc)

					guc_update_pm_timestamp(guc, &unused);

					for_each_engine(engine, gt, id) {

						struct intel_engine_guc_stats *stats = &engine->stats.guc;

						guc_update_engine_gt_clks(engine);

						engine->stats.guc.prev_total = 0;

						/*

						 * If resetting a running context, accumulate the active

						 * time as well since there will be no context switch.

						 */

						if (stats->running) {

							u64 clk = guc->timestamp.gt_stamp - stats->start_gt_clk;

							stats->total_gt_clks += clk;

						}

						stats->prev_total = 0;

						stats->running = 0;

					}

					spin_unlock_irqrestore(&guc->timestamp.lock, flags);

				@@ -1543,6 +1574,9 @@ err_trylock:

				static int guc_action_enable_usage_stats(struct intel_guc *guc)

				{

					struct intel_gt *gt = guc_to_gt(guc);

					struct intel_engine_cs *engine;

					enum intel_engine_id id;

					u32 offset = intel_guc_engine_usage_offset(guc);

					u32 action[] = {

						INTEL_GUC_ACTION_SET_ENG_UTIL_BUFF,

				@@ -1550,6 +1584,9 @@ static int guc_action_enable_usage_stats(struct intel_guc *guc)

						0,

					};

					for_each_engine(engine, gt, id)

						__set_engine_usage_record(engine, 0, 0xffffffff, 0);

					return intel_guc_send(guc, action, ARRAY_SIZE(action));

				}
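Review note: in the `guc_engine_busyness()` hunk, the freshly computed `total` is stored in `stats->total` only when it is larger than the stored value, and the stored value is what gets returned. The visible effect is that the reported busyness never decreases, even if a new sample comes out smaller. A minimal sketch of that clamp, using a hypothetical struct and function name rather than the i915 types:

```c
#include <stdint.h>

/* Hypothetical stand-in for the new 'total' field in intel_engine_guc_stats. */
struct busyness_sketch {
	uint64_t total;   /* last value reported to the caller */
};

/* Report a sample, but never report less than what was reported before. */
static uint64_t report_busyness(struct busyness_sketch *s, uint64_t sample)
{
	if (sample > s->total)
		s->total = sample;
	return s->total;
}
```

In the driver the update happens under `guc->timestamp.lock`; the sketch omits locking.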

									
										2

drivers/gpu/drm/panel/panel-himax-hx83102.c
									
												
				@@ -565,6 +565,8 @@ static int hx83102_get_modes(struct drm_panel *panel,

					struct drm_display_mode *mode;

					mode = drm_mode_duplicate(connector->dev, m);

					if (!mode)

						return -ENOMEM;

					mode->type = DRM_MODE_TYPE_DRIVER | DRM_MODE_TYPE_PREFERRED;

					drm_mode_set_name(mode);

									
										4

drivers/gpu/drm/panel/panel-novatek-nt35950.c
									
												
@@ -481,9 +481,9 @@ static int nt35950_probe(struct mipi_dsi_device *dsi)
 			return dev_err_probe(dev, -EPROBE_DEFER, "Cannot get secondary DSI host\n");
 		nt->dsi[1] = mipi_dsi_device_register_full(dsi_r_host, info);
-		if (!nt->dsi[1]) {
+		if (IS_ERR(nt->dsi[1])) {
 			dev_err(dev, "Cannot get secondary DSI node\n");
-			return -ENODEV;
+			return PTR_ERR(nt->dsi[1]);
 		}
 		num_dsis++;
 	}

									
1 drivers/gpu/drm/panel/panel-sitronix-st7701.c
												
@@ -1177,6 +1177,7 @@ static int st7701_probe(struct device *dev, int connector_type)
 		return dev_err_probe(dev, ret, "Failed to get orientation\n");
 	drm_panel_init(&st7701->panel, dev, &st7701_funcs, connector_type);
+	st7701->panel.prepare_prev_first = true;
 	/**
 	 * Once sleep out has been issued, ST7701 IC required to wait 120ms

Compare commits

421 Commits v6.13-rc3 ... v6.13-rc5

1 .mailmap
4 Documentation/admin-guide/pm/amd-pstate.rst
10 Documentation/devicetree/bindings/crypto/fsl,sec-v4.0.yaml
2 Documentation/devicetree/bindings/mtd/partitions/fixed-partitions.yaml
2 Documentation/devicetree/bindings/soc/fsl/fsl,qman-portal.yaml
2 Documentation/devicetree/bindings/sound/realtek,rt5645.yaml
850 Documentation/mm/process_addrs.rst
6 MAINTAINERS
2 Makefile
1 arch/arc/Kconfig
8 arch/arc/include/asm/cachetype.h Normal file
2 arch/arm64/boot/dts/arm/fvp-base-revc.dts
8 arch/arm64/boot/dts/broadcom/bcm2712.dtsi
35 arch/arm64/kernel/signal.c
6 arch/hexagon/Makefile
1 arch/powerpc/configs/pmac32_defconfig
1 arch/powerpc/configs/ppc6xx_defconfig
36 arch/powerpc/platforms/book3s/vas-api.c
2 arch/s390/boot/startup.c
6 arch/s390/boot/vmem.c
2 arch/s390/kernel/ipl.c
12 arch/x86/events/intel/core.c
1 arch/x86/events/intel/ds.c
1 arch/x86/events/intel/uncore.c
1 arch/x86/include/asm/cpufeatures.h
2 arch/x86/include/asm/processor.h
15 arch/x86/include/asm/static_call.h
6 arch/x86/include/asm/sync_core.h
36 arch/x86/include/asm/xen/hypercall.h
5 arch/x86/kernel/callthunks.c
30 arch/x86/kernel/cet.c
38 arch/x86/kernel/cpu/common.c
58 arch/x86/kernel/cpu/mshyperv.c
9 arch/x86/kernel/static_call.c
4 arch/x86/kernel/vmlinux.lds.S
12 arch/x86/kvm/mmu/mmu.c
17 arch/x86/kvm/mmu/spte.h
5 arch/x86/kvm/mmu/tdp_mmu.c
6 arch/x86/kvm/svm/avic.c
9 arch/x86/kvm/svm/svm.c
2 arch/x86/kvm/vmx/posted_intr.h
9 arch/x86/kvm/x86.c
65 arch/x86/xen/enlighten.c
13 arch/x86/xen/enlighten_hvm.c
4 arch/x86/xen/enlighten_pv.c
7 arch/x86/xen/enlighten_pvh.c
50 arch/x86/xen/xen-asm.S
107 arch/x86/xen/xen-head.S
9 arch/x86/xen/xen-ops.h
3 block/bdev.c
16 block/blk-mq-sysfs.c
40 block/blk-mq.c
4 block/blk-sysfs.c
2 drivers/accel/ivpu/ivpu_gem.c
10 drivers/accel/ivpu/ivpu_mmu_context.c
2 drivers/accel/ivpu/ivpu_pm.c
4 drivers/acpi/Kconfig
2 drivers/auxdisplay/Kconfig
26 drivers/block/ublk_drv.c
15 drivers/block/zram/zram_drv.c
14 drivers/clocksource/hyperv_timer.c
50 drivers/cpufreq/amd-pstate.c
25 drivers/cxl/core/region.c
6 drivers/cxl/pci.c
2 drivers/dma-buf/dma-buf.c
43 drivers/dma-buf/udmabuf.c
28 drivers/dma/amd/qdma/qdma.c
7 drivers/dma/apple-admac.c
2 drivers/dma/at_xdmac.c
6 drivers/dma/dw/acpi.c
8 drivers/dma/dw/internal.h
4 drivers/dma/dw/pci.c
1 drivers/dma/fsl-edma-common.h
41 drivers/dma/fsl-edma-main.c
2 drivers/dma/loongson2-apb-dma.c
2 drivers/dma/mv_xor.c
10 drivers/dma/tegra186-gpc-dma.c
15 drivers/firmware/arm_ffa/bus.c


7 drivers/firmware/arm_ffa/driver.c