# @SmallAd3697 SmallAd3697
SmallAd3697 posts on Reddit most often about azure, databricks, microsoft, and hosted. They currently have [---] followers and [---] posts still getting attention, totaling [--] engagements in the last [--] hours.
### Engagements: [--] [#](/creator/reddit::t2_6dfcntmw/interactions)

- [--] Week [---] +83%
- [--] Month [---] +34%
- [--] Months [-----] +124%
- [--] Year [-----] +757%
### Mentions: [--] [#](/creator/reddit::t2_6dfcntmw/posts_active)

- [--] Week [--] -33%
- [--] Month [--] +25%
- [--] Months [--] +35%
- [--] Year [---] +1,583%
### Followers: [---] [#](/creator/reddit::t2_6dfcntmw/followers)

- [--] Months [-----] +7.30%
- [--] Year [-----] +82%
### Social Influence
**Social category influence**
[technology brands](/list/technology-brands) 35% [stocks](/list/stocks) 12% [finance](/list/finance) 3%
**Social topic influence**
[azure](/topic/azure) 17%, [databricks](/topic/databricks) #97, [microsoft](/topic/microsoft) 11%, [hosted](/topic/hosted) 3%, [away from](/topic/away-from) 3%, [lack of](/topic/lack-of) 2%, [app](/topic/app) 1%, [network](/topic/network) 1%, [api](/topic/api) 1%, [stocks](/topic/stocks) 1%
**Top accounts mentioned or mentioned by**
[@unitedcom](/creator/undefined)
**Top assets mentioned**
[Microsoft Corp. (MSFT)](/topic/microsoft) [Alphabet Inc Class A (GOOGL)](/topic/$googl) [Viking Holdings Ltd (VIK)](/topic/$vik)
### Top Social Posts
Top posts by engagements in the last [--] hours
"Slapping a vendor's brand on hosted duckdb dataengineering dataengineering"
[Reddit Link](https://redd.it/1q4135w) 2026-01-04T20:48Z [--] followers, [---] engagements
"App Service disallows a common network API (C#.Net) AZURE AZURE"
[Reddit Link](https://redd.it/1r1akp1) 2026-02-10T19:36Z [--] followers, [--] engagements
"Slow dataset import against sql endpoint (F64 capacity with [--] million rows) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1pfvl3i) 2025-12-06T18:02Z [--] followers, [--] engagements
"Lack of Network Connectivity in Fabric dataengineering dataengineering"
[Reddit Link](https://redd.it/1qf0y19) 2026-01-17T02:44Z [--] followers, [--] engagements
"Publish to duckdb from databricks UC databricks databricks"
[Reddit Link](https://redd.it/1qvdh15) 2026-02-04T03:38Z [--] followers, [--] engagements
"Very high goodwill (Progress Software) stocks stocks"
[Reddit Link](https://redd.it/1lt77c5) 2025-07-06T17:55Z [--] followers, [---] engagements
"Multi table transactions databricks databricks"
[Reddit Link](https://redd.it/1oy13j4) 2025-11-15T20:01Z [--] followers, [--] engagements
"Using GEN2 dataflows (CICD variety) as a source is failing about 30% of time MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1r3rsns) 2026-02-13T15:16Z [--] followers, [--] engagements
"Showing exec plans for SQL analytics endpoint of LH For some time I've planned to start using the SQL analytics endpoint of a lakehouse. It seems to be one of the more innovative things that has happened in fabric recently. The Microsoft docs warn heavily against using it since it performs more slowly than directlake semantic model. However I have to believe that there are some scenarios where it is suitable. I didn't want to dive into these sorts of queries blindfolded especially given the caveats in the docs. Before trying to use them in a solution I had lots of questions to answer. Eg."
[Reddit Link](https://redd.it/1j29uxz) 2025-03-03T03:51Z [---] followers, [--] engagements
"Bag charge missing / not billed I've called a couple times now and United seems unconcerned about an issue impacting my son. . He travels end of April and booking confirmation says I payed for "premium bundle" for baggage. The booking confirmation came from United Airlines (notifications@united.com) It shows cc number cc payment of [------] consisting of fare [------] and Taxes [-----] and Premium add-ons [-----]. The premium add-ons were for baggage. As soon as I made purchase I started getting additional advertising to buy bags. Something seemed off. I called a couple times. The first time they"
[Reddit Link](https://redd.it/1jdjjcp) 2025-03-17T18:14Z [---] followers, [--] engagements
"Cost trade-offs for occasionally used reports Are any developers in this community at liberty to pick a conventional ERP reporting approach with conventional tools like ssrs against the ERP/API Do you ever choose NOT to use power bi (PQ with a duplicated/remote copy of the same underlying data) Or does the conventional reporting go to a different team I'm a fan of PBI but it isn't a general purpose reporting tool. I can definitely see it's pro's and con's. Especially when it comes to cost. I've seen some crazy things happening in PBI from a cost perspective. I see places where report"
[Reddit Link](https://redd.it/1jftgz0) 2025-03-20T16:51Z [---] followers, [--] engagements
"Where does the mashup run There are times when I know where when and how my Power Query will run. Eg. I can run it from PBI desktop or thru an on-premise gateway. Or even in a vnet managed gateway. There are other times where I'm a lot more confused. Like if a dataset only needs a "cloud connection" to get to data and it does not prompt for the selection of a gateway. where would the PQ get executed The details are abstracted away from the user and the behavior can be uncertain. Is Microsoft hosting in a VM In a virtualization container Is it isolated from other customers or will it be"
[Reddit Link](https://redd.it/1jlmmhi) 2025-03-28T04:07Z [--] followers, [--] engagements
"GEN2 dataflows blanking out results on post-staging data I have a support case about this but it seems faster to reach FTE's here than thru CSS/pro support. For about a year we have had no problems with a large GEN2 dataflow. It stages some preliminary tables - each with data that is specific to particular fiscal year. Then as a last step we use table.combine on the related years in order to generate the final table (sort of like a de-partitioning operation). All tables have enabled staging. There are four years that are gathered and the final result is a single table with about [--] million"
[Reddit Link](https://redd.it/1jx1d9k) 2025-04-11T21:38Z [---] followers, [--] engagements
"SQL profiler against SQL analytics endpoint or DW Internally in Dataflow GEN2 the default storage destination will alternate rapidly between DataflowStagingLakehouse and DataflowStagingWarehouse. If I turn on additional logs for the dataflow I see the SQL statements sent to the WH. But they are truncated to [---] chars or so. Is there another way to inspect SQL query traffic to a WH or LH I would like to see the queries to review for perf problems costs and bugs. Sometimes they may help me identify workarounds while I'm waiting on a problem to be fixed that is out of my control. (I have a case"
[Reddit Link](https://redd.it/1jzv2k3) 2025-04-15T15:40Z [--] followers, [--] engagements
"Hitting Reset on a DW Workspace in Fabric MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1k1qrbb) 2025-04-18T00:08Z [--] followers, [--] engagements
"Cheaper Power Query Hosting I'm a conventional software programmer but I often use Power Query transformations. I rely on them for a lot of our simple models or when prototyping something new. The biggest issue I encounter with PQ is the cost that is incurred when my PQ is blocking (on an API for example). For Gen1 dataflows it was not expensive to wait on an API. But in Gen2 the costs have become unreasonable. Microsoft sets a stopwatch and charges us for the total duration of our PQ even when PQ is simply blocking on another third-party service. It leads me to think about other options for"
[Reddit Link](https://redd.it/1kcqj8g) 2025-05-02T02:39Z [---] followers, [--] engagements
"HDInsight outages this month I truly love HDInsight on Azure. It is a workhorse; it can process massive amounts of data at low cost. And there is very little drama related to outages and bugs (unlike Microsoft Synapse and Fabric). It runs smoothly day after day and year after year. In rare cases when I need CSS support it is normally a high quality experience (both pro and premier). This past month I've started experiencing severe outages as a result of cluster scaling problems. It is very surprising to have these sorts of experiences in HDI for the first time. The most recent was a four day"
[Reddit Link](https://redd.it/1l02q3v) 2025-05-31T17:33Z [---] followers, [--] engagements
"Who is responsible for DAX Out of curiosity I looked at the Wikipedia page for DAX and MDX. There is an engineer named in the credits for MDX and there is vendor adoption outside of Microsoft. For DAX there are no engineers named and no vendor outside of Microsoft have ever introduced the query language into another product so far as I'm aware. Are there any Microsoft engineers or PM names associated with the DAX language The highest profile names I'm aware of are folks *outside* of Microsoft who have been cheerleading it (eg the Italians for example) Nobody has ever attached their name to it"
[Reddit Link](https://redd.it/1lbpfdv) 2025-06-15T02:19Z [---] followers, [--] engagements
"Various questions about directlake on onelake I am just starting to take a look at directlake on onelake. I really appreciate having this additional layer of control. It feels almost like we are being given a "back-door" approach for populating a tabular model with the necessary data. We will have more control to manage the data structures used for storing the model's data. And it gives us a way to repurpose the same delta tables for purposes unrelated to the model (giving us a much bigger bang for the buck). The normal ("front door") way to import data into a model is via "import" operations"
[Reddit Link](https://redd.it/1lci29x) 2025-06-16T02:49Z [--] followers, [--] engagements
"Getting Deeper into Hype re: DirectLake Plus Import I started hearing about DirectLake plus Import recently. Marco Russo is a big advocate. Here is a link to a blog and video: Direct Lake vs Import vs Direct Lake+Import Fabric semantic models (May 2025) - SQLBI(https://www.sqlbi.com/blog/marco/2025/05/13/direct-lake-vs-import-vs-direct-lakeimport-fabric-semantic-models-may-2025/) I'm starting to drink the coolaid. But before I chug a whole pitcher of it I wanted to focus on a more couple performance concerns. Marco seems overly optimistic and claims things that seem too good to be true ie.: -"
[Reddit Link](https://redd.it/1lf9pi6) 2025-06-19T12:35Z [---] followers, [--] engagements
"Deploying an MDX Script into a Tabular Model We are nearing the end of our migration from on-prem multidimensional models to PBI tabular models. Some of our calcs in DAX are still pretty convoluted and slow (compared to MDX) especially where hierarchies are concerned. It is discouraging and I think it is an artificially imposed kind of problem since tabular models are perfectly capable of MDX. On the Excel side our users miss the ability to share their MDX solutions back to the I.T. team and deploy them as part of their cubes so that other users can share specialized calcs sets and so on. I'm"
[Reddit Link](https://redd.it/1lg4a9v) 2025-06-20T13:41Z [---] followers, [--] engagements
"Is there no Path to get a Pbip for this Model (directlake on onelake plus import) I'm trying to evaluate the "directlake on onelake" with "plus import" tables. We can find this approach here: https://www.sqlbi.com/blog/marco/2025/05/13/direct-lake-vs-import-vs-direct-lakeimport-fabric-semantic-models-may-2025/(https://www.sqlbi.com/blog/marco/2025/05/13/direct-lake-vs-import-vs-direct-lakeimport-fabric-semantic-models-may-2025/) I'm not able to open in PBI desktop for some reason once the import tables are introduced into the model. The error is: Live editing is only available for models"
[Reddit Link](https://redd.it/1lg6bs6) 2025-06-20T15:07Z [--] followers, [--] engagements
"Is Azure Analysis Services Dead Can we say Azure Analysis Services is dead I'm looking at the available data sources: https://learn.microsoft.com/en-us/analysis-services/azure-analysis-services/analysis-services-datasourceview=sql-analysis-services-2025#azure-data-sources(https://learn.microsoft.com/en-us/analysis-services/azure-analysis-services/analysis-services-datasourceview=sql-analysis-services-2025#azure-data-sources)"
[Reddit Link](https://redd.it/1li2cvo) 2025-06-22T23:55Z [---] followers, [--] engagements
"Would Fabric be able to Compete as a Multi-Cloud SaaS Could Fabric could go toe-to-toe with Databricks as a first-party platform on multiple clouds (AWS and GCP) Would it even be profitable if it was available on another cloud What would it take for Microsoft to make it available I'm guessing it would never happen but I'm having a hard time finding the right language to explain why. I think the simple explanation is that nobody wants it anywhere else. (There are too many great options for doing data analytics on the other clouds and Fabric would be crowded out.) Even in Azure it may not keep"
[Reddit Link](https://redd.it/1llg9of) 2025-06-27T00:28Z [---] followers, [--] engagements
"Remote Code Execution Bad or Good A few decades ago when someone mentioned the phrase "remote code execution" it indicated a serious vulnerability. In those days single identity or principal should NEVER have rights to do BOTH a deployment of code AND subsequently execute it. We rarely hear the phrase being mentioned anymore especially not in the context of data engineering. Our execution sandboxes are very restricted and Fabric developers who can deploy are also able to execute. The risks are ultimately very small. It is hard to envision a python notebook in Fabric which can replicate itself"
[Reddit Link](https://redd.it/1llgxik) 2025-06-27T01:00Z [---] followers, [--] engagements
"Direct-lake on OneLake performance I'm a little frustrated by my experiences with direct-lake on OneLake. I think there is misinformation circling about the source of performance regressions as compared to import. I'm seeing various problems - even after I've started importing all my dim tables (strategy called "plus import") . This still isnt making the model as fast as import. . The biggest problems are when using pivot tables in Excel and "stacking" *multiple* dimensions on rows. When evaluating these queries it requires jumping across multiple dims all joined back to the fact table. The"
[Reddit Link](https://redd.it/1lnuqr3) 2025-06-30T01:05Z [--] followers, [--] engagements
"Partition Questions related to DirectLake-on-OneLake The "DirectLake-on-OneLake" (DL-on-OL) is pretty compelling. I do have some concerns that it is likely to stay in preview for quite a LONG while (at least the parts I care about). For my purpose I want to allow most of my model to remain "import" for the sake of Excel hierarches and MDX. . I would ONLY use DirectLake-on-Onelake for a few isolated tables. This approach is called a "with import" model or "hybrid" (I think). If this "with import" feature is going to remain in preview for a couple of years I'm trying to brainstorm how to"
[Reddit Link](https://redd.it/1m34z0i) 2025-07-18T15:13Z [--] followers, [--] engagements
"Incredibly slow semantic model metadata via xmla/ssms My semantic models are hosted in an Azure region that is only [--] ms away from me. However it is a painfully slow process to use SSMS to connect to workspaces list models create scripted operations get the TMSL of the tables and so on. Eg. it can take [--] to [--] seconds to do simple things with the metadata of a model (read-only operations which should be instantaneous.) Does anyone experience this much pain with xmla endpoints in ssms or other tools Is this performance something that the Microsoft PG might improve one day I've been waiting 2"
[Reddit Link](https://redd.it/1m360en) 2025-07-18T15:53Z [--] followers, [--] engagements
"Azure managed spark We are moving an apache spark solution to azure for our staging and production environments. We would like to host on a managed spark service. The criteria for a selection would be to (1) Avoid proprietary extensions so that workloads can run the same way on premise as in azure and (2) Avoid vendor lock-in and (3) keep costs as low as possible. Fabric is already ruled out where spark is concerned given that it fails to meet any of these basic goals. Are the remaining options just Databricks and HDI and Synapse Where can I find one that doesn't have all the bells and"
[Reddit Link](https://redd.it/1m4phfc) 2025-07-20T13:53Z [--] followers, [--] engagements
"Smaller Clusters for Spark The smallest Spark cluster I can create seems to be a 4-core driver and 4-core executor both consuming up to [--] GB. This seems excessive and soaks up lots of CU's. Excessive(https://preview.redd.it/ix69i0b5zhef1.pngwidth=531&format=png&auto=webp&s=ce28510e4f07edb7845164e9f0c9e115b8eede79) . Can someone share a cheaper way to use Spark on Fabric About [--] years ago when we were migrating from Databricks to Synapse Analytics Workspaces the CSS engineers at Microsoft had said they were working on providing "single node clusters" which is an inexpensive way to run a Spark"
[Reddit Link](https://redd.it/1m6rklg) 2025-07-22T22:01Z [--] followers, [--] engagements
"Anyone know anything about HDInsight (2025) I'm really confused about the prospects of a platform in Azure called Microsoft HDInsight. Given that I've been a customer of this platform for a number of years I probably shouldn't be this confused. I really like HDInsight aside from the fact that it isn't keeping up with the latest open source Spark runtimes. There appears to be no public roadmap or announcements about its fate. I have tried to get in touch with product/program managers at Microsoft and had no luck. The version we use is v.5.1 and seems to be the only version left. There are no"
[Reddit Link](https://redd.it/1m6slis) 2025-07-22T22:44Z [--] followers, [--] engagements
"HDInsight Spark is Delivered in Azure with High-Severity Vulnerabilities I'm pretty confused by the lack of any public-facing communication or roadmaps for HDInsight. It is heartbreaking that such a great product is now ending its life in this way Everyone is probably aware that HDInsight had outdated components like Ubunto (18.04) and Spark (3.3.1). EG. Here is the doc showing Spark 3.3.1 is delivered with V.5.1: https://learn.microsoft.com/en-us/azure/hdinsight/hdinsight-5x-component-versioning(https://learn.microsoft.com/en-us/azure/hdinsight/hdinsight-5x-component-versioning) However I"
[Reddit Link](https://redd.it/1marbjq) 2025-07-27T16:55Z [--] followers, [--] engagements
"Semantic Model Query Execution from Databricks (like Sempy) We are migrating Spark workloads from Fabric to Databricks for reduced costs and improved notebook experiences. The "semantic models" are a type of component that has a pretty central place in our "Fabric" environment. We use them in a variety of ways. Eg. In Fabric an ipynb user can connect to them (via "sempy"). But in Databricks we are finding it to be more cumbersome to reach our data. I never expected our semantic models to be so inaccessible to remote python developers. I've done a small amount of investigation but I'm not"
[Reddit Link](https://redd.it/1maw9rf) 2025-07-27T20:12Z [--] followers, [--] engagements
"Should Engineers Pick their Storage Engine or should the Cloud Vendor Databricks a large and multi-cloud Spark service seems to be VERY motivated in getting customers to store their data one particular DW format. It seems odd to me considering they are supposed to be a flexible and open-source player. Why would an open-source vendor be so close-minded about using relational databases for storage Personally I am NOT a fan of storing ALL data in deltatables. Certainly that makes sense for data coming in and leaving the system (temp/bronze files on one side and gold/presentation on the other)."
[Reddit Link](https://redd.it/1mewifo) 2025-08-01T13:28Z [--] followers, [--] engagements
"Where do pyspark devs put checkpoints in fabric Oddly this is hard to find in a web search. At least in the context of fabric. Where do others put there checkpoint data (setcheckpointdir) Should I drop it in a temp for in the default lakehouse Is there a cheaper place for it (normal azure storage) Checkpoints are needed to truncate a logical plan in spark and avoid repeating cpu intensive operations. Cpu is not free even in spark I've been using local checkpoint in the past but it is known to be unreliable if spark executors are being dynamically deallocated (by choice). I think I need to use"
[Reddit Link](https://redd.it/1mf1f0g) 2025-08-01T16:40Z [--] followers, [--] engagements
"Ghost artifacts in workspace (typically they are deleted notebooks) Sometimes I need to clear some notebooks and redeploy or delete and re-upload. For whatever reason Fabric makes this super painful. Google AI says there are ghost artifacts and the moderators in the forums agreed: https://preview.redd.it/3qkhx1v3p8hf1.pngwidth=749&format=png&auto=webp&s=effc779cea7e334362c828f80da57acad6d6d9a0 The error presented to the user looks like this: **Message: OperationConflictError: A notebook with the same name "Whatever" already exists in workspace whatever.** Can someone tell me how long it takes"
[Reddit Link](https://redd.it/1migcrm) 2025-08-05T17:59Z [--] followers, [--] engagements
"DirectLake on OneLake - another unexpected gotcha in Excel I was pretty excited about the "DirectLake on OneLake" models in Power BI. Especially the variety where some part of the data is imported (called "D/L on O/L plus import" models). The idea behind the "plus import" model is that they would be more **compatible with Excel pivot tables**. After investing many days of effort into this architecture we find that users are NOT actually allowed to create calculated measures as we assumed they would. The error says "**MDX session-scope statements like CREATE MEMBER are not allowed on"
[Reddit Link](https://redd.it/1mikf2m) 2025-08-05T20:28Z [--] followers, [--] engagements
"Always being throttled on data IO in Azure SQL Database (forced to use hints) We are always throttled on I/O in Azure SQL. We pay for [--] vcores in a sql elastic pool. It is about $1600 per month. The "per-database settings" will allow all [--] vcores to be allocated to a single database. I do most of my testing on a single database off-hours in order to explore the underlying problems. My databases are continually getting throttled on IO ("data" and "logs" is often at 100% on the database). I have no problem with compute so it is disappointing to have to increase our vcores simply for the sake of"
[Reddit Link](https://redd.it/1miqjat) 2025-08-06T00:41Z [--] followers, [--] engagements
"Another One Bites the Dust (Azure SQL Connector for Spark) I wasn't paying attention at the time. The Spark connector we use for interacting with Azure SQL was killed in February. Microsoft seems unreliable when it comes to offering long-term support for data engineering solutions. At least once a year we get the rug pulled on us in one place or another. Here lies the remains of the Azure SQL connector that we had been using in various Azure-hosted Spark environments. https://github.com/microsoft/sql-spark-connector(https://github.com/microsoft/sql-spark-connector)"
[Reddit Link](https://redd.it/1misa14) 2025-08-06T02:02Z [--] followers, [--] engagements
"Standard Tier on Azure is Still Available. I used the pricing calculator today and noticed that the standard tier is about 25% cheaper for a common scenario on Azure. We typically define an average-sized cluster of five vm's of DS4v2 and we submit spark jobs on it via the API. Does anyone know why the Azure standard tier wasn't phased out yet It is odd that it didn't happen at the same time as AWS and Google Cloud. Given that the vast majority of our Spark jobs are NOT interactive it seems very compelling to save the 25%. If we also wish to have the interactive experience with unity catalog"
[Reddit Link](https://redd.it/1mqd4p0) 2025-08-14T21:01Z [--] followers, [--] engagements
"Are Databricks SQL Warehouses opensource Most of my exposure to spark has been outside of databricks. I'm spending more time in databricks again after a three year break or so. I see there is now a concept of a SQL warehouse aka SQL endpoint. Is this stuff opensource I'm assuming it is built on lots of proprietary extensions to spark (eg. serverless and photon and whatnot). I'm assuming there is NOT any way for me to get a so-called SQL warehouse running on my own laptop (. with the full set of DML and DDL capabilities). True Do the proprietary aspects of "SQL warehouses" make these things"
[Reddit Link](https://redd.it/1n8muwa) 2025-09-04T21:31Z [--] followers, [--] engagements
"Missing from Fabric - a Reverse ETL Tool Anyone hear of "Reverse ETL" I've been in the Fabric community for a while and don't see this term. Another data engineering subreddit uses it from time to time and I was a little jealous that they have both ETL and Reverse ETL tools In the context of Fabric I'm guessing that the term "Reverse ETL" would just be considered meaningless technobabble. It probably corresponds to retrieving data from a client after it has been added into the data platform. As such I'm guessing ALL the following might be considered "reverse ETL" tools with different"
[Reddit Link](https://redd.it/1nc1r2j) 2025-09-08T22:21Z [--] followers, [--] engagements
"Need a name. Adopted as Lori but there is one at my work. Have a Lori at work so want to change it. Looking for another name. Maybe Tessa Or Ellie Or viking Very curious and friendly kitty who is getting used to her surroundings. NameMyCat NameMyCat"
[Reddit Link](https://redd.it/1ng5jdh) 2025-09-13T18:51Z [--] followers, [---] engagements
"Frustrating Throttling Problem with an Azure SQL Query I have a query that runs for about [--] mins and gets about [--] million rows out of an Azure SQL database. It is doing an index seek on a clustered index with a predicate that limits to the current year. Based on the execution plan details it appears to be happening on a single thread (not a parallel plan) The problem is that I'm on a general purpose sku with [--] vcores. While the query is running the database becomes unusable to others. I need to be able to use the sql database for other things during this time. The query is consuming all of"
[Reddit Link](https://redd.it/1njismm) 2025-09-17T17:01Z [--] followers, [--] engagements
"Minimum Viable DirectLake on OneLake I just looked at the roadmap for Power BI https://roadmap.fabric.microsoft.com/product=powerbi(https://roadmap.fabric.microsoft.com/product=powerbi) I'm not seeing anything about DirectLake on OneLake. (aka DirectLake v2) I think it is still in preview without a planned GA date. Is there any list of milestones that need to be reached before this goes to GA Can we see the list How much longer might it take before we reach the first GA I was hoping to use this feature in production in [----] and the only major show-stopper for us are the Excel issues (Pivot"
[Reddit Link](https://redd.it/1nvoxjp) 2025-10-02T00:08Z [--] followers, [--] engagements
"How to isolate dev and test (unity catalog) I'm starting to use databricks unity catalog for the first time and at first glance I have concerns. I'm in a DEVELOPMENT workspace (instance of azure databricks) but it cannot be fully isolated from production. If someone shares something with me it appears in my list of catalogs even though I intend to remain isolated in my development "sandbox". I'm told there is no way to create an isolated metadata catalog to keep my dev and prod far away from each other in a given region. So I'm guessing I will be forced to create separate entra account for"
[Reddit Link](https://redd.it/1o0t89o) 2025-10-07T22:21Z [--] followers, [--] engagements
"Where to see Spark CPU and Memory in my Spark Cluster I have noticed that many of my notebooks are doing odd things like frenetically killing and recreating executors (yarn containers) every couple minutes. There are no YARN logs to be found in Fabric despite the fact that yarn appears to be the scheduler (according to the Spark UI). In many other Spark hosting environments I am shown the CPU and memory usage of the worker nodes. These resources (CPU and memory) are pretty essential things to monitor on any spark cluster yet I haven't found where these things are exposed to me for my worker"
[Reddit Link](https://redd.it/1o38qcu) 2025-10-10T18:21Z [--] followers, [--] engagements
"Azure data factory is a miserable pile of crap. dataengineering dataengineering"
[Reddit Link](https://redd.it/1emcns0) 2024-08-07T14:07Z [--] followers, [----] engagements
"Spark is excessively buggy MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1i2v9ke) 2025-01-16T18:30Z [--] followers, [--] engagements
"Alternatives to Service Bus We are planning on migrating our legacy service broker to Azure from on-prem. We have been using Jboss AMQ from Redhat which is based on Apache Active MQ and openwire (we were not using AMQP). We are considering all cloud-hosting options that are based on the open AMQP protocol. Azure service bus was obviously at the top of our list to be evaluate primarily because it is from Microsoft. I'm really not happy with what I'm seeing in azure service bus. The management tooling is poor. There is no visibility to see connected clients. There are arbitrary technical"
[Reddit Link](https://redd.it/1ippig8) 2025-02-15T00:37Z [--] followers, [--] engagements
"Dataflow Gen2 wetting the bed MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1iugy9w) 2025-02-21T03:10Z [--] followers, [---] engagements
"Material left in my garage I found a package of this material in my garage. The door was open and I'm guessing someone dropped it in there when we were away. Is it is a reasonable theory Never heard of bahai going from door to door like Jehovah's Witnesses. bahai bahai"
[Reddit Link](https://redd.it/1ivv5lh) 2025-02-22T22:40Z [---] followers, [---] engagements
"There is no formal QA department MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1j6jl8i) 2025-03-08T15:38Z [--] followers, [---] engagements
"More Adventures in Support MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1j9x8jh) 2025-03-12T22:53Z [--] followers, [--] engagements
"Half day outage w/GEN2 dataflows MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1jc46w7) 2025-03-15T20:23Z [--] followers, [---] engagements
"Timeout in service after three minutes MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1jgyk37) 2025-03-22T02:44Z [--] followers, [--] engagements
"Fabric DW Software Lifecycles MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1k0x554) 2025-04-16T22:25Z [--] followers, [--] engagements
"Is developer mode of power BI generally available (2025) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1l2dzkt) 2025-06-03T14:57Z [--] followers, [--] engagements
"PQ imports from Excel MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1lp3415) 2025-07-01T14:28Z [--] followers, [--] engagements
"My notebook in DEV is randomly accessing PROD lakehouse MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1mckmcu) 2025-07-29T19:14Z [--] followers, [--] engagements
"Spark notebook can corrupt delta MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1od9nyk) 2025-10-22T14:25Z [--] followers, [--] engagements
"Direct Lake on OneLake for Semantic Models (is it done yet) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1oicpjf) 2025-10-28T15:39Z [--] followers, [--] engagements
"Use Source Control for Model Permissions (or Just Backups) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1ontwry) 2025-11-04T01:08Z [--] followers, [--] engagements
"DataflowsStagingLakehouse in my workspace MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1p1g3b5) 2025-11-19T19:07Z [--] followers, [--] engagements
"Spark Connect for Building Applications databricks databricks"
[Reddit Link](https://redd.it/1p31ik1) 2025-11-21T15:33Z [--] followers, [--] engagements
"The DAX bug Involving Auto-Exist and ALL() MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1p61p76) 2025-11-25T03:21Z [--] followers, [--] engagements
"Why did Microsoft kill their Spark on Containers/Kubernetes dataengineering dataengineering"
[Reddit Link](https://redd.it/1pao0hv) 2025-11-30T17:35Z [--] followers, [--] engagements
"What is a reasonable timeframe for a feature to GA MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1pavnd8) 2025-11-30T23:11Z [--] followers, [--] engagements
"The Spark Notebook Monitoring UI is Removing my Stuff MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1pdkhpy) 2025-12-03T23:51Z [--] followers, [--] engagements
"Amazon Delivery Gone Wrong homeowners homeowners"
[Reddit Link](https://redd.it/1pdm1wx) 2025-12-04T00:57Z [--] followers, [--] engagements
"Possible Amazon Delivery Mishap (Claim Filed ) AmazonFlexDrivers AmazonFlexDrivers"
[Reddit Link](https://redd.it/1pdmc1c) 2025-12-04T01:11Z [--] followers, [--] engagements
""Dataset" name in docs (still) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1pf814t) 2025-12-05T22:12Z [--] followers, [--] engagements
"Is there any database mirroring feature in the databricks ecosystem databricks databricks"
[Reddit Link](https://redd.it/1pkbsng) 2025-12-11T23:10Z [--] followers, [--] engagements
"Questions about roadmap for python API (UDF worker) apachespark apachespark"
[Reddit Link](https://redd.it/1plycim) 2025-12-13T22:54Z [--] followers, [--] engagements
"Select/SelectMany vs Map/FlatMap csharp csharp"
[Reddit Link](https://redd.it/1pmwnwb) 2025-12-15T03:00Z [--] followers, [--] engagements
"Databricks SQL DW - stating the obvious. dataengineering dataengineering"
[Reddit Link](https://redd.it/1py4j8c) 2025-12-28T22:32Z [--] followers, [--] engagements
"Databricks SQL innovations planned databricks databricks"
[Reddit Link](https://redd.it/1pz8b3b) 2025-12-30T05:19Z [--] followers, [--] engagements
"Vast numbers of explorer.exe launched on windows [--] Windows11 Windows11"
[Reddit Link](https://redd.it/1q55711) 2026-01-06T10:43Z [--] followers, [--] engagements
"Isolation of sql context in interactive cluster databricks databricks"
[Reddit Link](https://redd.it/1q5mozo) 2026-01-06T16:09Z [--] followers, [--] engagements
"Capacity Throttling and Smoothing is a Failure of Biblical Proportions MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1q6vs4c) 2026-01-07T23:47Z [--] followers, [---] engagements
"AsyncContextThread in Nito.AsyncEx - any replacements available on .net [--] or later dotnet dotnet"
[Reddit Link](https://redd.it/1qb2v25) 2026-01-12T18:43Z [--] followers, [--] engagements
"PQ online auto-pain MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1qdt803) 2026-01-15T19:33Z [--] followers, [--] engagements
"Why Enforce Lowercase Queue Names in Service Bus AZURE AZURE"
[Reddit Link](https://redd.it/1qem5al) 2026-01-16T17:24Z [--] followers, [--] engagements
"Refresh icon in wrong spot (DF GEN2 CICD) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1qj8fg6) 2026-01-21T20:04Z [--] followers, [--] engagements
"Top Secret Technical Comms for Sharepoint sharepoint sharepoint"
[Reddit Link](https://redd.it/1qlys30) 2026-01-24T22:07Z [--] followers, [--] engagements
"ASWL New Era and Leadership MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1l6inaf) 2025-06-08T18:06Z [--] followers, [--] engagements
"DirectLake development in connected mode MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1lfkzq1) 2025-06-19T20:21Z [--] followers, [--] engagements
"Any Chance of Multi-Threaded Query Plans for PBI Semantic Models MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1m35fz9) 2025-07-18T15:31Z [--] followers, [--] engagements
"Managed Airflow in Databricks databricks databricks"
[Reddit Link](https://redd.it/1qakwb7) 2026-01-12T04:09Z [--] followers, [--] engagements
"SQL endpoint names for LH MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1qjy3js) 2026-01-22T16:01Z [--] followers, [--] engagements
"Do Pythons hate Windows Python Python"
[Reddit Link](https://redd.it/1ql3a3m) 2026-01-23T21:30Z [--] followers, [--] engagements
"Deployment from ADO yet (2026) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1qm4ew9) 2026-01-25T00:52Z [--] followers, [--] engagements
"Managing your Perspectives (2026) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1qpqpkm) 2026-01-28T22:54Z [--] followers, [--] engagements
"Maintaining Perspectives in [----] PowerBI PowerBI"
[Reddit Link](https://redd.it/1qq9j0b) 2026-01-29T15:16Z [--] followers, [--] engagements
"Azure Everything [---] AZURE AZURE"
[Reddit Link](https://redd.it/1qqe5f9) 2026-01-29T17:53Z [--] followers, [--] engagements
"Spark connector for SQL databases dead/reborn MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1qr7gb7) 2026-01-30T14:52Z [--] followers, [--] engagements
"The unnecessary ceremony related unappliedChanges.json in pbip PowerBI PowerBI"
[Reddit Link](https://redd.it/1quuq1o) 2026-02-03T17:33Z [--] followers, [--] engagements
"SQL endpoint synchronization requirement (2026) MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1qv4hqp) 2026-02-03T21:18Z [--] followers, [--] engagements
"MPEs for PLS - Why sooooo long MicrosoftFabric MicrosoftFabric"
[Reddit Link](https://redd.it/1qej3m0) 2026-01-16T15:37Z [--] followers, [--] engagements
"AI as the end user (lakebase) databricks databricks"
[Reddit Link](https://redd.it/1qmrwin) 2026-01-25T19:16Z [--] followers, [--] engagements
@SmallAd3697 SmallAd3697SmallAd3697 posts on Reddit about azure, databricks, microsoft, hosted the most. They currently have [---] followers and [---] posts still getting attention that total [--] engagements in the last [--] hours.
Social category influence technology brands 35% stocks 12% finance 3%
Social topic influence azure 17%, databricks #97, microsoft 11%, hosted 3%, away from 3%, lack of 2%, app 1%, network 1%, api 1%, stocks 1%
Top accounts mentioned or mentioned by @unitedcom
Top assets mentioned Microsoft Corp. (MSFT) Alphabet Inc Class A (GOOGL) Viking Holdings Ltd (VIK)
Top posts by engagements in the last [--] hours
"Slapping a vendor's brand on hosted duckdb dataengineering dataengineering"
Reddit Link 2026-01-04T20:48Z [--] followers, [---] engagements
"App Service disallows a common network API (C#.Net) AZURE AZURE"
Reddit Link 2026-02-10T19:36Z [--] followers, [--] engagements
"Slow dataset import against sql endpoint (F64 capacity with [--] million rows) MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-12-06T18:02Z [--] followers, [--] engagements
"Lack of Network Connectivity in Fabric dataengineering dataengineering"
Reddit Link 2026-01-17T02:44Z [--] followers, [--] engagements
"Publish to duckdb from databricks UC databricks databricks"
Reddit Link 2026-02-04T03:38Z [--] followers, [--] engagements
"Very high goodwill (Progress Software) stocks stocks"
Reddit Link 2025-07-06T17:55Z [--] followers, [---] engagements
"Multi table transactions databricks databricks"
Reddit Link 2025-11-15T20:01Z [--] followers, [--] engagements
"Using GEN2 dataflows (CICD variety) as a source is failing about 30% of time MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-02-13T15:16Z [--] followers, [--] engagements
"Showing exec plans for SQL analytics endpoint of LH For some time I've planned to start using the SQL analytics endpoint of a lakehouse. It seems to be one of the more innovative things that has happened in fabric recently. The Microsoft docs warn heavily against using it since it performs more slowly than directlake semantic model. However I have to believe that there are some scenarios where it is suitable. I didn't want to dive into these sorts of queries blindfolded especially given the caveats in the docs. Before trying to use them in a solution I had lots of questions to answer. Eg."
Reddit Link 2025-03-03T03:51Z [---] followers, [--] engagements
"Bag charge missing / not billed I've called a couple times now and United seems unconcerned about an issue impacting my son. . He travels end of April and booking confirmation says I payed for "premium bundle" for baggage. The booking confirmation came from United Airlines (notifications@united.com) It shows cc number cc payment of [------] consisting of fare [------] and Taxes [-----] and Premium add-ons [-----]. The premium add-ons were for baggage. As soon as I made purchase I started getting additional advertising to buy bags. Something seemed off. I called a couple times. The first time they"
Reddit Link 2025-03-17T18:14Z [---] followers, [--] engagements
"Cost trade-offs for occasionally used reports Are any developers in this community at liberty to pick a conventional ERP reporting approach with conventional tools like ssrs against the ERP/API Do you ever choose NOT to use power bi (PQ with a duplicated/remote copy of the same underlying data) Or does the conventional reporting go to a different team I'm a fan of PBI but it isn't a general purpose reporting tool. I can definitely see it's pro's and con's. Especially when it comes to cost. I've seen some crazy things happening in PBI from a cost perspective. I see places where report"
Reddit Link 2025-03-20T16:51Z [---] followers, [--] engagements
"Where does the mashup run There are times when I know where when and how my Power Query will run. Eg. I can run it from PBI desktop or thru an on-premise gateway. Or even in a vnet managed gateway. There are other times where I'm a lot more confused. Like if a dataset only needs a "cloud connection" to get to data and it does not prompt for the selection of a gateway. where would the PQ get executed The details are abstracted away from the user and the behavior can be uncertain. Is Microsoft hosting in a VM In a virtualization container Is it isolated from other customers or will it be"
Reddit Link 2025-03-28T04:07Z [--] followers, [--] engagements
"GEN2 dataflows blanking out results on post-staging data I have a support case about this but it seems faster to reach FTE's here than thru CSS/pro support. For about a year we have had no problems with a large GEN2 dataflow. It stages some preliminary tables - each with data that is specific to particular fiscal year. Then as a last step we use table.combine on the related years in order to generate the final table (sort of like a de-partitioning operation). All tables have enabled staging. There are four years that are gathered and the final result is a single table with about [--] million"
Reddit Link 2025-04-11T21:38Z [---] followers, [--] engagements
"SQL profiler against SQL analytics endpoint or DW Internally in Dataflow GEN2 the default storage destination will alternate rapidly between DataflowStagingLakehouse and DataflowStagingWarehouse. If I turn on additional logs for the dataflow I see the SQL statements sent to the WH. But they are truncated to [---] chars or so. Is there another way to inspect SQL query traffic to a WH or LH I would like to see the queries to review for perf problems costs and bugs. Sometimes they may help me identify workarounds while I'm waiting on a problem to be fixed that is out of my control. (I have a case"
Reddit Link 2025-04-15T15:40Z [--] followers, [--] engagements
"Hitting Reset on a DW Workspace in Fabric MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-04-18T00:08Z [--] followers, [--] engagements
"Cheaper Power Query Hosting I'm a conventional software programmer but I often use Power Query transformations. I rely on them for a lot of our simple models or when prototyping something new. The biggest issue I encounter with PQ is the cost that is incurred when my PQ is blocking (on an API for example). For Gen1 dataflows it was not expensive to wait on an API. But in Gen2 the costs have become unreasonable. Microsoft sets a stopwatch and charges us for the total duration of our PQ even when PQ is simply blocking on another third-party service. It leads me to think about other options for"
Reddit Link 2025-05-02T02:39Z [---] followers, [--] engagements
"HDInsight outages this month I truly love HDInsight on Azure. It is a workhorse; it can process massive amounts of data at low cost. And there is very little drama related to outages and bugs (unlike Microsoft Synapse and Fabric). It runs smoothly day after day and year after year. In rare cases when I need CSS support it is normally a high quality experience (both pro and premier). This past month I've started experiencing severe outages as a result of cluster scaling problems. It is very surprising to have these sorts of experiences in HDI for the first time. The most recent was a four day"
Reddit Link 2025-05-31T17:33Z [---] followers, [--] engagements
"Who is responsible for DAX Out of curiosity I looked at the Wikipedia page for DAX and MDX. There is an engineer named in the credits for MDX and there is vendor adoption outside of Microsoft. For DAX there are no engineers named and no vendor outside of Microsoft have ever introduced the query language into another product so far as I'm aware. Are there any Microsoft engineers or PM names associated with the DAX language The highest profile names I'm aware of are folks outside of Microsoft who have been cheerleading it (eg the Italians for example) Nobody has ever attached their name to it"
Reddit Link 2025-06-15T02:19Z [---] followers, [--] engagements
"Various questions about directlake on onelake I am just starting to take a look at directlake on onelake. I really appreciate having this additional layer of control. It feels almost like we are being given a "back-door" approach for populating a tabular model with the necessary data. We will have more control to manage the data structures used for storing the model's data. And it gives us a way to repurpose the same delta tables for purposes unrelated to the model (giving us a much bigger bang for the buck). The normal ("front door") way to import data into a model is via "import" operations"
Reddit Link 2025-06-16T02:49Z [--] followers, [--] engagements
"Getting Deeper into Hype re: DirectLake Plus Import I started hearing about DirectLake plus Import recently. Marco Russo is a big advocate. Here is a link to a blog and video: Direct Lake vs Import vs Direct Lake+Import Fabric semantic models (May 2025) - SQLBI(https://www.sqlbi.com/blog/marco/2025/05/13/direct-lake-vs-import-vs-direct-lakeimport-fabric-semantic-models-may-2025/) I'm starting to drink the coolaid. But before I chug a whole pitcher of it I wanted to focus on a more couple performance concerns. Marco seems overly optimistic and claims things that seem too good to be true ie.: -"
Reddit Link 2025-06-19T12:35Z [---] followers, [--] engagements
"Deploying an MDX Script into a Tabular Model We are nearing the end of our migration from on-prem multidimensional models to PBI tabular models. Some of our calcs in DAX are still pretty convoluted and slow (compared to MDX) especially where hierarchies are concerned. It is discouraging and I think it is an artificially imposed kind of problem since tabular models are perfectly capable of MDX. On the Excel side our users miss the ability to share their MDX solutions back to the I.T. team and deploy them as part of their cubes so that other users can share specialized calcs sets and so on. I'm"
Reddit Link 2025-06-20T13:41Z [---] followers, [--] engagements
"Is there no Path to get a Pbip for this Model (directlake on onelake plus import) I'm trying to evaluate the "directlake on onelake" with "plus import" tables. We can find this approach here: https://www.sqlbi.com/blog/marco/2025/05/13/direct-lake-vs-import-vs-direct-lakeimport-fabric-semantic-models-may-2025/(https://www.sqlbi.com/blog/marco/2025/05/13/direct-lake-vs-import-vs-direct-lakeimport-fabric-semantic-models-may-2025/) I'm not able to open in PBI desktop for some reason once the import tables are introduced into the model. The error is: Live editing is only available for models"
Reddit Link 2025-06-20T15:07Z [--] followers, [--] engagements
"Is Azure Analysis Services Dead Can we say Azure Analysis Services is dead I'm looking at the available data sources: https://learn.microsoft.com/en-us/analysis-services/azure-analysis-services/analysis-services-datasourceview=sql-analysis-services-2025#azure-data-sources(https://learn.microsoft.com/en-us/analysis-services/azure-analysis-services/analysis-services-datasourceview=sql-analysis-services-2025#azure-data-sources)"
Reddit Link 2025-06-22T23:55Z [---] followers, [--] engagements
"Would Fabric be able to Compete as a Multi-Cloud SaaS Could Fabric could go toe-to-toe with Databricks as a first-party platform on multiple clouds (AWS and GCP) Would it even be profitable if it was available on another cloud What would it take for Microsoft to make it available I'm guessing it would never happen but I'm having a hard time finding the right language to explain why. I think the simple explanation is that nobody wants it anywhere else. (There are too many great options for doing data analytics on the other clouds and Fabric would be crowded out.) Even in Azure it may not keep"
Reddit Link 2025-06-27T00:28Z [---] followers, [--] engagements
"Remote Code Execution Bad or Good A few decades ago when someone mentioned the phrase "remote code execution" it indicated a serious vulnerability. In those days single identity or principal should NEVER have rights to do BOTH a deployment of code AND subsequently execute it. We rarely hear the phrase being mentioned anymore especially not in the context of data engineering. Our execution sandboxes are very restricted and Fabric developers who can deploy are also able to execute. The risks are ultimately very small. It is hard to envision a python notebook in Fabric which can replicate itself"
Reddit Link 2025-06-27T01:00Z [---] followers, [--] engagements
"Direct-lake on OneLake performance I'm a little frustrated by my experiences with direct-lake on OneLake. I think there is misinformation circling about the source of performance regressions as compared to import. I'm seeing various problems - even after I've started importing all my dim tables (strategy called "plus import") . This still isnt making the model as fast as import. . The biggest problems are when using pivot tables in Excel and "stacking" multiple dimensions on rows. When evaluating these queries it requires jumping across multiple dims all joined back to the fact table. The"
Reddit Link 2025-06-30T01:05Z [--] followers, [--] engagements
"Partition Questions related to DirectLake-on-OneLake The "DirectLake-on-OneLake" (DL-on-OL) is pretty compelling. I do have some concerns that it is likely to stay in preview for quite a LONG while (at least the parts I care about). For my purpose I want to allow most of my model to remain "import" for the sake of Excel hierarches and MDX. . I would ONLY use DirectLake-on-Onelake for a few isolated tables. This approach is called a "with import" model or "hybrid" (I think). If this "with import" feature is going to remain in preview for a couple of years I'm trying to brainstorm how to"
Reddit Link 2025-07-18T15:13Z [--] followers, [--] engagements
"Incredibly slow semantic model metadata via xmla/ssms My semantic models are hosted in an Azure region that is only [--] ms away from me. However it is a painfully slow process to use SSMS to connect to workspaces list models create scripted operations get the TMSL of the tables and so on. Eg. it can take [--] to [--] seconds to do simple things with the metadata of a model (read-only operations which should be instantaneous.) Does anyone experience this much pain with xmla endpoints in ssms or other tools Is this performance something that the Microsoft PG might improve one day I've been waiting 2"
Reddit Link 2025-07-18T15:53Z [--] followers, [--] engagements
"Azure managed spark We are moving an apache spark solution to azure for our staging and production environments. We would like to host on a managed spark service. The criteria for a selection would be to (1) Avoid proprietary extensions so that workloads can run the same way on premise as in azure and (2) Avoid vendor lock-in and (3) keep costs as low as possible. Fabric is already ruled out where spark is concerned given that it fails to meet any of these basic goals. Are the remaining options just Databricks and HDI and Synapse Where can I find one that doesn't have all the bells and"
Reddit Link 2025-07-20T13:53Z [--] followers, [--] engagements
"Smaller Clusters for Spark The smallest Spark cluster I can create seems to be a 4-core driver and 4-core executor both consuming up to [--] GB. This seems excessive and soaks up lots of CU's. Excessive(https://preview.redd.it/ix69i0b5zhef1.pngwidth=531&format=png&auto=webp&s=ce28510e4f07edb7845164e9f0c9e115b8eede79) . Can someone share a cheaper way to use Spark on Fabric About [--] years ago when we were migrating from Databricks to Synapse Analytics Workspaces the CSS engineers at Microsoft had said they were working on providing "single node clusters" which is an inexpensive way to run a Spark"
Reddit Link 2025-07-22T22:01Z [--] followers, [--] engagements
"Anyone know anything about HDInsight (2025) I'm really confused about the prospects of a platform in Azure called Microsoft HDInsight. Given that I've been a customer of this platform for a number of years I probably shouldn't be this confused. I really like HDInsight aside from the fact that it isn't keeping up with the latest open source Spark runtimes. There appears to be no public roadmap or announcements about its fate. I have tried to get in touch with product/program managers at Microsoft and had no luck. The version we use is v.5.1 and seems to be the only version left. There are no"
Reddit Link 2025-07-22T22:44Z [--] followers, [--] engagements
"HDInsight Spark is Delivered in Azure with High-Severity Vulnerabilities I'm pretty confused by the lack of any public-facing communication or roadmaps for HDInsight. It is heartbreaking that such a great product is now ending its life in this way Everyone is probably aware that HDInsight had outdated components like Ubunto (18.04) and Spark (3.3.1). EG. Here is the doc showing Spark 3.3.1 is delivered with V.5.1: https://learn.microsoft.com/en-us/azure/hdinsight/hdinsight-5x-component-versioning(https://learn.microsoft.com/en-us/azure/hdinsight/hdinsight-5x-component-versioning) However I"
Reddit Link 2025-07-27T16:55Z [--] followers, [--] engagements
"Semantic Model Query Execution from Databricks (like Sempy) We are migrating Spark workloads from Fabric to Databricks for reduced costs and improved notebook experiences. The "semantic models" are a type of component that has a pretty central place in our "Fabric" environment. We use them in a variety of ways. Eg. In Fabric an ipynb user can connect to them (via "sempy"). But in Databricks we are finding it to be more cumbersome to reach our data. I never expected our semantic models to be so inaccessible to remote python developers. I've done a small amount of investigation but I'm not"
Reddit Link 2025-07-27T20:12Z [--] followers, [--] engagements
"Should Engineers Pick their Storage Engine or should the Cloud Vendor Databricks a large and multi-cloud Spark service seems to be VERY motivated in getting customers to store their data one particular DW format. It seems odd to me considering they are supposed to be a flexible and open-source player. Why would an open-source vendor be so close-minded about using relational databases for storage Personally I am NOT a fan of storing ALL data in deltatables. Certainly that makes sense for data coming in and leaving the system (temp/bronze files on one side and gold/presentation on the other)."
Reddit Link 2025-08-01T13:28Z [--] followers, [--] engagements
"Where do pyspark devs put checkpoints in fabric Oddly this is hard to find in a web search. At least in the context of fabric. Where do others put there checkpoint data (setcheckpointdir) Should I drop it in a temp for in the default lakehouse Is there a cheaper place for it (normal azure storage) Checkpoints are needed to truncate a logical plan in spark and avoid repeating cpu intensive operations. Cpu is not free even in spark I've been using local checkpoint in the past but it is known to be unreliable if spark executors are being dynamically deallocated (by choice). I think I need to use"
Reddit Link 2025-08-01T16:40Z [--] followers, [--] engagements
"Ghost artifacts in workspace (typically they are deleted notebooks) Sometimes I need to clear some notebooks and redeploy or delete and re-upload. For whatever reason Fabric makes this super painful. Google AI says there are ghost artifacts and the moderators in the forums agreed: https://preview.redd.it/3qkhx1v3p8hf1.pngwidth=749&format=png&auto=webp&s=effc779cea7e334362c828f80da57acad6d6d9a0 The error presented to the user looks like this: Message: OperationConflictError: A notebook with the same name "Whatever" already exists in workspace whatever. Can someone tell me how long it takes"
Reddit Link 2025-08-05T17:59Z [--] followers, [--] engagements
"DirectLake on OneLake - another unexpected gotcha in Excel I was pretty excited about the "DirectLake on OneLake" models in Power BI. Especially the variety where some part of the data is imported (called "D/L on O/L plus import" models). The idea behind the "plus import" model is that they would be more compatible with Excel pivot tables. After investing many days of effort into this architecture we find that users are NOT actually allowed to create calculated measures as we assumed they would. The error says "**MDX session-scope statements like CREATE MEMBER are not allowed on"
Reddit Link 2025-08-05T20:28Z [--] followers, [--] engagements
"Always being throttled on data IO in Azure SQL Database (forced to use hints) We are always throttled on I/O in Azure SQL. We pay for [--] vcores in a sql elastic pool. It is about $1600 per month. The "per-database settings" will allow all [--] vcores to be allocated to a single database. I do most of my testing on a single database off-hours in order to explore the underlying problems. My databases are continually getting throttled on IO ("data" and "logs" is often at 100% on the database). I have no problem with compute so it is disappointing to have to increase our vcores simply for the sake of"
Reddit Link 2025-08-06T00:41Z [--] followers, [--] engagements
"Another One Bites the Dust (Azure SQL Connector for Spark) I wasn't paying attention at the time. The Spark connector we use for interacting with Azure SQL was killed in February. Microsoft seems unreliable when it comes to offering long-term support for data engineering solutions. At least once a year we get the rug pulled on us in one place or another. Here lies the remains of the Azure SQL connector that we had been using in various Azure-hosted Spark environments. https://github.com/microsoft/sql-spark-connector(https://github.com/microsoft/sql-spark-connector)"
Reddit Link 2025-08-06T02:02Z [--] followers, [--] engagements
"Standard Tier on Azure is Still Available. I used the pricing calculator today and noticed that the standard tier is about 25% cheaper for a common scenario on Azure. We typically define an average-sized cluster of five vm's of DS4v2 and we submit spark jobs on it via the API. Does anyone know why the Azure standard tier wasn't phased out yet It is odd that it didn't happen at the same time as AWS and Google Cloud. Given that the vast majority of our Spark jobs are NOT interactive it seems very compelling to save the 25%. If we also wish to have the interactive experience with unity catalog"
Reddit Link 2025-08-14T21:01Z [--] followers, [--] engagements
"Are Databricks SQL Warehouses opensource Most of my exposure to spark has been outside of databricks. I'm spending more time in databricks again after a three year break or so. I see there is now a concept of a SQL warehouse aka SQL endpoint. Is this stuff opensource I'm assuming it is built on lots of proprietary extensions to spark (eg. serverless and photon and whatnot). I'm assuming there is NOT any way for me to get a so-called SQL warehouse running on my own laptop (. with the full set of DML and DDL capabilities). True Do the proprietary aspects of "SQL warehouses" make these things"
Reddit Link 2025-09-04T21:31Z [--] followers, [--] engagements
"Missing from Fabric - a Reverse ETL Tool Anyone hear of "Reverse ETL" I've been in the Fabric community for a while and don't see this term. Another data engineering subreddit uses it from time to time and I was a little jealous that they have both ETL and Reverse ETL tools In the context of Fabric I'm guessing that the term "Reverse ETL" would just be considered meaningless technobabble. It probably corresponds to retrieving data from a client after it has been added into the data platform. As such I'm guessing ALL the following might be considered "reverse ETL" tools with different"
Reddit Link 2025-09-08T22:21Z [--] followers, [--] engagements
"Need a name. Adopted as Lori but there is one at my work. Have a Lori at work so want to change it. Looking for another name. Maybe Tessa Or Ellie Or viking Very curious and friendly kitty who is getting used to her surroundings. NameMyCat NameMyCat"
Reddit Link 2025-09-13T18:51Z [--] followers, [---] engagements
"Frustrating Throttling Problem with an Azure SQL Query I have a query that runs for about [--] mins and gets about [--] million rows out of an Azure SQL database. It is doing an index seek on a clustered index with a predicate that limits to the current year. Based on the execution plan details it appears to be happening on a single thread (not a parallel plan) The problem is that I'm on a general purpose sku with [--] vcores. While the query is running the database becomes unusable to others. I need to be able to use the sql database for other things during this time. The query is consuming all of"
Reddit Link 2025-09-17T17:01Z [--] followers, [--] engagements
"Minimum Viable DirectLake on OneLake I just looked at the roadmap for Power BI https://roadmap.fabric.microsoft.com/product=powerbi(https://roadmap.fabric.microsoft.com/product=powerbi) I'm not seeing anything about DirectLake on OneLake. (aka DirectLake v2) I think it is still in preview without a planned GA date. Is there any list of milestones that need to be reached before this goes to GA Can we see the list How much longer might it take before we reach the first GA I was hoping to use this feature in production in [----] and the only major show-stopper for us are the Excel issues (Pivot"
Reddit Link 2025-10-02T00:08Z [--] followers, [--] engagements
"How to isolate dev and test (unity catalog) I'm starting to use databricks unity catalog for the first time and at first glance I have concerns. I'm in a DEVELOPMENT workspace (instance of azure databricks) but it cannot be fully isolated from production. If someone shares something with me it appears in my list of catalogs even though I intend to remain isolated in my development "sandbox". I'm told there is no way to create an isolated metadata catalog to keep my dev and prod far away from each other in a given region. So I'm guessing I will be forced to create separate entra account for"
Reddit Link 2025-10-07T22:21Z [--] followers, [--] engagements
"Where to see Spark CPU and Memory in my Spark Cluster I have noticed that many of my notebooks are doing odd things like frenetically killing and recreating executors (yarn containers) every couple minutes. There are no YARN logs to be found in Fabric despite the fact that yarn appears to be the scheduler (according to the Spark UI). In many other Spark hosting environments I am shown the CPU and memory usage of the worker nodes. These resources (CPU and memory) are pretty essential things to monitor on any spark cluster yet I haven't found where these things are exposed to me for my worker"
Reddit Link 2025-10-10T18:21Z [--] followers, [--] engagements
"Azure data factory is a miserable pile of crap. dataengineering dataengineering"
Reddit Link 2024-08-07T14:07Z [--] followers, [----] engagements
"Spark is excessively buggy MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-01-16T18:30Z [--] followers, [--] engagements
"Alternatives to Service Bus We are planning on migrating our legacy service broker to Azure from on-prem. We have been using Jboss AMQ from Redhat which is based on Apache Active MQ and openwire (we were not using AMQP). We are considering all cloud-hosting options that are based on the open AMQP protocol. Azure service bus was obviously at the top of our list to be evaluated primarily because it is from Microsoft. I'm really not happy with what I'm seeing in azure service bus. The management tooling is poor. There is no visibility to see connected clients. There are arbitrary technical"
Reddit Link 2025-02-15T00:37Z [--] followers, [--] engagements
"Dataflow Gen2 wetting the bed MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-02-21T03:10Z [--] followers, [---] engagements
"Material left in my garage I found a package of this material in my garage. The door was open and I'm guessing someone dropped it in there when we were away. Is it a reasonable theory? Never heard of bahai going from door to door like Jehovah's Witnesses. bahai bahai"
Reddit Link 2025-02-22T22:40Z [---] followers, [---] engagements
"There is no formal QA department MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-03-08T15:38Z [--] followers, [---] engagements
"More Adventures in Support MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-03-12T22:53Z [--] followers, [--] engagements
"Half day outage w/GEN2 dataflows MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-03-15T20:23Z [--] followers, [---] engagements
"Timeout in service after three minutes MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-03-22T02:44Z [--] followers, [--] engagements
"Fabric DW Software Lifecycles MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-04-16T22:25Z [--] followers, [--] engagements
"Is developer mode of power BI generally available (2025) MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-06-03T14:57Z [--] followers, [--] engagements
"PQ imports from Excel MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-07-01T14:28Z [--] followers, [--] engagements
"My notebook in DEV is randomly accessing PROD lakehouse MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-07-29T19:14Z [--] followers, [--] engagements
"Spark notebook can corrupt delta MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-10-22T14:25Z [--] followers, [--] engagements
"Direct Lake on OneLake for Semantic Models (is it done yet) MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-10-28T15:39Z [--] followers, [--] engagements
"Use Source Control for Model Permissions (or Just Backups) MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-11-04T01:08Z [--] followers, [--] engagements
"DataflowsStagingLakehouse in my workspace MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-11-19T19:07Z [--] followers, [--] engagements
"Spark Connect for Building Applications databricks databricks"
Reddit Link 2025-11-21T15:33Z [--] followers, [--] engagements
"The DAX bug Involving Auto-Exist and ALL() MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-11-25T03:21Z [--] followers, [--] engagements
"Why did Microsoft kill their Spark on Containers/Kubernetes dataengineering dataengineering"
Reddit Link 2025-11-30T17:35Z [--] followers, [--] engagements
"What is a reasonable timeframe for a feature to GA MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-11-30T23:11Z [--] followers, [--] engagements
"The Spark Notebook Monitoring UI is Removing my Stuff MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-12-03T23:51Z [--] followers, [--] engagements
"Amazon Delivery Gone Wrong homeowners homeowners"
Reddit Link 2025-12-04T00:57Z [--] followers, [--] engagements
"Possible Amazon Delivery Mishap (Claim Filed ) AmazonFlexDrivers AmazonFlexDrivers"
Reddit Link 2025-12-04T01:11Z [--] followers, [--] engagements
""Dataset" name in docs (still) MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-12-05T22:12Z [--] followers, [--] engagements
"Is there any database mirroring feature in the databricks ecosystem databricks databricks"
Reddit Link 2025-12-11T23:10Z [--] followers, [--] engagements
"Questions about roadmap for python API (UDF worker) apachespark apachespark"
Reddit Link 2025-12-13T22:54Z [--] followers, [--] engagements
"Select/SelectMany vs Map/FlatMap csharp csharp"
Reddit Link 2025-12-15T03:00Z [--] followers, [--] engagements
"Databricks SQL DW - stating the obvious. dataengineering dataengineering"
Reddit Link 2025-12-28T22:32Z [--] followers, [--] engagements
"Databricks SQL innovations planned databricks databricks"
Reddit Link 2025-12-30T05:19Z [--] followers, [--] engagements
"Vast numbers of explorer.exe launched on windows [--] Windows11 Windows11"
Reddit Link 2026-01-06T10:43Z [--] followers, [--] engagements
"Isolation of sql context in interactive cluster databricks databricks"
Reddit Link 2026-01-06T16:09Z [--] followers, [--] engagements
"Capacity Throttling and Smoothing is a Failure of Biblical Proportions MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-01-07T23:47Z [--] followers, [---] engagements
"AsyncContextThread in Nito.AsyncEx - any replacements available on .net [--] or later dotnet dotnet"
Reddit Link 2026-01-12T18:43Z [--] followers, [--] engagements
"PQ online auto-pain MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-01-15T19:33Z [--] followers, [--] engagements
"Why Enforce Lowercase Queue Names in Service Bus AZURE AZURE"
Reddit Link 2026-01-16T17:24Z [--] followers, [--] engagements
"Refresh icon in wrong spot (DF GEN2 CICD) MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-01-21T20:04Z [--] followers, [--] engagements
"Top Secret Technical Comms for Sharepoint sharepoint sharepoint"
Reddit Link 2026-01-24T22:07Z [--] followers, [--] engagements
"ASWL New Era and Leadership MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-06-08T18:06Z [--] followers, [--] engagements
"DirectLake development in connected mode MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-06-19T20:21Z [--] followers, [--] engagements
"Any Chance of Multi-Threaded Query Plans for PBI Semantic Models MicrosoftFabric MicrosoftFabric"
Reddit Link 2025-07-18T15:31Z [--] followers, [--] engagements
"Managed Airflow in Databricks databricks databricks"
Reddit Link 2026-01-12T04:09Z [--] followers, [--] engagements
"SQL endpoint names for LH MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-01-22T16:01Z [--] followers, [--] engagements
"Do Pythons hate Windows Python Python"
Reddit Link 2026-01-23T21:30Z [--] followers, [--] engagements
"Deployment from ADO yet (2026) MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-01-25T00:52Z [--] followers, [--] engagements
"Managing your Perspectives (2026) MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-01-28T22:54Z [--] followers, [--] engagements
"Maintaining Perspectives in [----] PowerBI PowerBI"
Reddit Link 2026-01-29T15:16Z [--] followers, [--] engagements
"Azure Everything [---] AZURE AZURE"
Reddit Link 2026-01-29T17:53Z [--] followers, [--] engagements
"Spark connector for SQL databases dead/reborn MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-01-30T14:52Z [--] followers, [--] engagements
"The unnecessary ceremony related unappliedChanges.json in pbip PowerBI PowerBI"
Reddit Link 2026-02-03T17:33Z [--] followers, [--] engagements
"SQL endpoint synchronization requirement (2026) MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-02-03T21:18Z [--] followers, [--] engagements
"MPEs for PLS - Why sooooo long MicrosoftFabric MicrosoftFabric"
Reddit Link 2026-01-16T15:37Z [--] followers, [--] engagements
"AI as the end user (lakebase) databricks databricks"
Reddit Link 2026-01-25T19:16Z [--] followers, [--] engagements