[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.] #  @bemikelive Michael Bendersky Michael Bendersky posts on X about databricks, grounded, drops, blog the most. They currently have XXXXX followers and XX posts still getting attention that total XXX engagements in the last XX hours. ### Engagements: XXX [#](/creator/twitter::139811295/interactions)  - X Month XXXXX -XX% ### Mentions: X [#](/creator/twitter::139811295/posts_active)  ### Followers: XXXXX [#](/creator/twitter::139811295/followers)  - X Month XXXXX +0.88% ### CreatorRank: XXXXXXXXX [#](/creator/twitter::139811295/influencer_rank)  ### Social Influence **Social category influence** [technology brands](/list/technology-brands) **Social topic influence** [databricks](/topic/databricks) #96, [grounded](/topic/grounded) #39, [drops](/topic/drops) #958, [blog](/topic/blog) **Top accounts mentioned or mentioned by** [@databricks](/creator/undefined) [@jefrankle](/creator/undefined) [@rishabhs](/creator/undefined) [@mateizaharia](/creator/undefined) [@jacobianneuro](/creator/undefined) [@deyneka_e](/creator/undefined) ### Top Social Posts Top posts by engagements in the last XX hours "We released OfficeQA today -- a hard benchmark for evaluating agents on grounded reasoning tasks. More details in our blog and the thread below" [X Link](https://x.com/bemikelive/status/1998491671609405748) 2025-12-09T20:35Z 1717 followers, 1996 engagements "Grounded Reasoning question answering and analysis based on large complex proprietary datasets comprised of unstructured documents and tabular data is a task many Databricks customers face daily" [X Link](https://x.com/bemikelive/status/1998488282972434829) 2025-12-09T20:21Z 1716 followers, XX engagements "We find that out of the box even frontier LLMs struggle with OfficeQA scoring XX% on a subset of the hardest questions. Furthermore all models display "correctness decay" performance drops steeply as we tighten our error tolerance" [X Link](https://x.com/bemikelive/status/1998488286759973173) 2025-12-09T20:21Z 1716 followers, XX engagements "While document pre-processing tools like Databricks ai_parse_document can boost performance challenges such as parsing errors dealing with ambiguity and visual reasoning remain" [X Link](https://x.com/bemikelive/status/1998488290144694533) 2025-12-09T20:21Z 1716 followers, XX engagements "Grounded Reasoning question answering and analysis based on large complex proprietary datasets comprised of unstructured documents and tabular data is a task many Databricks customers face daily" [X Link](https://x.com/bemikelive/status/1998491672880238631) 2025-12-09T20:35Z 1716 followers, XXX engagements "We find that out of the box even frontier LLMs struggle with OfficeQA scoring XX% on a subset of the hardest questions. Furthermore all models display "correctness decay" performance drops steeply as we tighten our error tolerance" [X Link](https://x.com/bemikelive/status/1998491677821120818) 2025-12-09T20:35Z 1716 followers, XX engagements "While document pre-processing tools like Databricks ai_parse_document can boost performance challenges such as parsing errors dealing with ambiguity and visual reasoning remain" [X Link](https://x.com/bemikelive/status/1998491680924893452) 2025-12-09T20:35Z 1716 followers, XX engagements "OfficeQA dataset is now available on Github: We are looking forward to seeing how your agents perform on it" [X Link](https://x.com/bemikelive/status/1998491682351054957) 2025-12-09T20:35Z 1716 followers, XX engagements
[GUEST ACCESS MODE: Data is scrambled or limited to provide examples. Make requests using your API key to unlock full data. Check https://lunarcrush.ai/auth for authentication information.]
@bemikelive Michael BenderskyMichael Bendersky posts on X about databricks, grounded, drops, blog the most. They currently have XXXXX followers and XX posts still getting attention that total XXX engagements in the last XX hours.
Social category influence technology brands
Social topic influence databricks #96, grounded #39, drops #958, blog
Top accounts mentioned or mentioned by @databricks @jefrankle @rishabhs @mateizaharia @jacobianneuro @deyneka_e
Top posts by engagements in the last XX hours
"We released OfficeQA today -- a hard benchmark for evaluating agents on grounded reasoning tasks. More details in our blog and the thread below"
X Link 2025-12-09T20:35Z 1717 followers, 1996 engagements
"Grounded Reasoning question answering and analysis based on large complex proprietary datasets comprised of unstructured documents and tabular data is a task many Databricks customers face daily"
X Link 2025-12-09T20:21Z 1716 followers, XX engagements
"We find that out of the box even frontier LLMs struggle with OfficeQA scoring XX% on a subset of the hardest questions. Furthermore all models display "correctness decay" performance drops steeply as we tighten our error tolerance"
X Link 2025-12-09T20:21Z 1716 followers, XX engagements
"While document pre-processing tools like Databricks ai_parse_document can boost performance challenges such as parsing errors dealing with ambiguity and visual reasoning remain"
X Link 2025-12-09T20:21Z 1716 followers, XX engagements
"Grounded Reasoning question answering and analysis based on large complex proprietary datasets comprised of unstructured documents and tabular data is a task many Databricks customers face daily"
X Link 2025-12-09T20:35Z 1716 followers, XXX engagements
"We find that out of the box even frontier LLMs struggle with OfficeQA scoring XX% on a subset of the hardest questions. Furthermore all models display "correctness decay" performance drops steeply as we tighten our error tolerance"
X Link 2025-12-09T20:35Z 1716 followers, XX engagements
"While document pre-processing tools like Databricks ai_parse_document can boost performance challenges such as parsing errors dealing with ambiguity and visual reasoning remain"
X Link 2025-12-09T20:35Z 1716 followers, XX engagements
"OfficeQA dataset is now available on Github: We are looking forward to seeing how your agents perform on it"
X Link 2025-12-09T20:35Z 1716 followers, XX engagements
/creator/twitter::bemikelive