Disruptive innovation comes in two flavors: (1) New-market disruption, where the company creates and claims a new segment in an existing market by catering to an underserved customer base, or (2) Low-end disruption, in which a company uses a low-cost business model to enter at the bottom of an existing market and claim a segment.
pair-preference-model-LLaMA3-8B by RLHFlow: Really strong reward model, trained to take in two inputs at once, which is the top open reward model on RewardBench (beating one of Cohere’s).
DeepSeek-V2 by deepseek-ai (21B active, 236B total param.): Another strong MoE base model from the DeepSeek team. Some people are questioning the very high MMLU sc... See more
How do models represent style, and how can we more precisely extract and steer it?
A commonly requested feature in almost any LLM-based writing application is “I want the AI to respond in my style of writing,” or “I want the AI to adhere to this style guide.” Aside from costly and complicated multi-stage finetuning processes like Anthropic’s RL with... See more
We can detect factually inconsistent summaries via the natural language inference (NLI) task. The NLI task works like this: Given a premise sentence and a hypothesis sentence, the goal is to predict if the hypothesis is entailed by, neutral, or contradicts the premise.
“Writing shows up within a company in two primary ways: paper trails and curations.
Paper trails are documented accounts of what happened, typically produced in the run of work while it’s happening. Meeting notes are the most obvious example.
Curations are artifacts of work produced to contribute to the system of shared knowledge. These are typically... See more
In the digital world, self-nudging aims to empower people to be citizen ‘choice architects’ by designing their informational environments in ways that work best for them and that constrain their activities in beneficial ways. We can, for instance, remove distracting and irresistible notifications. We may set specific times in which messages can be ... See more
It’s underdigitized. According to the McKinsey Industry Digitalization Index, only the agriculture sector is less digitized than construction. The typical IT spend for construction companies is 1-2% of the revenue, compared with the 3-5% average across industries. Moreover, there are many barriers to digital technology adoption including skill mana... See more
Token Embeddings: These are vector representations of tokens, which can be characters, subwords, or other text units. Token embeddings are particularly useful in languages with complex morphology or when handling out-of-vocabulary words. Models like BERT use token embeddings to represent subword units.
Word Embeddings: These are dense vector represe... See more