使用位运算解决问题

Basics

位运算包含如下基本操作： &(and), |(or), ~(not), ^(exclusive-or, xor), << （left shift), >> (right shift)

合并：A | B
交集：A & B
减法：A & ~B
取反：ALL_BITS ^ A or ~A
设置某一位为1：A |= 1 << bit
清除某一位为0：A &= ~(1 << bit)
判断某一位是否为1：(A & 1 << bit) != 0
抽取最后一位bit位：A&-A or A&~(A-1) or x^(x&(x-1))
清除最后一位bit位：A & (A-1)
Get all 1-bits ~0 ????

Examples

int转换成二进制之后包含的1个数

def countOne(n):
    count = 0
    while n:
        n = n & (n  - 1)
        count += 1
    return count

int 求和

def getSum(a, b):
    return a if b == 0 else getSum(a ^ b, (a & b) << 1)

int 减法

def getSubtract(a, b):
    return a if b == 0 else getSubtract(a ^ b, (~a & b) << 1)

查找丢失的数字，长度为n的数字，内部包含 0，1，2，...，n，且不重复。现在有一个数字丢失了，找出哪个数字丢失。如n = 4, nums = [0, 1, 3], 丢失的是2

def missingNumber(nums):
    ret = 0
    for i, num in enumerate(nums):
        ret ^= i
        ret ^= num
    return ret ^ len(nums)

找出小于N的最大 2^x（每次清除最后一位1，找到n变成0之前的最后一个梳子状态也可以）

def largest_power(n):
    n |= n >> 1
    n |= n >> 2
    n |= n >> 4
    n |= n >> 8
    n |= n >> 16

    return (n + 1) >> 1
    
def largest_power2(n):
    pre = n
    while n:
        pre = n
        n &= n - 1
    return pre

给定一个无符号的int，反转对应的二进制

def reverseBits2(n):
    res = 0
    for i in range(32):
        res <<= 1
        res |= n & 1
        n >>= 1
    return res

给出区间[m, n],(0 <= m <= n <= 2147483647),返回区间内数字按位and之后的结果；如[5, 7] = 5 & 6 & 7 = 4

def rangeBitwiseAnd(m, n):
    a = 0
    while m != n:
        m >>= 1
        n >>= 1
        a += 1
    return m << a

给定无符号的n，转换成二进制之后包含1的数量(Hamming weight)

def hammingWeight(n):
    count = 0
    while n:
        n &= n - 1
        count += 1
    return count

Application

REPEATED DNA SEQUENCES

在一个DNA字符串(只包含ACGT)中找出重复的长度为10的子串，例：
Given s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT",
Return: ["AAAAACCCCC", "CCCCCAAAAA"].

解法-1：遍历所有的子串，通过dict判断是否重复

class Solution(object):
    def findRepeatedDnaSequences(self, s):
        """
        :type s: str
        :rtype: List[str]
        """
        if len(s) < 10: return []
        
        sub_dict = {}
        ans = []
        for i in range(len(s) - 10 + 1):
            substring = s[i:i + 10]
            if sub_dict.has_key(substring):
                if sub_dict[substring] == 1:
                    ans.append(substring)
                sub_dict[substring] += 1
            else:
                sub_dict[substring] = 1
        return ans

解法-2：上述需要把所有长度为10的子串都记录下来，会占一定的空间，我们使用二进制来优化空间占用。因为只有ACGT四个字符，所以我们可以考虑用两位二进制来表示一个字符，A = 00，C = 01，G = 10，T = 11，这样对于每个子串，我们可以使用 20位二进制来表示，节省了上述key占有的空间

class Solution(object):
    def findRepeatedDnaSequences(self, s):
        """
        :type s: str
        :rtype: List[str]
        """
        if len(s) <= 10: return []

        char_dict = {'A': 0, 'C': 1, 'G': 2, 'T': 3}
        sub_dict = {}
        ans = []
        cur = 0
        for i in range(9):
            cur = (cur << 2) | char_dict[s[i]]

        for i in range(9, len(s)):
            cur = (cur & 0x3ffff) << 2 | char_dict[s[i]]
            if sub_dict.has_key(cur):
                if sub_dict[cur] == 1:
                    ans.append(s[i - 9:i + 1])
                sub_dict[cur] += 1
            else:
                sub_dict[cur] = 1
        return ans

Majority Element

给定长度为n的数字，找出其中出现次数超过⌊ n/2 ⌋的数字，众数

解法-1：dict记录出现次数，遍历查看出现次数最多的

class Solution(object):
    def majorityElement(self, nums):
        """
        :type nums: List[int]
        :rtype: int
        """
        num_dict = {}
        for i, num in enumerate(nums):
            if not num_dict.has_key(num):
                num_dict[num] = 0
            num_dict[num] += 1

        for num in num_dict.keys():
            if num_dict[num] >= len(nums) / 2 + len(nums) % 2:
                return num

解法-2：迭代32次，每次计算所有n个数第i位为1的个数，由于众数一定存在，所以如果第i位为1的个数大于 ⌊ n/2 ⌋，则众数中相应位肯定也为1

class Solution(object):
    def majorityElement_bit(self, nums):
        mask, ret = 1, 0
        for i in range(32):
            count = 0
            for num in nums:
                if mask & num: count += 1
            if count > len(nums) // 2:
                # if the 31th bit if 1,
                # it means it's a negative number
                if i == 31:
                    ret = -((1 << 31) - ret)
                else:
                    ret |= mask
            mask <<= 1
        return ret

Single Number III

给定一组数，其中有两个元素只出现一次，其他都出现两次，找出只出现一次的两个元素

解法-1：dict计数

class Solution(object):
    def singleNumber(self, nums):
        """
        :type nums: List[int]
        :rtype: List[int]
        """
        num_dict = {}
        ans = []
        for num in nums:
            if not num_dict.has_key(num):
                num_dict[num] = 0
            num_dict[num] += 1

        for num in num_dict.keys():
            if num_dict[num] == 1:
                ans.append(num)

        return ans

解法-2：把所有数字都xor下，可得到a ^ b，通过a ^ b随便取其中等于1的一位(表示a和b在这一位中一个是0，一个是1)，去和数组所有的数字 & 下，可以把数组分成两组，且两组数都满足只有一个数字出现一次，其他数字都出现两次。分别对两组数求只出现一次的数字。

class Solution(object):
    def singleNumber_bit(self, nums):
        """
        :type nums: List[int]
        :rtype: List[int]
        """
        s = 0
        for x in nums:
            s ^= x
        s &= -s

        ans = [0, 0]
        for x in nums:
            if x & s:
                ans[0] ^= x
            else:
                ans[1] ^= x

        return ans

Maximum Product of Word Lengths

给定一串单词，从中找出两个没有公用字符的单子，且length(word[i]) * length(word[j])最大，返回最大值。

解法：复杂度O(n^2),主要问题是判断两个word是否有公用字符,把使用到的字符转换成二进制数进行记录,两个word对应的二进制数 & 一下如果 = 0，则表示没有公用字符。

class Solution(object):
    def getBitNum(self, word):
        num = 0
        for c in word:
            num |= 1 << (ord(c) - ord('a'))
        return num


    def maxProduct(self, words):
        """
        :type words: List[str]
        :rtype: int
        """
        word_bit = {}
        ans = 0
        for word in words:
            word_bit[word] = self.getBitNum(word)

        l = len(words)

        for i, word1 in enumerate(words):
            for j, word2 in enumerate(words, i + 1):
                if not word_bit[word1] & word_bit[word2]:
                    ans = max(ans, len(word1) * len(word2))

        return ans

原文：A summary: how to use bit manipulation to solve problems easily and efficiently