修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法

2024-01-04

以下 verilog 源代码和/或测试平台可以很好地工作商业模拟器，iverilog https://www.edaplayground.com/x/3TuQ也形式化验证工具（yosys-smtbmc） https://gist.github.com/promach/5f2d9a9494704ed93cf65687c982198c

请将有关 `ifdef FORMAL 的投诉保留到稍后。我需要它们与 yosys-smtbmc 一起使用，它还不支持绑定命令。

我现在正在调试生成编码，因为乘法（使用修改的 baugh-wooley 算法）尚未起作用。

当 o_valid 置位时，乘法代码应给出 o_p = i_a * i_b = 3*2 = 6，但波形清楚地显示代码给出 o_p = 0x20 = 32

test_multiply.v

// Testbench
module test_multiply;

  parameter A_WIDTH=4, B_WIDTH=4;

  reg i_clk;
  reg i_reset;
  reg i_ce;
  reg signed[(A_WIDTH-1):0] i_a;
  reg signed[(B_WIDTH-1):0] i_b;
  wire signed[(A_WIDTH+B_WIDTH-1):0] o_p;
  wire o_valid;

  // Instantiate design under test
  multiply #(A_WIDTH, B_WIDTH) mul(.clk(i_clk), .reset(i_reset), .in_valid(i_ce), .in_A(i_a), .in_B(i_b), .out_valid(o_valid), .out_C(o_p));


  initial begin
    // Dump waves
    $dumpfile("test_multiply.vcd");
    $dumpvars(0, test_multiply);

    i_clk = 0;
    i_reset = 0;
    i_ce = 0;
    i_a = 0;
    i_b = 0;

  end

  localparam SMALLER_WIDTH = (A_WIDTH <= B_WIDTH) ? A_WIDTH : B_WIDTH;
  localparam NUM_OF_INTERMEDIATE_LAYERS = $clog2(SMALLER_WIDTH);

  genvar i, j; // array index

  generate
    for(i = 0; i < NUM_OF_INTERMEDIATE_LAYERS; i = i + 1) begin
        for(j = 0; j < SMALLER_WIDTH; j = j + 1) begin
            initial $dumpvars(0, test_multiply.mul.middle_layers[i][j]);
        end
    end
  endgenerate

  always #5 i_clk = !i_clk;

  initial begin

    @(posedge i_clk);
    @(posedge i_clk);

    $display("Reset flop.");

    i_reset = 1;

    @(posedge i_clk);
    @(posedge i_clk);

    i_reset = 0;

    @(posedge i_clk);
    @(posedge i_clk);

    i_ce = 1;
    i_a = 3;
    i_b = 2;

    #50 $finish;

  end

endmodule

乘法.v

module multiply #(parameter A_WIDTH=16, B_WIDTH=16)
(clk, reset, in_valid, out_valid, in_A, in_B, out_C); // C=A*B

`ifdef FORMAL
parameter A_WIDTH = 4;
parameter B_WIDTH = 4;
`endif

input clk, reset;
input in_valid; // to signify that in_A, in_B are valid, multiplication process can start
input signed [(A_WIDTH-1):0] in_A;
input signed [(B_WIDTH-1):0] in_B;
output signed [(A_WIDTH+B_WIDTH-1):0] out_C;
output reg out_valid; // to signify that out_C is valid, multiplication finished

/* 
   This signed multiplier code architecture is a combination of row adder tree and 
   modified baugh-wooley algorithm, thus requires an area of O(N*M*logN) and time O(logN)
   with M, N being the length(bitwidth) of the multiplicand and multiplier respectively

   see [url]https://i.imgur.com/NaqjC6G.png[/url] or 
   Row Adder Tree Multipliers in [url]http://www.andraka.com/multipli.php[/url] or
   [url]https://pdfs.semanticscholar.org/415c/d98dafb5c9cb358c94189927e1f3216b7494.pdf#page=10[/url]
   regarding the mechanisms within all layers

   In terms of fmax consideration: In the case of an adder tree, the adders making up the levels
   closer to the input take up real estate (remember the structure of row adder tree).  As the 
   size of the input multiplicand bitwidth grows, it becomes more and more difficult to find a
   placement that does not use long routes involving multiple switch nodes for FPGA.  The result
   is the maximum clocking speed degrades quickly as the size of the bitwidth grows.

   For signed multiplication, see also modified baugh-wooley algorithm for trick in skipping 
   sign extension (implemented as verilog example in [url]https://www.dsprelated.com/showarticle/555.php[/url]),
   thus smaller final routed silicon area.

   [url]https://stackoverflow.com/questions/54268192/understanding-modified-baugh-wooley-multiplication-algorithm/[/url]

   All layers are pipelined, so throughput = one result for each clock cycle 
   but each multiplication result still have latency = NUM_OF_INTERMEDIATE_LAYERS 
*/


// The multiplication of two numbers is equivalent to adding as many copies of one 
// of them, the multiplicand, as the value of the other one, the multiplier.
// Therefore, multiplicand always have the larger width compared to multipliers

localparam SMALLER_WIDTH = (A_WIDTH <= B_WIDTH) ? A_WIDTH : B_WIDTH;
localparam LARGER_WIDTH = (A_WIDTH > B_WIDTH) ? A_WIDTH : B_WIDTH;

wire [(LARGER_WIDTH-1):0] MULTIPLICAND = (A_WIDTH > B_WIDTH) ? in_A : in_B ;
wire [(SMALLER_WIDTH-1):0] MULTIPLIPLIER = (A_WIDTH <= B_WIDTH) ? in_A : in_B ;

`ifdef FORMAL
// to keep the values of multiplicand and multiplier before the multiplication finishes 
reg [(LARGER_WIDTH-1):0] MULTIPLICAND_reg;
reg [(SMALLER_WIDTH-1):0] MULTIPLIPLIER_reg;

always @(posedge clk)
begin
    if(reset) begin
        MULTIPLICAND_reg <= 0;
        MULTIPLIPLIER_reg <= 0;
    end

    else if(in_valid) begin
        MULTIPLICAND_reg <= MULTIPLICAND;
        MULTIPLIPLIER_reg <= MULTIPLIPLIER;
    end
end
`endif

localparam NUM_OF_INTERMEDIATE_LAYERS = $clog2(SMALLER_WIDTH);


/*Binary multiplications and additions for partial products rows*/

// first layer has "SMALLER_WIDTH" entries of data of width "LARGER_WIDTH"
// This resulted in a binary tree with faster vertical addition processes as we have 
// lesser (NUM_OF_INTERMEDIATE_LAYERS) rows to add

// intermediate partial product rows additions
// Imagine a rhombus of height of "SMALLER_WIDTH" and width of "LARGER_WIDTH"
// being re-arranged into binary row adder tree
// such that additions can be done in O(logN) time

//reg [(NUM_OF_INTERMEDIATE_LAYERS-1):0][(SMALLER_WIDTH-1):0][(A_WIDTH+B_WIDTH-1):0] middle_layers;
reg [(A_WIDTH+B_WIDTH-1):0] middle_layers[(NUM_OF_INTERMEDIATE_LAYERS-1):0][0:(SMALLER_WIDTH-1)];
//reg [(NUM_OF_INTERMEDIATE_LAYERS-1):0] middle_layers [0:(SMALLER_WIDTH-1)] [(A_WIDTH+B_WIDTH-1):0];
//reg middle_layers [(NUM_OF_INTERMEDIATE_LAYERS-1):0][0:(SMALLER_WIDTH-1)][(A_WIDTH+B_WIDTH-1):0];

generate // duplicates the leafs of the binary tree

    genvar layer; // layer 0 means the youngest leaf, layer N means the tree trunk

    for(layer=0; layer<NUM_OF_INTERMEDIATE_LAYERS; layer=layer+1) begin: intermediate_layers

        integer pp_index; // leaf index within each layer of the tree
        integer bit_index; // index of binary string within each leaf

        always @(posedge clk)
        begin
            if(reset) 
            begin
                for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                    middle_layers[layer][pp_index] <= 0;
            end

            else begin

                if(layer == 0)  // all partial products rows are in first layer
                begin

                    // generation of partial products rows
                    for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                        middle_layers[layer][pp_index] <= 
                        (MULTIPLICAND & MULTIPLIPLIER[pp_index]);

                    // see modified baugh-wooley algorithm: [url]https://i.imgur.com/VcgbY4g.png[/url] from
                                        // page 122 of book "Ultra-Low-Voltage Design of Energy-Efficient Digital Circuits"
                    for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                        middle_layers[layer][pp_index][LARGER_WIDTH-1] <= 
                       !middle_layers[layer][pp_index][LARGER_WIDTH-1];

                    middle_layers[layer][SMALLER_WIDTH-1] <= !middle_layers[layer][SMALLER_WIDTH-1];
                    middle_layers[layer][0][LARGER_WIDTH] <= 1;
                    middle_layers[layer][SMALLER_WIDTH-1][LARGER_WIDTH] <= 1;
                end

                // adding the partial product rows according to row adder tree architecture
                else begin
                    for(pp_index=0; pp_index<(SMALLER_WIDTH >> layer) ; pp_index=pp_index+1)
                        middle_layers[layer][pp_index] <=
                        middle_layers[layer-1][pp_index<<1] +
                      (middle_layers[layer-1][(pp_index<<1) + 1]) << 1;

                    // bit-level additions using full adders
                    /*for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                        for(bit_index=0; bit_index<(LARGER_WIDTH+layer); bit_index=bit_index+1)
                            full_adder fa(.clk(clk), .reset(reset), .ain(), .bin(), .cin(), .sum(), .cout());*/
                end
            end
        end
    end

endgenerate

assign out_C = (reset)? 0 : middle_layers[NUM_OF_INTERMEDIATE_LAYERS-1][0];


/*Checking if the final multiplication result is ready or not*/

reg [($clog2(NUM_OF_INTERMEDIATE_LAYERS)-1):0] out_valid_counter; // to track the multiply stages
reg multiply_had_started;

always @(posedge clk)
begin
    if(reset) 
    begin
        multiply_had_started <= 0;
        out_valid <= 0;
        out_valid_counter <= 0;
    end

    else if(out_valid_counter == NUM_OF_INTERMEDIATE_LAYERS-1) begin
        multiply_had_started <= 0;
        out_valid <= 1;
        out_valid_counter <= 0;
    end

    else if(in_valid && !multiply_had_started) begin
        multiply_had_started <= 1;
        out_valid <= 0; // for consecutive multiplication
    end

    else begin
        out_valid <= 0;
        if(multiply_had_started) out_valid_counter <= out_valid_counter + 1;
    end
end


`ifdef FORMAL

initial assume(reset);
initial assume(in_valid == 0);

wire sign_bit = MULTIPLICAND_reg[LARGER_WIDTH-1] ^ MULTIPLIPLIER_reg[SMALLER_WIDTH-1];

always @(posedge clk)
begin
    if(reset) assert(out_C == 0);

    else if(out_valid) begin
        assert(out_C == (MULTIPLICAND_reg * MULTIPLIPLIER_reg));
        assert(out_C[A_WIDTH+B_WIDTH-1] == sign_bit);
    end
end

`endif

`ifdef FORMAL

localparam user_A = 3;
localparam user_B = 2;

always @(posedge clk)
begin
    cover(in_valid && (in_A == user_A) && (in_B == user_B));
    cover(out_valid);
end

`endif

endmodule

问题已解决：以下代码现在在 vivado 模拟和形式验证中的 cover() 中给出了正确的有符号乘法结果。

See 乘法.v https://gist.github.com/promach/5f2d9a9494704ed93cf65687c982198c#file-multiply-v以及对应的正确波形

module multiply #(parameter A_WIDTH=16, B_WIDTH=16)
(clk, reset, in_valid, out_valid, in_A, in_B, out_C); // C=A*B

`ifdef FORMAL
parameter A_WIDTH = 4;
parameter B_WIDTH = 6;
`endif

input clk, reset;
input in_valid; // to signify that in_A, in_B are valid, multiplication process can start
input signed [(A_WIDTH-1):0] in_A;
input signed [(B_WIDTH-1):0] in_B;
output signed [(A_WIDTH+B_WIDTH-1):0] out_C;
output reg out_valid; // to signify that out_C is valid, multiplication finished

/* 
   This signed multiplier code architecture is a combination of row adder tree and 
   modified baugh-wooley algorithm, thus requires an area of O(N*M*logN) and time O(logN)
   with M, N being the length(bitwidth) of the multiplicand and multiplier respectively

   see https://i.imgur.com/NaqjC6G.png or 
   Row Adder Tree Multipliers in http://www.andraka.com/multipli.php or
   https://pdfs.semanticscholar.org/415c/d98dafb5c9cb358c94189927e1f3216b7494.pdf#page=10
   regarding the mechanisms within all layers

   In terms of fmax consideration: In the case of an adder tree, the adders making up the levels
   closer to the input take up real estate (remember the structure of row adder tree).  As the 
   size of the input multiplicand bitwidth grows, it becomes more and more difficult to find a
   placement that does not use long routes involving multiple switch nodes for FPGA.  The result
   is the maximum clocking speed degrades quickly as the size of the bitwidth grows.

   For signed multiplication, see also modified baugh-wooley algorithm for trick in skipping 
   sign extension (implemented as verilog example in https://www.dsprelated.com/showarticle/555.php),
   thus smaller final routed silicon area.
   https://stackoverflow.com/questions/54268192/understanding-modified-baugh-wooley-multiplication-algorithm/

   All layers are pipelined, so throughput = one result for each clock cycle 
   but each multiplication result still have latency = NUM_OF_INTERMEDIATE_LAYERS 
*/


// The multiplication of two numbers is equivalent to adding as many copies of one 
// of them, the multiplicand, as the value of the other one, the multiplier.
// Therefore, multiplicand always have the larger width compared to multipliers

localparam SMALLER_WIDTH = (A_WIDTH <= B_WIDTH) ? A_WIDTH : B_WIDTH;
localparam LARGER_WIDTH = (A_WIDTH > B_WIDTH) ? A_WIDTH : B_WIDTH;

wire [(LARGER_WIDTH-1):0] MULTIPLICAND = (A_WIDTH > B_WIDTH) ? in_A : in_B ;
wire [(SMALLER_WIDTH-1):0] MULTIPLIER = (A_WIDTH <= B_WIDTH) ? in_A : in_B ;


// to keep the values of multiplicand and multiplier before the multiplication finishes 
reg signed [(LARGER_WIDTH-1):0] MULTIPLICAND_reg;
reg signed [(SMALLER_WIDTH-1):0] MULTIPLIER_reg;

always @(posedge clk)
begin
    if(reset) begin
        MULTIPLICAND_reg <= 0;
        MULTIPLIER_reg <= 0;
    end

    else if(in_valid) begin
        MULTIPLICAND_reg <= MULTIPLICAND;
        MULTIPLIER_reg <= MULTIPLIER;
    end
end


localparam NUM_OF_INTERMEDIATE_LAYERS = $clog2(SMALLER_WIDTH);


/*Binary multiplications and additions for partial products rows*/

// first layer has "SMALLER_WIDTH" entries of data of width "LARGER_WIDTH"
// This resulted in a binary tree with faster vertical addition processes as we have 
// lesser (NUM_OF_INTERMEDIATE_LAYERS) rows to add

// intermediate partial product rows additions
// Imagine a rhombus of height of "SMALLER_WIDTH" and width of "LARGER_WIDTH"
// being re-arranged into binary row adder tree
// such that additions can be done in O(logN) time

//reg [(NUM_OF_INTERMEDIATE_LAYERS-1):0][(SMALLER_WIDTH-1):0][(A_WIDTH+B_WIDTH-1):0] middle_layers;
reg signed [(A_WIDTH+B_WIDTH-1):0] middle_layers[NUM_OF_INTERMEDIATE_LAYERS:0][0:(SMALLER_WIDTH-1)];
//reg [(NUM_OF_INTERMEDIATE_LAYERS-1):0] middle_layers [0:(SMALLER_WIDTH-1)] [(A_WIDTH+B_WIDTH-1):0];
//reg middle_layers [(NUM_OF_INTERMEDIATE_LAYERS-1):0][0:(SMALLER_WIDTH-1)][(A_WIDTH+B_WIDTH-1):0];

generate // duplicates the leafs of the binary tree

    genvar layer; // layer 0 means the youngest leaf, layer N means the tree trunk

    for(layer=0; layer<=NUM_OF_INTERMEDIATE_LAYERS; layer=layer+1) begin: intermediate_layers

        integer pp_index; // leaf index within each layer of the tree

        always @(posedge clk)
        begin
            if(reset) 
            begin
                for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                    middle_layers[layer][pp_index] <= 0;
            end

            else begin      

                if(layer == 0)  // all partial products rows are in first layer
                begin               
                    // generation of partial products rows
                    for(pp_index=0; pp_index<SMALLER_WIDTH ; pp_index=pp_index+1)
                        middle_layers[layer][pp_index] <= MULTIPLIER[pp_index] ? MULTIPLICAND:0;    

                    // see modified baugh-wooley algorithm: https://i.imgur.com/VcgbY4g.png from
                    // page 122 of book: Ultra-Low-Voltage Design of Energy-Efficient Digital Circuits
                    for(pp_index=0; pp_index<(SMALLER_WIDTH-1) ; pp_index=pp_index+1) // MSB inversion
                        middle_layers[layer][pp_index][LARGER_WIDTH-1] <= 
                        (MULTIPLICAND[LARGER_WIDTH-1] & MULTIPLIER[pp_index]) ? 0:1;

                    for(pp_index=(LARGER_WIDTH-SMALLER_WIDTH); pp_index<(LARGER_WIDTH-1) ; pp_index=pp_index+1) // last partial product row inversion
                        //the starting index is to consider the condition where A_WIDTH != B_WIDTH
                        middle_layers[layer][SMALLER_WIDTH-1][pp_index] <= 
                        (MULTIPLICAND[pp_index] & MULTIPLIER[SMALLER_WIDTH-1]) ? 0:1;

                    middle_layers[layer][0][LARGER_WIDTH] <= 1;
                    middle_layers[layer][SMALLER_WIDTH-1][LARGER_WIDTH] <= 1;
                end

                // adding the partial product rows according to row adder tree architecture
                else begin
                    for(pp_index=0; pp_index<(SMALLER_WIDTH >> layer) ; pp_index=pp_index+1)
                    begin
                        if(pp_index==0)
                            middle_layers[layer][pp_index] <=
                            middle_layers[layer-1][0] +
                            (middle_layers[layer-1][1] << layer);

                        else middle_layers[layer][pp_index] <=
                            middle_layers[layer-1][pp_index<<1] +
                            (middle_layers[layer-1][(pp_index<<1) + 1] << layer);
                    end
                end
            end
        end
    end

endgenerate


assign out_C = (reset)? 0 : mul_result;


// both A and B are of negative numbers
wire both_negative = MULTIPLICAND_reg[LARGER_WIDTH-1] & MULTIPLIER_reg[SMALLER_WIDTH-1]; 

/*
the following is to deal with the shortcomings of the published modified baugh-wooley algorithm
which does not handle the case where A_WIDTH != B_WIDTH

The countermeasure does not do "To build a 6x4 multplier you can build a 6x6 multiplier, but replicate 
the sign bit of the short word 3 times, and ignore the top 2 bits of the result." , instead it uses 
some smart tricks/logic described by the signal 'modify_result'. The signal 'modify_result' is not 
asserted when one number is positive, and another is negative.

Please use pencil and paper method (and signals waveform) to verify or understand this.
I did not do a rigorous math proof on this countermeasure.

Instead I modify the "modified baugh-wooley algorithm" by debugging wrong multiplication results from
formal verification cover(in_valid && (in_A == A_value) && (in_B == B_value)); waveforms 
together with manual handwritten multiplication on paper. 

The countermeasure is considered successful when assert(out_C == (MULTIPLICAND_reg * MULTIPLIER_reg)); 
passed during cover() verification

Besides, the last partial product row inversion mechanism is also modified to handle this shortcoming
*/

wire modify_result = (A_WIDTH == B_WIDTH) || ((A_WIDTH != B_WIDTH) && both_negative);

wire signed [(A_WIDTH+B_WIDTH-1):0] mul_result;

assign mul_result = (modify_result) ?
        middle_layers[NUM_OF_INTERMEDIATE_LAYERS][0] : 
       {{(LARGER_WIDTH-SMALLER_WIDTH){sign_bit}} ,
        middle_layers[NUM_OF_INTERMEDIATE_LAYERS][0][LARGER_WIDTH +: SMALLER_WIDTH] , 
        middle_layers[NUM_OF_INTERMEDIATE_LAYERS][0][0 +: SMALLER_WIDTH]} ;



/*Checking if the final multiplication result is ready or not*/

reg [($clog2(NUM_OF_INTERMEDIATE_LAYERS)-1):0] out_valid_counter; // to track the multiply stages
reg multiply_had_started;

always @(posedge clk)
begin
    if(reset) 
    begin
        multiply_had_started <= 0;
        out_valid <= 0;
        out_valid_counter <= 0;
    end

    else if(out_valid_counter == NUM_OF_INTERMEDIATE_LAYERS-1) begin
        multiply_had_started <= 0;
        out_valid <= 1;
        out_valid_counter <= 0;
    end

    else if(in_valid && !multiply_had_started) begin
        multiply_had_started <= 1;
        out_valid <= 0; // for consecutive multiplication
    end

    else begin
        out_valid <= 0;
        if(multiply_had_started) out_valid_counter <= out_valid_counter + 1;
    end
end


wire sign_bit = MULTIPLICAND_reg[LARGER_WIDTH-1] ^ MULTIPLIER_reg[SMALLER_WIDTH-1];

`ifdef FORMAL

initial assume(reset);
initial assume(in_valid == 0);


always @(posedge clk)
begin
    if(reset) assert(out_C == 0);

    else if(out_valid) begin
        assert(out_C == (MULTIPLICAND_reg * MULTIPLIER_reg));
        assert(out_C[A_WIDTH+B_WIDTH-1] == sign_bit);
    end
end

`endif

`ifdef FORMAL

wire signed [(A_WIDTH-1):0] A_value = $anyconst;
wire signed [(B_WIDTH-1):0] B_value = $anyconst;

always @(posedge clk)
begin
    assume(A_value != 0);
    assume(B_value != 0);
    cover(in_valid && (in_A == A_value) && (in_B == B_value));
    cover(out_valid);
end

`endif

endmodule

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

Algorithm

verilog

FPGA

Systemverilog

multiplication

修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法的相关文章

二进制字符串到十进制字符串

下午好如何将字符数多于语言最大整数类型中位数的二进制字符串转换为十进制字符串换句话说假设你有字符串 111001101001110100100 1001001111011100100 并且您不能先将其转换为整数那么您将如何以 10
Ruby - 乘法问题

我的输出是这样的 ruby 1 9 2 p290 011 gt 2 32 3 gt 6 959999999999999 我记得有一天在另一台机器上我得到了它就像 2 32 3 6 我的错误是什么非常感谢您阅读本文如果您确实想向下舍入为整
添加到数组连续数字

这是我向SO提出的第一个问题我希望能答对在 PHP 中如果你不会 Python 或伪语言也可以给定一个包含 n 个元素的数组 old array 1 2 3 5 7 8 9 20 21 23 29 我需要向新数组添加连续数字如果不
给定两个（大）点集，我如何有效地找到彼此最接近的点对？

我需要解决一个计算问题该问题归结为搜索两个集合之间最接近的点对问题是这样的给定欧几里德空间中的一组点 A 和一组点 B 找到所有对 a b 使得 b 是 B 中与 a 最近的点 a 是 A 中与 b 最近的点集合 A 和 B 的大小
当目标是查找某个字符串的所有出现情况时，KMP 最坏情况的复杂度是多少？

我还想知道哪种算法在查找另一个字符串中所有出现的字符串时具有最坏情况的复杂性博耶摩尔算法似乎具有线性时间复杂度 KMP 算法在查找字符串中所有出现的模式时具有线性复杂度如 Boyer Moore 算法1 如果您尝试在 aaaaaaaa
检查数独字段的很酷的算法？

有谁知道一个简单的算法来检查数独配置是否有效我想出的最简单的算法是对于大小为 n 的板伪代码 for each row for each number k in 1 n if k is not in the row using ano
在无向图中查找强连通分量

我想在无向图中找到强连接的组件即如果我从节点开始A然后我会回到节点A并且每条边都被恰好访问一次对于有向图可以使用Tarjan算法来寻找强连通分量但是对于无向图怎么办我认为您错过了强连通分量的含义强连接组件如果所有顶点对之间都存在
加密成本高，解密成本低

我希望该用户攻击者加密数据并发送给服务器现在我想要一种与标准算法完全相反的算法使用快难以解密即很难使用服务器发送的密钥来加密密码等数据以防止随机攻击但很容易解密这样服务器在验证用户时消耗的时间非常少但是对于攻击者来说每次使
Python 给定 k 个分区的整数分区

我正在尝试寻找或开发Python 的整数分区代码仅供参考整数分区将给定整数 n 表示为小于 n 的整数之和例如整数5可以表示为4 1 3 2 3 1 1 2 2 1 2 1 1 1 1 1 1 1 1 我为此找到了许多解决方案 ht
线性代数如何在算法中使用？

我的几个同行都提到学习算法时线性代数非常重要我研究了各种算法并学习了一些线性代数课程但我没有看到其中的联系那么线性代数如何应用在算法中呢例如图的连接矩阵可以带来哪些有趣的事情三个具体例子线性代数是现代 3D 图形的基础
如何从一组重叠的圆计算多边形集？

这个问题是一些计算细节的扩展这个问题 https stackoverflow com questions 1667310 combined area of overlapping circles 假设有一组可能重叠的圆并且希望计算这组
寻找距离原点最近的 100 颗恒星的算法

首先让我提出正确的问题问有一个文件包含超过一百万个点 x y 每个点代表一颗星星 a b 处有一颗行星地球现在任务是构建一种算法返回距离地球最近的 100 颗恒星您的算法的时间和空间复杂度是多少这个问题在各种采访中被问过很多次
固定大小集以包含给定集的最大数量

我有大约 1000 组尺寸 1 4 1 3 3 5 6 4 5 6 7 5 25 42 67 100 是否有可能找到包含最大数量的给定集合的大小为 20 的集合检查每一个100 80 20 集效率低下我不太确定这是 NP 完全的考虑
快速搜索压缩文本文件

我需要能够在大量压缩文件 txt 中搜索文本压缩可能会改变为其他东西甚至成为专有的我想避免解压所有文件并压缩编码搜索字符串并在压缩文件中搜索这应该可以通过对所有文件使用相同的码本使用霍夫曼压缩来实现我不想重新发明轮子所以任
如何确定算法函数的复杂度？

您如何知道算法函数对于特定操作是否需要线性常数对数时间它取决于CPU周期吗您可以通过三种方式至少做到这一点在网上查找算法看看它是如何描述其时间复杂度的根据输入大小自己检查算法查看嵌套循环和递归条件等内容以及每个循环运
学习 Verilog 的资源 [关闭]

Closed 这个问题正在寻求书籍工具软件库等的推荐不满足堆栈溢出指南 help closed questions 目前不接受答案我是 Verilog 新手有人可以推荐学习资源书籍视频博客或任何他们有良好个人经验并帮助他们更
解释一下从 N 个给定集合中的每一个中给出第 K 个最大数字的示例？

今天我尝试解决一个Facebook 编程挑战赛 https facebook interviewstreet com recruit challenges 我遇到的挑战是酒吧问题可以找到here https github com alo
如何求小于给定数的最大2次方

我需要找到小于给定数字的最大 2 次幂我陷入困境找不到任何解决方案 Code public class MathPow public int largestPowerOf2 int n int res 2 while res lt n
由周期表元素形成的最大单词的算法

我想为以下问题场景编写一个算法根据元素周期表元素的名称找到可以组成的最大单词符号如Na Ne等应被视为单个元素这是在一家知名公司的求职面试中被问到的有人可以帮我解决这个问题吗我认为更好的方法是检查字典中的每个单词看看是否可以从
数组中连续元素的最大乘积

我在现场面试的时候被问到了这个算法问题由于没有要求我签署保密协议我将其发布在这里寻求答案给定一个数组REAL不包含 0 的数字找到产生最大乘积的连续元素该算法应在线性时间内运行我考虑过以下方法使用两个数组第一个是利用DP思想

随机推荐

Docker-compose 在已安装的卷上设置用户和组

我正在尝试将 docker compose 中的卷安装到 apache 映像问题是我的 docker 中的 apache 运行在www data www data但挂载的目录是在下面创建的root root 如何指定挂载目录的用户我尝
如何在svn中以vimdiff风格查看svn diff

我开始在 Linux 上使用 Subversion svn diff给人一种非常神秘的感觉对眼睛来说非常非常不友好我如何解释它的输出更重要的是有没有一种方法可以查看 vimdiff 那种简洁风格的差异两个文件将并排打开编辑文件
如何从mysql表中删除重复行

我需要从mysql表中删除重复记录所以我有一个表名 employee 字段是 empid empname empssn 为了获取重复记录我编写了一个查询 SELECT COUNT empssn empssn FROM employee
clang：错误：-Z-reserved-lib-stdc++：更新到 xCode 4.4 后存在“-c”时，“链接器”输入未使用[关闭]

Closed 这个问题是无法重现或由拼写错误引起 help closed questions 目前不接受答案我刚刚将 xcode 更新到 4 4 版本我当前正在开发的项目正在抱怨 clang error Z reserved lib s
通过视图模型将滚动查看器滚动到顶部

我将 ScrollViewer 与 MVVM 模式一起使用并且 ScrollViewer 包装了一个项目列表例如
.net 如何将字符串生成器与字符串进行比较

在 vb net 或 C 中我无法弄清楚如何将字符串生成器与字符串进行比较我已经搜索了很多但找不到答案我必须编写自己的例程难道就没有更好的办法吗这不起作用 Dim s As String abc Dim sb As New St
使用 SNOW 在 R 2.11 Windows 64 位中进行并行处理不太有效

我在具有 8 个处理器的 WinXP64 计算机上运行 R 2 11 64 位在 R 2 10 1 中以下代码生成了 6 个 R 进程以进行并行处理 require foreach require doSNOW cl makeClust
为什么32位机器上的mongodb数据大小限制为2GB，而不是4GB？

as here http docs mongodb org manual faq fundamentals faq 32 bit limitations值得一提的是在具有单个 mongod 实例的 32 位机器中 mongodb 的数据大
为什么 std::function 的初始化器必须是可复制构造的？

根据http en cppreference com w cpp utility function function function http en cppreference com w cpp utility functional fu
任何 Javascript DateTime 函数都会返回 MySQL 日期时间格式吗？

我有一个函数可以将 ajax 请求发送到带有参数的服务器文件问题是我想发送当前的日期时间值以与数据库进行比较作为 MySQL 日期时间格式所有日期时间均显示为 2012 02 03 19 50 28 格式我如何在 Javascri
Rpy2导入外部R文件

我想执行 R 文件中的 R 脚本我怎样才能加载它我想source path to file R 在 python 代码中但是使用这个名称只需导入 Rsource功能和其他一样 import rpy2 robjects as robj
将 Angular 2 应用程序部署到 AWS Elastic Beanstalk 中

我现在正在使用 Angular 2 开发一个前端应用程序该 id 用于与部署在云中的另一个微服务进行通信在此 Angular 2 应用程序中我没有使用数据库交互我需要在 elastic beanstalk AWS PaaS 中部署这
遍历继承层次结构的 static_cast 与dynamic_cast

我看到一本关于 C 的书提到使用静态强制转换来导航继承层次结构比使用动态强制转换更有效 Example include
使用 ctree 显示每个节点的体积，在 R 中绘图

任何人都可以告诉我如何在每个节点中添加体积而不是最终的节点体积 t lt ctree is return a b c plot t type simple 我的树看起来像我如何修改该图使其在每个圆形节点上显示 N 而不仅仅是黑色或最终
为什么 LEFT JOIN 与 WHERE 子句一起使用时不返回 NULL 记录？ [复制]

这个问题在这里已经有答案了今天我尝试了一些更复杂的 MySQL 查询我注意到 MySQL 的 LEFT JOIN 不适用于 WHERE 子句我的意思是它确实返回一些记录但不会返回右侧为空的记录例如假设我们有表 albums a
了解：从 PHP 数组到 Python？

这是 PHP 和其他编程语言中的常见任务我从 PHP 开发人员转过来我想通过这个收藏来确定一下任何精通Python的人请帮助我清楚地理解这是我从 PHP 代码中收集的 gt Python 里有什么 gt Python 里有什么
Angular 2 测试 - process.env

我尝试在应用程序中模拟请求但流程变量存在问题我将后端 API 的 URL 存储在 process env backend Url 中然后在休息服务中我有 constructor private http Http this rest
将 char* 复制到 char*

这是我的代码的一部分 extern C REGISTRATION API int extreme char lKey string s1 char p NULL try ifstream myfile extreme txt int i 0
Android在没有edittext视图的情况下捕获硬件键盘事件

如何在不使用 EditText 字段的情况下捕获硬件键盘事件例如在一个简单的活动中在屏幕上显示一个正方形当在滑动键盘上按下 B 时我想将其变为蓝色当按下 G 时将其变为绿色等等我不需要颜色代码的帮助只需要如何拦截按键这
修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法

以下 verilog 源代码和或测试平台可以很好地工作商业模拟器 iverilog https www edaplayground com x 3TuQ也形式化验证工具 yosys smtbmc https gist github com

修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法

修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法 的相关文章

随机推荐

热门标签

修改后的 baugh-wooley 算法乘法 verilog 代码不能正确乘法的相关文章