GENSCAN 1.0 Date run: 6-Nov-116 Time: 11:18:24 Sequence gi568815591r:75711799_75913715 : 201917 bp : 48.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.05 Intr - 27191 27003 189 2 0 20 94 174 0.036 10.98 1.04 Intr - 27755 27518 238 1 1 54 62 100 0.014 1.62 1.03 Intr - 46451 46371 81 1 0 35 105 59 0.062 1.05 1.02 Intr - 60205 60091 115 2 1 94 76 78 0.193 6.71 1.01 Init - 60378 60306 73 0 1 80 75 157 0.890 12.83 1.00 Prom - 63580 63541 40 -6.26 2.00 Prom + 64882 64921 40 -3.16 2.01 Init + 67600 67657 58 0 1 42 106 28 0.489 1.47 2.02 Intr + 71487 71621 135 1 0 101 80 30 0.565 4.04 2.03 Intr + 75380 75525 146 0 2 70 93 46 0.314 3.30 2.04 Intr + 81776 81901 126 1 0 50 113 11 0.059 0.68 2.05 Term + 89492 89635 144 1 0 42 42 85 0.143 -2.79 2.06 PlyA + 89685 89690 6 1.05 3.04 PlyA - 91977 91972 6 1.05 3.03 Term - 100166 99998 169 1 1 135 36 194 0.999 16.25 3.02 Intr - 101625 101508 118 1 1 114 45 50 0.995 2.82 3.01 Init - 101917 101845 73 1 1 78 94 49 0.927 4.89 3.00 Prom - 101995 101956 40 -5.76 4.03 PlyA - 102398 102393 6 1.05 4.02 Term - 108398 108210 189 2 0 -17 42 616 0.991 43.95 4.01 Init - 111660 111589 72 0 0 90 105 5 0.395 1.48 4.00 Prom - 112411 112372 40 -6.56 5.00 Prom + 115159 115198 40 -4.66 5.01 Init + 118029 118133 105 2 0 70 16 67 0.212 -2.06 5.02 Term + 123914 124135 222 1 0 38 42 373 0.942 24.62 5.03 PlyA + 125110 125115 6 1.05 6.03 PlyA - 125926 125921 6 1.05 6.02 Term - 126966 126842 125 1 2 32 44 136 0.436 2.15 6.01 Init - 129828 129810 19 0 1 72 96 7 0.380 0.40 6.00 Prom - 131793 131754 40 -3.36 7.00 Prom + 143820 143859 40 -2.66 7.01 Init + 167150 167462 313 1 1 63 105 346 0.947 29.29 7.02 Intr + 170031 170369 339 1 0 53 -14 436 0.681 25.35 7.03 Intr + 171876 172050 175 1 1 78 100 163 0.766 15.50 7.04 Intr + 175848 175929 82 0 1 83 41 56 0.503 0.04 7.05 Intr + 176708 176962 255 1 0 65 58 103 0.372 2.64 7.06 Intr + 179088 179203 116 2 2 82 96 28 0.479 2.25 7.07 Term + 184617 184821 205 0 1 60 48 100 0.328 0.04 7.08 PlyA + 186086 186091 6 1.05 8.00 Prom + 190922 190961 40 -3.36 8.01 Init + 195750 195833 84 2 0 41 94 66 0.969 3.32 8.02 Term + 197741 198127 387 1 0 3 42 329 0.974 14.74 8.03 PlyA + 198663 198668 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 27122 27003 120 2 0 103 94 156 0.890 17.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:75711799_75913715|GENSCAN_predicted_peptide_1|232_aa MMGLSLASAVLLASLLSLHLGTATRGSDISKTCCFQYSHKPLPWTWVRSYEFTSNSCSQR AVMPGVPNPWAMDQYWSICGLLGTGPHSRRAGQSGAPGSQARKELTGYSQAGWDSPGGGP TARPASSPPISTGRRGIAPCRSGQGSGRRLAEGSTLGARGDCGRGEERAGGAGGAVGIPG QPRAPDSAPRGDMDRMASSMKQVPNPLPKVLSRRGVGAGLEAAERESFERTQ >gi568815591r:75711799_75913715|GENSCAN_predicted_CDS_1|696_bp atgatgggcctctccttggcctctgctgtgctcctggcctccctcctgagtctccacctt ggaactgccacacgtgggagtgacatatccaagacctgctgcttccaatacagccacaag ccccttccctggacctgggtgcgaagctatgaattcaccagtaacagctgctcccagcgg gctgtgatgccaggggtccccaacccctgggccatggaccagtactggtccatctgtggc ctgttaggaactgggcctcacagcaggagggctggtcagtccggcgctcccgggtcccag gcccggaaggagctaacgggctattcgcaggcgggctgggattcccccgggggaggcccc actgcccggcccgcgtcatccccgcccatctccacgggccgtcgcgggatagccccctgc aggagcgggcagggtagtgggcggcgcttggcggagggcagcacgctcggggcgcgcggg gactgcggccgaggggaggagagggcgggcggggcgggcggcgccgtggggatcccgggg cagccgagggcccctgactcggctcctcgcggcgacatggatcggatggccagctccatg aagcaggtgcccaacccactgcccaaggtgctgagccggcgcggggtcggcgctgggctg gaggcggcggagcgcgagagcttcgagcggactcag >gi568815591r:75711799_75913715|GENSCAN_predicted_peptide_2|202_aa MTSGPQTDQPKEHLTNFKLDEQERVFSLAQSHTDNRRLHEPDLQEVIRAVPLGDPKWNYQ ADSPDVIPRFGLPTSIQSDNGLAFISQITQAVSQALGIQWKLRTPYHPQSSGKCWDYRHK SLHPAKCLLNSYSPSNVQGSGDTVVDRTPSLSLRKSSLWYKAPQSRDGDTIGDGKTSGAH QLGQPNKKRHGYHIISFPDFLS >gi568815591r:75711799_75913715|GENSCAN_predicted_CDS_2|609_bp atgacctcaggtcctcagaccgaccagcccaaggaacatctcaccaattttaaattggat gaacaggaaagagttttttctctagcccaatctcacactgataaccgccggcttcacgag ccagacctccaggaagttattagagcagttcccctaggagatccaaaatggaactatcag gctgattccccagacgtaattcctcggtttggccttcccacctctatacagtccgataat ggactggcctttattagtcaaatcacccaagcagtttctcaggctcttggtattcagtgg aaacttcgtaccccttaccatcctcaatcttcaggaaagtgctgggattacaggcataag tcactgcatccagccaaatgtttgctgaattcttacagccccagcaatgtccaaggctct ggagacaccgtggtggacaggacaccatccctgtccttaagaaagagtagcctttggtat aaggcaccccagtccagagatggagatactattggcgatgggaaaacatcaggagcacac caactggggcagcccaataaaaagaggcatggatatcacatcatcagcttccctgacttt ttatcataa >gi568815591r:75711799_75913715|GENSCAN_predicted_peptide_3|119_aa MAGLMTIVTSLLFLGVCAHHIIPTGSVVIPSPCCMFFVSKRIPENRVVSYQLSSRSTCLK AGVIFTTKKGQQFCGDPKQEWVQRYMKNLDAKQKKASPRARAVAVKGPVQRYPGNQTTC >gi568815591r:75711799_75913715|GENSCAN_predicted_CDS_3|360_bp atggcaggcctgatgaccatagtaaccagccttctgttccttggtgtctgtgcccaccac atcatccctacgggctctgtggtcatcccctctccctgctgcatgttctttgtttccaag agaattcctgagaaccgagtggtcagctaccagctgtccagcaggagcacatgcctcaag gcaggagtgatcttcaccaccaagaagggccagcagttctgtggcgaccccaagcaggag tgggtccagaggtacatgaagaacctggacgccaagcagaagaaggcttcccctagggcc agggcagtggctgtcaagggccctgtccagagatatcctggcaaccaaaccacctgctaa >gi568815591r:75711799_75913715|GENSCAN_predicted_peptide_4|86_aa MQGRRPSLPRCPSSRPILPAIPDLKEKEKEKEKEKEKEKEEEEEEEKKKKKKKKEEEEEE EEEEEEEEEEEEEEEEEEEEEEEVVY >gi568815591r:75711799_75913715|GENSCAN_predicted_CDS_4|261_bp atgcaggggaggcgcccttccctccccaggtgcccctcctctcgtcccattcttcctgcc atcccagatctgaaggagaaggagaaggagaaggagaaggagaaggagaaggagaaggag gaggaggaggaggaggagaagaagaagaagaagaagaagaaggaagaagaagaagaagag gaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaagaagtagtttactaa >gi568815591r:75711799_75913715|GENSCAN_predicted_peptide_5|108_aa MNSEMTAMPQPPNPPGNQLLFISSGHKGSLWTSWIVLKYQTLIFGNSAEFGTLKKAAVHY DGSGRSLGSADMRFERKAHALKAMKQYYGTPLAGRPVNIQLVTSQIDT >gi568815591r:75711799_75913715|GENSCAN_predicted_CDS_5|327_bp atgaacagtgaaatgacggcgatgccccagccacccaacccccctgggaatcagcttctg ttcatcagctcaggccacaaaggaagtctttggacaagctggatagttttgaagtatcag acactgatatttgggaactctgcagaatttggaacgctgaagaaggcggctgtgcactac gatggctctggccgcagcttaggatcagcagacatgcgctttgagcggaaggcacacgcc ctgaaggccatgaagcagtactacggcacccctctggctggccgccctgtgaacattcag cttgtcacatcacagattgatacataa >gi568815591r:75711799_75913715|GENSCAN_predicted_peptide_6|47_aa MTLHKAEELLFHGTGPFYIATNNVEYLQIGPLSDIHGKSIHRYTRLQ >gi568815591r:75711799_75913715|GENSCAN_predicted_CDS_6|144_bp atgaccctccacaaagcagaggaactgcttttccacggtactggaccattttacatcgcc accaacaatgtagagtacctgcagataggtcctttgtctgatattcacggcaagtcaata cacagatacaccaggttgcagtag >gi568815591r:75711799_75913715|GENSCAN_predicted_peptide_7|494_aa MGRGLWEAWPPAGSSAVAKGNCREEAEGAEDRQPASRRGAGTTAAMAASGPGCRSWCLCP EVPSATFFTALLSLLVSGPRLFLLQQPLAPSGLTLKSEALRNWQVYRLVTYIFVYENPIS LLCGAIIIWRFAGNFERTVGTVRHCFFTVIFAIFSAIIFLSFEAVSSLSKLGEVEDARGF TPVAFAMLGVTTVRSRMRRALVFGMVVPSVLVPWLLLVSLNTPTSDGLTYCYSIDLSERV ALKLDQTFPFSLMRRISVFKYVSGSSAERRAAQSRKIVEPATQALAAVTWSQRSIWFRSC CEQPCGVRCTGPAYRCRKRDVLPSDKVTWLFSNTDKAPRQGTRGRRGQQLAVAADVCPQL HHSSLWLCRARGFSVRVSMCLPFRSSQLTGQDPQRPEKLPDIISFPARLAEGSREADSCP ARSANLRMRKSEVGEKPRCALGTEASARRTIGMYGGFEDSRSDVAQGSAETRPFMSHGRD KHGDLPQPAVAWMK >gi568815591r:75711799_75913715|GENSCAN_predicted_CDS_7|1485_bp atggggcggggcctctgggaggcgtggcctccggccggctcctctgctgttgccaaggga aactgccgcgaggaggcggaaggagcagaggaccggcagccggcgtcgaggcggggcgcg ggaacgacggcggccatggcggcctcggggcccgggtgtcgcagctggtgcttgtgtccc gaggtgccatccgccaccttcttcactgcgctgctctcgctgctggtttccgggcctcgc ctgttcctgctgcagcagcccctggcgccctcgggcctcacgctgaagtccgaggccctt cgcaactggcaagtttacaggctggtaacctacatctttgtctacgagaatcccatctcc ctgctctgcggcgctatcatcatctggcgctttgctggcaatttcgagagaaccgtgggc accgtccgccactgcttcttcaccgtgatcttcgccatcttctccgctatcatcttcctg tcattcgaggctgtgtcatcactgtcaaagctgggggaagtggaggatgccagaggtttc accccagtggcctttgccatgctgggagtcaccaccgtccgttctcggatgaggcgggcc ctggtgtttggcatggttgtgccctcagtcctggttccgtggctcctgctggtttcactt aacacgccaacctcagatggcctcacctactgctattccatcgacctctcagagcgagtg gcactgaagctcgatcagaccttccccttcagcctgatgaggaggatatccgtgttcaag tacgtctcagggtcttcagccgagaggagggcagcccagagccggaaaattgtcgagcca gccacacaggcactggcagctgtgacctggtcccaacgctccatctggtttagaagctgc tgtgagcagccctgtggagtacggtgtactggcccagcttacagatgcagaaagcgagac gttctgccatcagataaagtcacgtggctctttagtaacacggacaaggctcctcgccaa ggaactcgtggcagaagagggcagcagttggcagtagctgccgatgtctgtccccagctc caccattcctccctgtggctgtgccgtgctcgtggtttcagtgtccgtgtgtccatgtgt ctgcccttcaggagctcgcagctgacagggcaggacccacagcggccagagaagcttcca gacattatcagtttccctgctcggttggcggagggcagcagagaagcagattcatgccct gctcgctctgcaaacctcagaatgagaaagagtgaggttggtgagaagccgcgctgtgca ctgggaacagaagccagtgccaggaggaccatcgggatgtacggaggctttgaggacagt agatcagatgttgcacagggcagtgctgagaccaggccctttatgtcccatggcagggac aagcatggagacctcccccaaccagctgtggcatggatgaaatga >gi568815591r:75711799_75913715|GENSCAN_predicted_peptide_8|156_aa MLGGPRGVGQQGDRKKVWGSYLDNEESQKRKEKKRRTRRRRRKKKEKKRKRKEEEEEEEE EEEEGEEGEVGEEREEGEEGKEEGEEKRKEEKKEKRKEEERRRKEEGRGGRGREEEGGGR RRREKKKEEEEKKVTGQNQPEKHVCTSMAVMTRRKC >gi568815591r:75711799_75913715|GENSCAN_predicted_CDS_8|471_bp atgttaggaggacctaggggtgtggggcagcaaggagaccggaagaaggtctggggaagt tatttggacaatgaagagagccagaaaagaaaagaaaagaaaagaagaacaaggaggagg aggagaaagaagaaagagaagaagaggaagaggaaggaagaagaagaggaggaagaggaa gaagaggaagagggagaggagggagaggtgggagaggaaagagaggaaggagaggaagga aaagaggaaggagaggagaagaggaaggaggagaagaaggagaaaagaaaagaagaagaa agaagaagaaaagaagaaggaagaggaggaagaggaagagaagaagaaggaggaggaaga agaagaagagagaagaagaaagaagaagaagaaaagaaagtcacaggtcaaaatcagcca gaaaaacatgtatgtaccagcatggctgtaatgaccagaagaaaatgctaa