GENSCAN 1.0 Date run: 4-Nov-116 Time: 20:34:33 Sequence gi568815589f:130593792_130803771 : 209980 bp : 47.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1692 1797 106 2 1 79 107 81 0.867 8.89 1.02 Intr + 15703 15858 156 2 0 54 75 55 0.524 0.78 1.03 Intr + 16163 16196 34 0 1 93 121 7 0.878 1.78 1.04 Intr + 22564 22726 163 1 1 78 89 221 0.977 21.18 1.05 Intr + 24006 24104 99 2 0 103 91 98 0.975 11.91 1.06 Intr + 26563 26667 105 0 0 79 69 72 0.885 4.81 1.07 Intr + 28917 29019 103 2 1 83 94 56 0.915 5.45 1.08 Intr + 29820 29920 101 1 2 76 94 75 0.711 6.73 1.09 Intr + 32573 32714 142 1 1 61 116 164 0.998 16.43 1.10 Intr + 33749 33859 111 0 0 76 105 4 0.648 1.25 1.11 Intr + 36837 36997 161 1 2 113 50 189 0.712 17.31 1.12 Intr + 37766 37839 74 1 2 81 87 11 0.353 -1.50 1.13 Intr + 38151 38231 81 0 0 73 92 24 0.196 0.05 1.14 Intr + 38411 38487 77 2 2 73 108 44 0.442 4.06 1.15 Intr + 40876 40947 72 2 0 110 70 70 0.855 6.78 1.16 Intr + 41618 41722 105 0 0 -3 59 157 0.728 3.89 1.17 Intr + 42208 42335 128 2 2 118 62 73 0.948 8.10 1.18 Intr + 58046 58247 202 1 1 -8 87 249 0.831 14.06 1.19 Term + 58333 58592 260 2 2 23 42 404 0.999 25.01 1.20 PlyA + 58659 58664 6 1.05 2.00 Prom + 60301 60340 40 -3.66 2.01 Init + 61036 61105 70 0 1 62 82 9 0.657 -1.04 2.02 Intr + 68052 68270 219 1 0 37 55 164 0.839 6.27 2.03 Intr + 68611 68753 143 2 2 69 69 123 0.904 8.57 2.04 Intr + 70770 71085 316 2 1 124 101 432 0.509 43.74 2.05 Intr + 72817 73007 191 2 2 83 85 344 0.941 32.90 2.06 Intr + 74367 74522 156 2 0 82 85 392 0.978 38.51 2.07 Intr + 84738 84849 112 2 1 162 78 114 0.993 17.75 2.08 Term + 87457 87878 422 2 2 53 53 875 0.971 75.95 2.09 PlyA + 88979 88984 6 1.05 3.00 Prom + 97672 97711 40 -4.56 3.01 Init + 97824 97910 87 2 0 59 98 50 0.864 2.15 3.02 Term + 98685 98924 240 2 0 63 42 197 0.962 8.63 3.03 PlyA + 98955 98960 6 -3.44 4.00 Prom + 99162 99201 40 -3.46 4.01 Init + 100001 100122 122 1 2 115 88 99 0.912 12.27 4.02 Intr + 101701 101802 102 1 0 47 94 104 0.829 6.19 4.03 Intr + 103791 103836 46 0 1 114 82 60 0.997 6.41 4.04 Intr + 104371 104460 90 0 0 11 95 76 0.626 0.79 4.05 Intr + 108343 108519 177 0 0 48 83 103 0.513 5.92 4.06 Intr + 109262 109390 129 1 0 96 78 145 0.999 15.19 4.07 Term + 109903 109983 81 0 0 104 49 85 0.997 3.89 4.08 PlyA + 110439 110444 6 1.05 5.04 PlyA - 113149 113144 6 1.05 5.03 Term - 117752 117713 40 1 1 133 42 -8 0.332 -4.24 5.02 Intr - 119412 119057 356 0 2 69 113 173 0.113 11.99 5.01 Init - 126129 126004 126 0 0 63 -8 135 0.102 1.56 5.00 Prom - 127137 127098 40 -4.06 6.02 PlyA - 127271 127266 6 1.05 6.01 Sngl - 131259 131047 213 0 0 63 43 180 0.167 6.18 6.00 Prom - 131993 131954 40 -3.26 7.00 Prom + 133348 133387 40 -5.16 7.01 Init + 146459 146533 75 1 0 80 94 -6 0.401 0.29 7.02 Intr + 157780 157882 103 0 1 35 80 94 0.462 3.05 7.03 Term + 161982 162163 182 1 2 71 37 149 0.450 5.87 7.04 PlyA + 164075 164080 6 1.05 8.06 PlyA - 164951 164946 6 1.05 8.05 Term - 165663 165447 217 2 1 22 42 240 0.217 9.42 8.04 Intr - 172574 172403 172 0 1 114 76 59 0.094 6.30 8.03 Intr - 184959 184926 34 0 1 86 81 34 0.027 0.30 8.02 Intr - 186772 186652 121 0 1 -79 94 156 0.088 -0.00 8.01 Init - 205445 205384 62 2 2 76 88 23 0.122 1.84 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:130593792_130803771|GENSCAN_predicted_peptide_1|759_aa IAAKIDSIPHLNNSTPLVDPSVYGYGVQKRPLDDGGGAYVRVPHDYSCCGAVLNNASKEG KSSHLFLKEKEGSLSIFTLSLDPEQEGVGNQLGALVHQRQAKRLLGQIVDRCRNGPGFHN DIDSNSTIQEILIPASKVGLVIGRGGETIKQLQERTGVKMVMIQDGPLPTGADKPLRITG DAFKVQQAREMVLEIIREKDQADFRGVRGDFNSRMGGGSIEVSVPRFAVGIVIGRNGEMI KKIQNDAGVRIQFKPDDGISPERAAQVMGPPDRCQHAAHIISELILTAQERDGFGGLAAA RGRGRGRGDWSVGAPGGVQEITYTVPADKCGLVIGKDTVGLCSCKMQETVLSFRLNHSEL VWVRVHLPAGVRAGGENIKSINQQSGAHVELQRNPPPNSDPNLRRFTIRGVPQQIEVARQ LIDEKVGGTNLGAPGAFGQSPFSQPPAPPHQNTFPPRSSGCFPNMAAKVNGNPHSTPVSG PPAFLTQGWGSTYQAWQQPTQQVPSQQSQPQSSQPNYSKAWEDYYKKQRCFLLVGRDRVE GYQRSGPPGVGIIARFMALLGVGGHAASAAPQASSPPDYTMAWAEYYRQQVAFYGQTLGQ AQAHSQCPPLWREEGLVGPDETNEIPNANFRQQIRKLIKDGLILRHRKPVTVHSRAQCWK STLARRKGRHLGIESKKIDRHMYHSLYLKLKGNVFKHKRILTEHSHKLKADKARKKPLAD QAEARGSKTKEARKLREEHLQTKKEEIIKTLSQEEKAKK >gi568815589f:130593792_130803771|GENSCAN_predicted_CDS_1|2280_bp attgctgctaaaattgattcaattcctcacttgaataattccacacctctagtggacccc tcagtatatggatacggagtacaaaaacggcccttggatgatggaggaggagcatatgtg cgtgttcctcatgactacagctgctgtggagcagtccttaacaatgcatccaaggagggt aaatcttctcatttgtttttaaaagagaaagaaggatctctttccatctttacattgagt ttggatccagagcaggagggagtaggtaaccagttaggggccttggtacatcaaagacaa gccaaacggctcctgggacagattgtggaccgctgtcgaaatggacctggctttcataat gacatagacagcaacagcacaatccaggagattctcattcccgcatctaaagtgggtctg gtcatcggcagaggaggggaaacaatcaagcagttgcaggagcggacaggggtgaagatg gtcatgatccaggatggcccattgcccacgggagcagacaagcctcttcgtatcactgga gatgcatttaaagtacagcaagcaagagaaatggtactagagattatccgagaaaaagac caagctgactttcggggtgtacgcggcgatttcaactctcgaatgggaggaggcagtata gaggtatctgtgcctaggtttgctgtggggattgtaataggaagaaacggggaaatgatc aaaaagatccagaatgatgctggtgtgaggattcagtttaaaccagatgatgggattagt ccagaaagagctgcccaggtcatgggccctccggatcggtgtcagcatgcagcgcatatc atcagcgagctgattcttacagcccaggaaagagacggctttggaggcctggcagcagcc agaggaagaggtcgtggccgtggcgactggagcgtgggagcccctggtggcgtccaggag ataacatacacggtgccagccgataagtgtggcctcgtcataggcaaagacactgtgggc ttatgctcttgcaaaatgcaagaaaccgtgttgtcttttagactgaaccactcagagttg gtatgggttcgtgttcatttgccggcaggtgtgcgggctgggggtgagaacatcaaaagc atcaaccagcagtcaggggcgcacgtggagcttcagaggaacccccctcccaacagcgac cccaacctgcggagattcaccatcaggggggttccccagcagatcgaggtggccaggcag ctcatagatgagaaagttggcgggaccaatctcggagcacctggagccttcggacagagt ccattcagccagccacctgccccacctcatcaaaatacctttcctccaaggagctccggg tgcttcccaaacatggctgccaaggtgaatgggaacccccacagcacccctgtgagtggt cctccggcctttctgacccagggctggggcagcacctaccaggcgtggcagcagcccaca cagcaggtcccaagccagcagagccagccgcagagcagccagcccaactacagcaaggcc tgggaagactattacaaaaaacagagatgtttcttgctggttggaagagatcgggttgaa ggctaccagagaagtggtccccctggtgttggcatcatcgccaggttcatggcccttcta ggggttggaggtcacgccgccagcgctgctcctcaggccagctccccaccggactacaca atggcctgggcagaatattacagacagcaggtcgctttctacggacagacgttagggcag gcgcaggcccacagccagtgtcctccactgtggcgagaagaaggtctggttggacccgat gagaccaatgaaatccccaatgccaacttccgtcagcagatccggaagctgatcaaagat gggctgatcttgcgtcaccgcaagcctgtgacggtccattcccgggctcaatgctggaaa agcaccttggcccgccggaagggcaggcacttgggcatagaatcgaagaagattgatcgc cacatgtatcacagcctgtacctgaagttgaaggggaatgtgttcaaacacaagcggatt ctcacggaacacagccacaagctgaaggcagacaaagcccgcaagaagcccctggctgac caggctgaggcccgcgggtctaagaccaaggaagcacgcaagctccgtgaagagcacctc cagaccaagaaggaagagatcatcaagactttgtcccaggaggaaaaggccaagaaataa >gi568815589f:130593792_130803771|GENSCAN_predicted_peptide_2|542_aa MVWALSTGNNTSWPEPSWFFFGLVLLSGLAGPEERNGTPWPQSEPVVTAPAESKPLTLSR GSGPKRCTPARRMHRPEGFPQWDVALERAKSSGLSWDSSSGQGADATHKLRRKMSGGFGD PNPPPSSAALPSAGALPNGPAFCQRSPLAPWAPLSPAHLPRRPGRPPAPGSSGPPMMGSV LPAEALVLKTGLKAPGLALAEVITSDILHSFLYGRWRNVLGEQLFEDKSHHASPKTAFTA EVLAQSFSGEVQKLSSLVLPAEVIIAQSSIPGEGLGIFSKTWIKAGTEMGPFTGRVIAPE HVDICKNNNLMWEVFNEDGTVRYFIDASQEDHRSWMTYIKCARNEQEQNLEVVQIGTSIF YKAIEMIPPDQELLVWYGNSHNTFLGIPGVPGLEEDQKKNKHEDFHPADSAAGPAGRMRC VICHRGFNSRSNLRSHMRIHTLDKPFVCRFCNRRFSQSSTLRNHVRLHTGERPYKCQVCQ SAYSQLAGLRAHQKSARHRPPSTALQAHSPALPAPHAHAPALAAAAAAAAAAAAHHLPAM VL >gi568815589f:130593792_130803771|GENSCAN_predicted_CDS_2|1629_bp atggtgtgggcactgtccacaggaaacaacacctcatggccagaacccagctggttcttc tttggcctcgtactgctctccgggcttgcgggtccggaggagaggaatgggactccctgg ccgcagtccgagcccgtggtcactgcgccagcggaatccaagcctctaactctaagccgg ggttcaggacccaaacgctgcaccccagcacggaggatgcacagacctgagggcttcccg cagtgggacgtggccctggagagggcaaaatcctcggggttgtcgtgggactcgagctct ggccagggagccgatgcgacccacaagctgcgtcgaaaaatgagtggagggttcggcgac cctaacccacctccctcctcagcagcgctgccctctgctggcgccctgcccaacggtcct gccttctgccagcgctctcctctcgccccttgggctcccctctcgcccgcccacctcccc cgtcggcccggccgtcccccggcgccggggagctccgggccgcccatgatgggctccgtg ctcccggctgaggccctggtgctcaagaccgggctgaaggcgccgggactggcgctggcc gaggttatcacctccgacatcctgcacagcttcctgtacggccgctggcgcaacgtgctc ggggagcagctcttcgaggacaagagccaccacgccagccccaagacagccttcaccgcc gaggtgctggcgcagtccttctccggcgaagtgcagaagctgtccagcctggtgctgcct gcggaggtgatcatcgctcagagctccatccctggcgagggcctcggcatcttctccaag acgtggatcaaggcgggaaccgagatgggccccttcaccggccgcgtgatcgccccggag cacgtggacatctgcaagaacaacaacctcatgtgggaggtgttcaatgaggatggcacg gtgcgctacttcatcgatgccagccaggaggaccaccggagctggatgacctacatcaag tgtgcacgtaacgaacaggagcagaacctggaggtggtccagatcggcaccagcatcttc tacaaggccattgagatgatcccacctgaccaggaactgctggtgtggtacggaaactca cacaacaccttcctggggatcccaggtgtgcccgggctagaggaggaccagaaaaagaac aagcatgaggacttccacccggcggactcggcggctggccccgcgggccgcatgcgatgc gtcatctgccaccgcggcttcaactcgcgcagcaacctgcgctcgcacatgcgcatccac acgctggacaagcccttcgtgtgccgcttctgcaaccgccgcttcagccagtcgtccacg ctgcgcaaccacgtgcgcctgcacacgggcgagcgcccctacaagtgccaggtgtgccag agcgcctactcgcagctggccggcctgcgcgcccaccagaagagcgcgcggcaccggccg cccagcaccgcgctgcaggcacactcgcccgcgctgcccgccccgcacgcgcacgcgccc gcgctcgccgccgccgccgccgccgccgccgccgccgccgcgcaccacctgccggccatg gtgctgtga >gi568815589f:130593792_130803771|GENSCAN_predicted_peptide_3|108_aa MPKESILFTSPLGCAASSPGTHLSTGRKGGRSLGLAPRLPGYASSEDTIALLWPRQDWTE GLGSSAFLQARQRARVWQPAYPGPRGTSIHVIDCLAGLAFLSVVVRLA >gi568815589f:130593792_130803771|GENSCAN_predicted_CDS_3|327_bp atgcctaaggaatctatactttttacaagccccctgggatgtgctgcttcgtctccgggt acccacttgtctactggtcgtaaagggggccgctcgctgggcctggctccccggctgccg ggctacgcgagctcagaggacaccatcgccctcttatggccgcggcaggactggaccgag gggctgggctctagtgccttcctccaggcacggcagagggcgagggtctggcagcccgct tacccagggcctcggggcactagcattcatgtcatcgactgcctggcagggctggctttc ttaagtgtggtcgtgagacttgcttaa >gi568815589f:130593792_130803771|GENSCAN_predicted_peptide_4|248_aa MAMEMRLPVARKPLSERLGRDTKKHLVVPGDTITTDTGFMRGHGTYMGEEKLIASVAGSV ERVNKLICVKALKTRYIGEVGDIVVGRITEVQQKRWKVETNSRLDSVLLLSSMNLPGGEL LGQGVLVQVSPSLVKRQKTHFHDLPCGASVILGNNGFIWIYPTPEHKEEEAGGFIANLEP VSLADREVISRLRNCIISLVTQRMMLYDTSILYCYEASLPHQIKDILKPEIMEEIVMETR QRLLEQEG >gi568815589f:130593792_130803771|GENSCAN_predicted_CDS_4|747_bp atggcgatggagatgaggcttccagtggctcgcaagcctcttagcgagagactgggccgc gacactaagaaacatctagtggtgccgggggatacaatcactacggacacaggattcatg cggggccatggaacgtatatgggagaagagaagctcattgcatctgttgctggctctgtg gagagagtaaacaagttgatctgtgtgaaagctttgaaaaccagatacattggtgaagta ggagacatcgtagtgggacgaatcacagaggttcaacagaagaggtggaaggtggagacc aactccaggctggattcggtcttgctgctctcgtccatgaaccttcctggaggagagctg ctaggtcagggggttttggtccaggtttccccctccctggtgaaacggcagaagacccac tttcatgatttgccatgtggtgcctcagtgattctcggtaacaacggcttcatctggatt tacccaacacctgagcacaaagaagaggaagcagggggcttcattgcaaacctggagcct gtctctcttgctgatcgagaggtgatatcccggcttcggaactgcatcatctcgctggta actcagaggatgatgctgtatgataccagcatcctgtactgctatgaagcatcccttcca catcagatcaaagacatcttaaagccagaaataatggaggagattgtgatggaaacacgc cagaggcttttggaacaggagggataa >gi568815589f:130593792_130803771|GENSCAN_predicted_peptide_5|173_aa MTSPLTLNDSGKKLPSAHSFRFTINNNTNAILGFQIHVCSNTAVTSPELRRGTQPPPAGR GRGQRRGSSAAGPGNRLRLQRCSGTDTKPRGRHDGLRAAPQRGQAPPSAPPTATSGARLE GTRMPQEPLSRDTAAPPGSGPTACESPARHDLLNVASLSARWSFALVAQAAMQ >gi568815589f:130593792_130803771|GENSCAN_predicted_CDS_5|522_bp atgactagccctttgaccctgaatgattctggcaaaaagttgccatctgcccactctttc agattcaccatcaacaacaacaccaacgctattcttggcttccagatccatgtctgctcc aacaccgcagtcacatctcccgagctccgccgagggacccagccgccccccgccggtagg ggccgtggacagcggaggggctcctcagccgcaggacccgggaaccggctccggctccag cgctgctcaggcacagacaccaaaccccgcggccgccatgatggattacgagccgctcca caacgaggccaggccccgccctcggccccgcccacggcgacgtccggagcccgcctcgag gggacccggatgccccaagagccgctttcccgggacaccgccgcccctcccggctctggt cccactgcctgcgaaagcccggcccggcacgaccttttaaatgtggcttctctgagcgca agatggagttttgctcttgtcgcccaggctgcgatgcagtaa >gi568815589f:130593792_130803771|GENSCAN_predicted_peptide_6|70_aa MTKGTLSFEKRCNMMHTLCGHCGSKACHLPKSTCGKCGYPAKRKRKYDWCAKAKRLKATG TGRMKRRGFP >gi568815589f:130593792_130803771|GENSCAN_predicted_CDS_6|213_bp atgacaaagggaacgttatcgtttgaaaagcgttgcaatatgatgcacacgttgtgcggc cactgtggctctaaggcctgtcaccttccgaagtctacctgtggcaaatgtggctacccc gccaagcgcaagaggaagtatgactggtgtgccaaggctaaaagactaaaggccacaggg actgggcgaatgaagcggcgtggatttccataa >gi568815589f:130593792_130803771|GENSCAN_predicted_peptide_7|119_aa MNHRDGSEGQSTDGPSSGLEVPWMLTWATEEAAASSGIGKVSAVESATIKALLLLAPEPG VGGILAVCRSSWPPEIRAVTFHLQMAVLAARLVAEEDEDTAQGLSFKGFRVQQQHSVIS >gi568815589f:130593792_130803771|GENSCAN_predicted_CDS_7|360_bp atgaaccacagggatggctcagaaggtcagagcactgatggaccctcaagtgggttggaa gtcccttggatgctaacctgggccactgaggaagctgcagcctcttcaggaattgggaag gtgtcagctgtagaatctgccacaatcaaagccttactgctcctggctccagagccaggg gttggtggaatcctggcagtctgccgctcttcctggcctccagagatcagggctgtgact ttccatctgcagatggcagttctagctgcgaggttggtggccgaggaggacgaggacaca gctcagggcctgtccttcaagggcttccgtgtgcagcagcagcactctgtaatatcatag >gi568815589f:130593792_130803771|GENSCAN_predicted_peptide_8|201_aa MEHLRWVQTPPRGQTLLQILANSRYEYRYEVSTIQKRKLQEFSSGDWSDPDSSKISPNIT KPRSKKGINHDNGPYAEVCLRSWISSWAYSKASVLTVRHLSCPARPAFQAPVIRMPLSRL AGALGAEASWSKWIVKHWRQLATSTTHLAKELLMKRTVQWWFKKFYKGDESLEDEECNDR PSEVDNDQLGDQSQKLILLQP >gi568815589f:130593792_130803771|GENSCAN_predicted_CDS_8|606_bp atggagcatctccgatgggtgcaaacacctccaagaggccaaacgctgctccaaatactg gcaaatagcaggtacgagtacagatacgaggtgagcactatccaaaaacgaaagttgcaa gagttttcatctggagactggtcagatccagacagctccaagatcagccccaatatcaca aagccaaggagtaaaaaaggtataaaccacgacaatgggccctacgcagaagtctgtctg cggtcatggatctcttcctgggcctacagtaaggcatctgtcctgacagtaaggcatctg tcctgcccagcaagacccgccttccaggctccagtcatccgcatgcccctgagcaggctg gcgggggcactgggggccgaggcttcttggtcaaaatggattgtaaagcactggagacaa ctcgcaacatcaacaacgcatttggccaaggagctgctaatgaaacgtacagtgcagtgg tggttcaagaagttttacaaaggagacgagagccttgaagatgaggagtgcaatgaccgg ccatcagaagttgacaacgaccagttgggagatcaatcacagaagctgatcctcttacaa ccataa