GENSCAN 1.0	Date run: 5-Nov-116	Time: 09:57:24

Sequence gi568815586f:120878769_121101189 : 222421 bp : 46.79% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  24720  24930  211  1  1   77   56   127 0.096   6.37
 1.02 Intr +  25716  25942  227  0  2   30   21   192 0.142   4.33
 1.03 Term +  35730  36358  629  1  2    3   46   605 0.040  42.32
 1.04 PlyA +  36515  36520    6                               1.05

 2.00 Prom +  36577  36616   40                             -10.35
 2.01 Init +  37519  37567   49  0  1   39   55     4 0.371  -8.69
 2.02 Term +  37933  38330  398  2  2  -20   42   414 0.684  21.24
 2.03 PlyA +  38491  38496    6                               1.05

 3.06 PlyA -  38659  38654    6                               1.05
 3.05 Term -  52747  52643  105  1  0   77   43    52 0.061  -1.99
 3.04 Intr -  63174  62988  187  2  1   70   37   153 0.030   8.09
 3.03 Intr -  76796  76672  125  2  2   -2   63   122 0.029   0.08
 3.02 Intr -  90281  90180  102  2  0  103   45    43 0.415   1.87
 3.01 Init -  90893  90891    3  2  0  113   81     0 0.471   1.80
 3.00 Prom -  97549  97510   40                              -7.16

 4.00 Prom +  99207  99246   40                              -3.26
 4.01 Init + 100001 100326  326  1  2   83   64   486 0.830  40.50
 4.02 Intr + 100902 100989   88  0  1   69   55    69 0.612   1.67
 4.03 Intr + 108252 108310   59  2  2  132   48     9 0.282  -1.02
 4.04 Intr + 110065 110264  200  1  2   87   94   346 0.410  34.09
 4.05 Intr + 114752 114938  187  0  1  100   55   166 0.861  13.25
 4.06 Intr + 115396 115637  242  1  2   59  127   248 0.888  22.99
 4.07 Intr + 117494 117645  152  0  2   79  105   137 0.999  14.38
 4.08 Intr + 117773 117974  202  1  1   97  111   187 0.857  20.76
 4.09 Intr + 118706 118897  192  0  0  110  101   245 0.991  27.46
 4.10 Intr + 120500 120621  122  0  2   65  110   269 0.999  27.01
 4.11 Intr + 120715 120859  145  0  1   95   94   126 0.902  13.76
 4.12 Term + 122297 122424  128  0  2   63   41   162 0.992   7.44
 4.13 PlyA + 123938 123943    6                              -0.45

 5.07 PlyA - 124113 124108    6                              -0.45
 5.06 Term - 125724 125385  340  2  1   90   46   443 0.970  34.31
 5.05 Intr - 126325 126235   91  2  1   60   94    52 0.991   2.05
 5.04 Intr - 127626 127553   74  2  2   89  101    72 0.998   7.65
 5.03 Intr - 132158 132060   99  1  0   58   97   190 0.997  16.13
 5.02 Intr - 132378 132336   43  1  1  119   79   -16 0.434  -2.10
 5.01 Init - 137706 137562  145  0  1   89   70   108 0.986   9.38
 5.00 Prom - 141289 141250   40                              -7.76

 6.07 PlyA - 141545 141540    6                               1.05
 6.06 Term - 142290 141793  498  0  0  133   29   657 0.987  58.82
 6.05 Intr - 145369 145222  148  0  1  106   90   197 0.596  21.94
 6.04 Intr - 149049 148808  242  0  2   90   69   268 0.996  21.45
 6.03 Intr - 152849 152674  176  0  2  105   94   216 0.999  23.56
 6.02 Intr - 154975 154693  283  1  1  106   82   278 0.999  25.99
 6.01 Init - 160203 160006  198  0  0   78  116   199 0.932  20.19
 6.00 Prom - 168184 168145   40                              -5.46

 7.06 PlyA - 169048 169043    6                               1.05
 7.05 Term - 169801 169617  185  2  2   54   43   105 0.357   0.31
 7.04 Intr - 170091 170057   35  2  2   87   36    47 0.110  -2.73
 7.03 Intr - 175053 174974   80  0  2   98   52    55 0.502   1.25
 7.02 Intr - 175277 175086  192  2  0   98   49   114 0.078   8.19
 7.01 Init - 188863 188756  108  1  0   92   31   127 0.283   7.62
 7.00 Prom - 194360 194321   40                              -2.86

 8.05 PlyA - 202631 202626    6                               1.05
 8.04 Term - 207747 207475  273  0  0   58   47   154 0.470   3.77
 8.03 Intr - 210444 210276  169  2  1   94   65    45 0.120   2.75
 8.02 Intr - 217138 217038  101  1  2   91   30    67 0.196   0.11
 8.01 Intr - 221961 221915   47  1  2  106   81    16 0.095   0.83

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  35745  36358  614  1  2   53   46   598 0.871  46.75
S.002 Term + 181729 181803   75  0  0  117   55    57 0.821   3.14

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815586f:120878769_121101189|GENSCAN_predicted_peptide_1|355_aa
XNTPTSQFRGACRIQLPIRGIPTPGVMLPFSITCSAHSQMTSAAASSSIPLLGRQVPGHS
PPSVRASRPLRRLLLAQPEQLGRLPLPGAAPLPTRRCRLGYPLPPLSEGLLHQPIGNPDA
RLLAATQLSTVSQQSAGHGPPNCQSTLLWVTGVTFNVTTIDTKRQTERVQKLCPGGQLPF
LLHGTEVHTDTNKMVEFLEAVLCPPRYPKLAALNPESNTAGLDIFAKFSAYIKNSNPALN
DNLQKGLLEALKVLDNYLTSPLPKEVDETSAEDEGISQRKFLNGNELTLADCNLLPKLHI
VQVVCKKYRGFNIPEAFPGTRQHLSNAYAWEEPVSTCPDDEEIQLAYEQGANALK

>gi568815586f:120878769_121101189|GENSCAN_predicted_CDS_1|1068_bp
ngaaacacccccacgagtcagttccggggtgcctgtcggattcaacttcccatccgtggg
attcccacccccggggtcatgctccccttctccatcacctgctccgcccatagccagatg
acatccgccgctgcctcctcctccattccccttctcggacgccaggtccccggccactca
ccgccctccgtgcgcgccagccgccccctccggcggctgctgctcgcacagccagagcag
ttgggccgcctcccgctgcctggtgccgcccccctccccacgcggagatgtcgcctcggg
taccctcttccgcccctctccgaggggctgctgcaccagccaatcgggaaccccgatgcg
cgtctgttggctgccactcagctgtcaaccgtcagtcaacagtctgctggccacgggccc
cccaactgtcagtcaaccttgctgtgggtcacaggagtcaccttcaacgttaccaccatt
gacaccaagagacagactgagagagtgcagaagctgtgcccaggagggcagctcccattc
ctgctgcatggcactgaagtgcacacagacaccaacaagatggtggaatttctggaggca
gtactgtgccctccgagataccccaagctggcagctctgaaccctgagtccaacacagct
gggctggacatatttgccaaattttctgcctacatcaagaattcaaacccagcactcaat
gataatctgcagaagggactcctggaagccctgaaggttttagacaattacttaacatcc
cccctcccaaaagaagtagatgaaaccagtgctgaagatgagggcatctcgcagaggaag
tttctgaatggcaatgagctcaccctggctgactgcaacctgttgccaaagctacacata
gtacaggtggtgtgtaagaagtaccggggatttaacatccctgaggccttcccgggaacg
cgtcagcacttgagcaatgcttatgcatgggaagaacccgtctccacctgcccagatgat
gaagagatccagctcgcctatgagcaaggggccaatgccctcaaataa

>gi568815586f:120878769_121101189|GENSCAN_predicted_peptide_2|148_aa
MPVIPALWEAEAGGSRVSRIRVHPTPAASTTPPKFNPNEIKVVYLRCTGGEVGATSALAP
KIGPRGLSPKKVGDDIAKATGDWKGLSITVKLTIQNRQAQIEVVPSASALIIKALKEPPR
VRKKQKNIKHSGSITFDEIVNIASQMQY

>gi568815586f:120878769_121101189|GENSCAN_predicted_CDS_2|447_bp
atgcctgtaatcccagccctttgggaggccgaggcaggtggatcacgagtctctcgaatc
cgggttcatccaacaccagccgcctccaccacgccgccaaagttcaaccccaatgagatc
aaagtcgtatacctgaggtgcaccggaggtgaagtcggtgccacttctgcactggccccc
aagatcggcccccggggtctgtctccaaaaaaggttggtgatgacattgccaaggcaaca
ggtgactggaagggcctgagtattacagtgaaactgaccattcagaacagacaagcccag
attgaggtggtgccttctgcctctgccctgatcatcaaagctctcaaggaaccaccaaga
gtcagaaagaaacagaaaaacattaaacacagtgggagtatcacttttgatgagattgtc
aacattgcttcacagatgcagtactga

>gi568815586f:120878769_121101189|GENSCAN_predicted_peptide_3|173_aa
MNIPLTFSPLCLCASCSLYLETSICQAAAVFKIYLSTASASVKADSPHGHKVAATTHSGT
FVSSGEKEELLTLISPEMRAKEFKMVEPYYAKEPNPSVHWKKTAQDFYLPGNICPESTSW
KKTAQDFYLPGNICIRLCGWQQLFLAKSLALVAAIQLPLVSANFLILFLLPIL

>gi568815586f:120878769_121101189|GENSCAN_predicted_CDS_3|522_bp
atgaacatacccctaactttctcacctctgtgcctttgtgcaagctgctccctctacctg
gaaacctccatttgtcaagctgctgctgtcttcaagatctacctgtctacagcatcagct
tctgtgaaggctgattcccctcatggtcacaaagtggctgccaccactcactcaggcacc
tttgtctcctctggggagaaggaggagctgctcaccctcatcagcccagagatgagggca
aaagagtttaagatggtggagccttattatgcaaaggaacccaatcccagcgtccactgg
aagaagactgcccaggacttctaccttcccgggaacatctgcccagagtccaccagctgg
aagaagactgcccaggacttctaccttcctgggaacatctgcatcagactctgtgggtgg
cagcagctgtttctggcaaaatccttagcgctggttgcagccattcagctccccttggtt
tctgccaatttcctgatcctatttctcttgccaattctctaa

>gi568815586f:120878769_121101189|GENSCAN_predicted_peptide_4|680_aa
MVSKLSQLQTELLAALLESGLSKEALIQALGEPGPYLLAGEGPLDKGESCGGGRGELAEL
PNGLGETRGSEDETDDDGEDFTPPILKELENLSPEEAAHQKAVVETLLQHPTSPAGAIRG
CPFYIPIRWRTPRLLENWGDPQQRHDSQVASGPFESLWEDPWRVAKMVKSYLQQHNIPQR
EVVDTTGLNQSHLSQHLNKGTPMKTQKRAALYTWYVRKQREVAQQFTHAGQGGLIEEPTG
DELPTKKGRRNRFKWGPASQQILFQAYERQKNPSKEERETLVEECNRAECIQRGVSPSQA
QGLGSNLVTEVRVYNWFANRRKEEAFRHKLAMDTYSGPPPGPGPGPALPAHSSPGLPPPA
LSPSKVHGVRYGQPATSETAEVPSSSGGPLVTVSTPLHQVSPTGLEPSHSLLSTEAKLVS
AAGGPLPPVSTLTALHSLEQTSPGLNQQPQNLIMASLPGVMTIGPGEPASLGPTFTNTGA
STLVIGLASTQAQSVPVINSMGSSLTTLQPVQFSQPLHPSYQQPLMPPVQSHVTQSPFMA
TMAQLQSPHALYSHKPEVAQYTHTGLLPQTMLITDTTNLSALASLTPTKQVFTSDTEASS
ESGLHTPASQATTLHVPSQDPASIQHLQPAHRLSASPTVSSSSLVLYQSSDSSNGQSHLL
PSNHSVIETFISTQMASSSQ

>gi568815586f:120878769_121101189|GENSCAN_predicted_CDS_4|2043_bp
atggtttctaaactgagccagctgcagacggagctcctggcggccctgctcgagtcaggg
ctgagcaaagaggcactgatccaggcactgggtgagccggggccctacctcctggctgga
gaaggccccctggacaagggggagtcctgcggcggcggtcgaggggagctggctgagctg
cccaatgggctgggggagactcggggctccgaggacgagacggacgacgatggggaagac
ttcacgccacccatcctcaaagagctggagaacctcagccctgaggaggcggcccaccag
aaagccgtggtggagacccttctgcagcaccccacctcaccagcaggcgccattagaggc
tgcccgttctacatccccatccgctggcggactccccgtctcctggagaactggggagac
ccacagcagagacatgactcacaggtggcatcaggtccctttgagtctctctgggaggac
ccgtggcgtgtggcgaagatggtcaagtcctacctgcagcagcacaacatcccacagcgg
gaggtggtcgataccactggcctcaaccagtcccacctgtcccaacacctcaacaagggc
actcccatgaagacgcagaagcgggccgccctgtacacctggtacgtccgcaagcagcga
gaggtggcgcagcagttcacccatgcagggcagggagggctgattgaagagcccacaggt
gatgagctaccaaccaagaaggggcggaggaaccgtttcaagtggggcccagcatcccag
cagatcctgttccaggcctatgagaggcagaagaaccctagcaaggaggagcgagagacg
ctagtggaggagtgcaatagggcggaatgcatccagagaggggtgtccccatcacaggca
caggggctgggctccaacctcgtcacggaggtgcgtgtctacaactggtttgccaaccgg
cgcaaagaagaagccttccggcacaagctggccatggacacgtacagcgggcccccccca
gggccaggcccgggacctgcgctgcccgctcacagctcccctggcctgcctccacctgcc
ctctcccccagtaaggtccacggtgtgcgctatggacagcctgcgaccagtgagactgca
gaagtaccctcaagcagcggcggtcccttagtgacagtgtctacacccctccaccaagtg
tcccccacgggcctggagcccagccacagcctgctgagtacagaagccaagctggtctca
gcagctgggggccccctcccccctgtcagcaccctgacagcactgcacagcttggagcag
acatccccaggcctcaaccagcagccccagaacctcatcatggcctcacttcctggggtc
atgaccatcgggcctggtgagcctgcctccctgggtcctacgttcaccaacacaggtgcc
tccaccctggtcatcggcctggcctccacgcaggcacagagtgtgccggtcatcaacagc
atgggcagcagcctgaccaccctgcagcccgtccagttctcccagccgctgcacccctcc
taccagcagccgctcatgccacctgtgcagagccatgtgacccagagccccttcatggcc
accatggctcagctgcagagcccccacgccctctacagccacaagcccgaggtggcccag
tacacccacacgggcctgctcccgcagactatgctcatcaccgacaccaccaacctgagc
gccctggccagcctcacgcccaccaagcaggtcttcacctcagacactgaggcctccagt
gagtccgggcttcacacgccggcatctcaggccaccaccctccacgtccccagccaggac
cctgccagcatccagcacctgcagccggcccaccggctcagcgccagccccacagtgtcc
tccagcagcctggtgctgtaccagagctcagactccagcaatggccagagccacctgctg
ccatccaaccacagcgtcatcgagaccttcatctccacccagatggcctcttcctcccag
taa

>gi568815586f:120878769_121101189|GENSCAN_predicted_peptide_5|263_aa
MAAPSGTVSDSESSNSSSDAEELERCREAAMPAWGLEQRPHVAGKPRAGAANSQLSTSQP
SLRHKVNEHEQDGNELQTTPEFRAHVAKKLGALLDSFITISEAAKEPAKAKVQKVALEDD
GFRLFFTSVPGGREKEESPQPRRKRQPSSSSSEDSDEEWRRCREAAVSASDILQESAIHS
PGTVEKEAKKKRKLKKKAKKVASVDSAVAATTPTSMATVQKQKSGELNGDQVSLGTKKKK
KAKKASETSPFPPAKSATAIPAN

>gi568815586f:120878769_121101189|GENSCAN_predicted_CDS_5|792_bp
atggcggcgcccagtggcacagtgagcgattcggaaagtagtaacagcagtagcgatgcg
gaggagctggagcggtgccgcgaggcggcaatgccggcttggggcttggagcaacgcccg
cacgtggcagggaagccaagagccggtgctgcaaatagccagttgtcaacctcccaaccg
agcctcaggcataaggtgaatgagcatgaacaagatggcaacgagcttcagaccacccct
gaattccgagcccacgtagccaagaagctgggagccctgctggacagcttcattaccatc
tcagaagcagcaaaggagccagcaaaagctaaggtacagaaagtcgctttggaggatgat
ggtttccgccttttcttcacatctgtccctggaggccgtgagaaggaagagtctccccaa
ccccgccgaaagcgacagccctccagctccagcagtgaggacagtgacgaggagtggcgg
cggtgccgggaggcagctgtgtcggcgtccgacatcctacaggagtcagccatccacagc
cctggaacagtggagaaggaggcaaagaagaaaaggaagttgaaaaagaaagccaagaag
gtggccagtgtcgactcggctgtcgctgccaccacccccaccagcatggccacagtccag
aagcagaagtcaggtgagctcaacggggaccaggtgtcgcttgggaccaaaaagaagaaa
aaggcaaagaaggccagcgagacctctccattcccaccagcaaagagtgctacagctata
cctgcaaactga

>gi568815586f:120878769_121101189|GENSCAN_predicted_peptide_6|514_aa
MALMQELYSTPASRLDSFVAQWLQPHREWKEEVLDAVRTVEEFLRQEHFQGKRGLDQDVR
VLKVVKVGSFGNGTVLRSTREVELVAFLSCFHSFQEAAKHHKDVLRLIWKTMWQSQDLLD
LGLEDLRMEQRVPDALVFTIQTRGTAEPITVTIVPAYRALGPSLPNSQPPPEVYVSLIKA
CGGPGNFCPSFSELQRNFVKHRPTKLKSLLRLVKHWYQQYVKARSPRANLPPLYALELLT
IYAWEMGTEEDENFMLDEGFTTVMDLLLEYEVICIYWTKYYTLHNAIIEDCVRKQLKKER
PIILDPADPTLNVAEGYRWDIVAQRASQCLKQDCCYDNRENPISSWNVKRARDIHLTVEQ
RGYPDFNLIVNPYEPIRKVKEKIRRTRGYSGLQRLSFQVPGSERQLLSSRCSLAKYGIFS
HTHIYLLETIPSEIQVFVKNPDGGSYAYAINPNSFILGLKQQIEDQQGLPKKQQQLEFQG
QVLQDWLGLGIYGIQDSDTLILSKKKGEALFPAS

>gi568815586f:120878769_121101189|GENSCAN_predicted_CDS_6|1545_bp
atggcactgatgcaggaactgtatagcacaccagcctccaggctggactccttcgtggct
cagtggctgcagccccaccgggagtggaaggaagaggtgctagacgctgtgcggaccgtg
gaggagtttctgaggcaggagcatttccaggggaagcgtgggctggaccaggatgtgcgg
gtgctgaaggtagtcaaggtgggctccttcgggaatggcacggttctcaggagcaccaga
gaggtggagctggtggcgtttctgagctgtttccacagcttccaggaggcagccaagcat
cacaaagatgttctgaggctgatatggaaaaccatgtggcaaagccaggacctgctggac
ctcgggctcgaggacctgaggatggagcagagagtccccgatgctctcgtcttcaccatc
cagaccagggggactgcggagcccatcacggtcaccattgtgcctgcctacagagccctg
gggccttctcttcccaactcccagccaccccctgaggtctatgtgagcctgatcaaggcc
tgcggtggtcctggaaatttctgcccatccttcagcgagctgcagagaaatttcgtgaaa
catcggccaactaagctgaagagcctcctgcgcctggtgaaacactggtaccagcagtat
gtgaaagccaggtcccccagagccaatctgccccctctctatgctcttgaacttctaacc
atctatgcctgggaaatgggtactgaagaagacgagaatttcatgttggacgaaggcttc
accactgtgatggacctgctcctggagtatgaagtcatctgtatctactggaccaagtac
tacacactccacaatgcaatcattgaggattgtgtcagaaaacagctcaaaaaagagagg
cccatcatcctggatccggccgaccccaccctcaacgtggcagaagggtacagatgggac
atcgttgctcagagggcctcccagtgcctgaaacaggactgttgctatgacaacagggag
aaccccatctccagctggaacgtgaagagggcacgagacatccacttgacagtggagcag
aggggttacccagatttcaacctcatcgtgaacccttatgagcccataaggaaggttaaa
gagaaaatccggaggaccaggggctactctggcctgcagcgtctgtccttccaggttcct
ggcagtgagaggcagcttctcagcagcaggtgctccttagccaaatatgggatcttctcc
cacactcacatctatctgctggagaccatcccctccgagatccaggtcttcgtgaagaat
cctgatggtgggagctacgcctatgccatcaaccccaacagcttcatcctgggtctgaag
cagcagattgaagaccagcaggggcttcctaaaaagcagcagcagctggaattccaaggc
caagtcctgcaggactggttgggtctggggatctatggcatccaagacagtgacactctc
atcctctcgaagaagaaaggagaggctctgtttccagccagttag

>gi568815586f:120878769_121101189|GENSCAN_predicted_peptide_7|199_aa
MGTQACGSSAEALSNGCTIITQLWDHQSLCKRVPSSSVSTGKGTMLNYSSDLDLILFLSC
FSSVQDQAQLRDSIISFIEENWFTVARAWPTISLWSGTGRVQSRKSSGVIWMDKLPAFDA
LGKDSDRLSVDWTVPTYIKQHKDYREARSRRGGDACSCKKMPGNRHRNSRSQISKTQQHK
GSCLNKFIGESKAAIRTKL

>gi568815586f:120878769_121101189|GENSCAN_predicted_CDS_7|600_bp
atgggaacccaagcatgtggctccagcgccgaggctctgagcaacggatgcaccatcatc
actcagctgtgggaccaccagtcactgtgcaaaagagttccatcatctagcgtctccaca
gggaaggggacgatgctgaactacagctctgacctggacctgattctcttcctgagctgc
ttctccagcgtccaagaccaggcacagctgcgagacagtatcatcagcttcattgaagag
aattggttcactgtagcaagagcctggcctacaatatcactgtggtccggcacagggagg
gtccagtccaggaagagcagcggagtcatttggatggataagctcccggctttcgatgct
ctgggtaaagacagcgacaggctctcagtggattggacggtgcccacctacattaagcag
cacaaagactacagggaagctagaagcagacgtggcggggatgcctgcagctgcaagaag
atgcctgggaacagacacagaaactctcgctcccagataagcaaaacacagcagcacaaa
ggcagctgtttgaataaattcattggagagtctaaggcagcaatccggaccaagctgtaa

>gi568815586f:120878769_121101189|GENSCAN_predicted_peptide_8|196_aa
XCSLLVYRTATSLGNMMCDPGFPLHPQPQTPPRFVQMCTLGAADPRLLHTAQPWAACLRP
HEGLMKTPLGWALARVGKGSLLTLFSLLDLLTERGVFPGSDPSQRVGALFSMHFSKAHGS
EIQQEQIMEDSNPDYARKVIWAEENLLAIPRSIPSASGQRQEEPQTSPPASSSTEGVLDL
KLSGDWNHCCGLYSRA

>gi568815586f:120878769_121101189|GENSCAN_predicted_CDS_8|591_bp
naatgttctttgttggtgtatagaactgctactagcctgggcaacatgatgtgtgatcca
ggatttcctctccacccacagccgcagacgcccccacggttcgtgcaaatgtgcacactc
ggtgcagcagatccccgcctcctgcacaccgctcagccctgggcagcctgtcttaggcca
catgaaggtctcatgaaaacacccctgggctgggctctagctcgagtgggcaaaggttcc
ctcctcacactgttcagcctgttggaccttctcactgaacgaggcgtcttcccaggatct
gacccaagtcagagggtgggggctctattttcaatgcatttttccaaagctcatggctca
gagatacaacaagaacagatcatggaagattcaaatccagactatgccagaaaagtgatc
tgggctgaagaaaacctcctggccattcccagaagcatcccgtcggcatctgggcaaagg
caggaggagcctcagacatcaccacctgccagctcctccactgaaggcgtcctggacctg
aagctatcaggagattggaatcattgctgtggtttgtacagcagggcctga