GENSCAN 1.0	Date run: 6-Nov-116	Time: 19:25:10

Sequence gi568815597r:30613585_30823551 : 209967 bp : 51.46% C+G : Isochore 3 (51 - 57 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   3337   3396   60  0  0   96   78    -9 0.318   0.12
 1.02 Intr +   3880   3954   75  0  0   33   77    74 0.254   1.01
 1.03 Intr +   5787   5886  100  2  1   74   78    29 0.407   0.68
 1.04 Term +   9821   9873   53  0  2   94   48    76 0.599   2.08
 1.05 PlyA +  10505  10510    6                               1.05

 2.00 Prom +  13469  13508   40                              -2.71
 2.01 Init +  20218  20313   96  0  0   82  103    80 0.701   7.20
 2.02 Intr +  22218  22318  101  2  2  128   80     3 0.436   2.91
 2.03 Intr +  24501  24599   99  0  0   86   89     9 0.245   0.52
 2.04 Intr +  34629  34756  128  0  2   44   79    83 0.749   3.83
 2.05 Term +  35649  35767  119  1  2   50   52    75 0.100  -0.89
 2.06 PlyA +  36445  36450    6                               1.05

 3.07 PlyA -  37317  37312    6                               1.05
 3.06 Term -  42215  41604  612  2  0  -23   55  1275 0.895 108.00
 3.05 Intr -  43129  43084   46  0  1   77  105     5 0.126  -0.00
 3.04 Intr -  48236  48160   77  2  2  118   24    67 0.024   2.11
 3.03 Intr -  48910  48855   56  2  2  126   81   -16 0.147   0.69
 3.02 Intr -  50178  50120   59  2  2  105   48    43 0.067   1.02
 3.01 Init -  50975  50860  116  2  2   73   72    48 0.270   1.43
 3.00 Prom -  55237  55198   40                              -2.21

 4.00 Prom +  60083  60122   40                              -3.11
 4.01 Init +  62615  62624   10  1  1   51  101     3 0.350  -1.60
 4.02 Intr +  65290  65362   73  2  1   87  100    30 0.848   3.06
 4.03 Intr +  65587  65692  106  1  1   83   45   107 0.813   6.62
 4.04 Intr +  66879  66978  100  2  1   77   67    34 0.669   0.38
 4.05 Term +  69180  69340  161  1  2   72   49   125 0.853   5.42
 4.06 PlyA +  73073  73078    6                              -0.45

 5.11 PlyA -  73357  73352    6                               1.05
 5.10 Term -  79088  78874  215  0  2  128   36    91 0.070   5.62
 5.09 Intr -  83133  83084   50  2  2  157   94   -15 0.105   4.81
 5.08 Intr -  98206  98009  198  0  0  105   80    24 0.053   2.29
 5.07 Intr - 101422 101332   91  2  1   -5   91    62 0.051  -3.25
 5.06 Intr - 101725 101573  153  2  0  115   92   285 0.794  32.16
 5.05 Intr - 102741 102325  417  1  0  112   86   651 0.995  61.87
 5.04 Intr - 103331 103206  126  0  0   76   61   238 0.888  21.06
 5.03 Intr - 105439 105151  289  1  1   60   62   635 0.896  55.46
 5.02 Intr - 108167 107821  347  0  2   94  102   641 0.975  61.77
 5.01 Init - 109967 109874   94  2  1   61   78   170 0.716  11.80
 5.00 Prom - 110866 110827   40                              -8.68

 6.00 Prom + 111466 111505   40                             -10.67
 6.01 Init + 112211 112467  257  1  2   67   93   208 0.936  15.90
 6.02 Term + 115886 115997  112  2  1   95   41    40 0.221  -1.67
 6.03 PlyA + 117938 117943    6                               1.05

 7.20 PlyA - 118905 118900    6                               1.05
 7.19 Term - 120333 120244   90  0  0  146   46    85 0.609   8.12
 7.18 Intr - 121681 121589   93  1  0   62  113   104 0.746  10.96
 7.17 Intr - 124115 124020   96  2  0   63  113   118 0.958  12.51
 7.16 Intr - 125478 125356  123  0  0  119  101   215 0.999  27.19
 7.15 Intr - 126353 126225  129  2  0   77   68   189 0.922  17.20
 7.14 Intr - 128132 128056   77  0  2  124  100   115 0.992  16.03
 7.13 Intr - 128965 128872   94  1  1  116   86   110 0.726  13.64
 7.12 Intr - 130591 130538   54  1  0   55   87    58 0.081   2.06
 7.11 Intr - 134917 134891   27  1  0  111   94   -14 0.247   0.30
 7.10 Intr - 144188 144075  114  2  0   97   89   180 0.444  20.15
 7.09 Intr - 146902 146821   82  0  1   43   75    38 0.014  -1.86
 7.08 Intr - 156122 156032   91  0  1   64   80    85 0.041   4.85
 7.07 Intr - 156280 156233   48  2  0  107   76    18 0.032   1.64
 7.06 Intr - 160870 160757  114  2  0   72   45    55 0.006   0.52
 7.05 Intr - 169630 169517  114  2  0   93    4    93 0.249   2.32
 7.04 Intr - 173286 172889  398  2  2   66   61   105 0.140   0.39
 7.03 Intr - 176046 175959   88  1  1  102  121    49 0.659   9.13
 7.02 Intr - 187723 187667   57  2  0   62   94    74 0.012   4.75
 7.01 Init - 196661 196553  109  2  1   85   91    83 0.699   8.64
 7.00 Prom - 196814 196775   40                              -4.61

 8.00 Prom + 198912 198951   40                              -3.61
 8.01 Init + 200256 200268   13  2  1   81   86    -2 0.781  -0.92
 8.02 Intr + 200659 200771  113  2  2   94  121    42 0.791   8.70
 8.03 Term + 204043 204105   63  0  0   44   48   121 0.851   1.88
 8.04 PlyA + 204873 204878    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  16155  16037  119  0  2   77   64   150 0.972   9.16
S.002 Init +  79113  79354  242  2  2   49   81   184 0.891  11.13
S.003 Init - 155801 155695  107  2  2   64   64   124 0.800   5.46

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815597r:30613585_30823551|GENSCAN_predicted_peptide_1|95_aa
MRTSWVVSPGSGSHTCRFPEYRRHLSEEINKDKSSDCIENITDDMTSPFLDYTLGKQRET
EITGKVLLLYTVPGKVCRAVFSLEYVMRLFSQQIL

>gi568815597r:30613585_30823551|GENSCAN_predicted_CDS_1|288_bp
atgaggacatcttgggtggtcagcccaggcagcggaagccacacgtgcagattcccagag
taccgacgtcatctctcagaggagattaacaaagacaaatcttcagactgcatagaaaat
atcactgatgatatgacttcaccatttcttgattacaccctgggcaaacagcgagagaca
gaaatcacaggaaaggtgcttctgctatacacagtcccaggcaaggtgtgcagagcagtg
ttctccctggagtacgtcatgcgtctgttttctcagcaaatcctgtga

>gi568815597r:30613585_30823551|GENSCAN_predicted_peptide_2|180_aa
MVTAFLRAPVSSLWPPLAPYPDKGLLTTFSLMGPLASQHPQQIPKEAGSPVCLSTCWILS
TELSARAGRDESALASYAALQMRIQGGEVGKFAQSFTPNVGGPGGRAFGFTLERIQGPAS
GVRQQLLLKQQCAAAACSSSKGKFQVSDHAKPPPLPSLEPIMRIVAWVHRMAPGADKAPA

>gi568815597r:30613585_30823551|GENSCAN_predicted_CDS_2|543_bp
atggtgacagcctttctgcgggctcctgtgagctcactgtggcctcccttggctccttac
ccagacaagggccttctcaccaccttcagcctcatgggacctcttgcctcccagcatcct
caacagatccccaaagaggccgggtccccagtatgcttaagcacctgctggatactaagt
acagagctcagcgccagggctgggagggatgaaagcgccctagcaagctatgctgcttta
cagatgagaatccaaggcggagaagtggggaaatttgcccaaagtttcacccccaatgtt
ggagggcctggtgggagggcttttggcttcaccctggagagaattcaagggccagccagt
ggtgttagacagcaacttttattgaagcagcagtgtgcagcagcagcatgcagcagcagc
aaagggaagttccaagtgtctgaccacgccaagccgcctcccttaccctctctggagccc
atcatgaggatagttgcctgggtccatcgcatggcacctggagctgacaaagcccctgcc
tga

>gi568815597r:30613585_30823551|GENSCAN_predicted_peptide_3|321_aa
MAWAAKRERAHSAGPSEWGDGTRGQSAIFPGVLSQGALRLCETAQLDGTYPGLGDGRGGM
PGVLLGPVGRSVHYSSLCPPHSAVIELPAGFENKAGRRELPSLDICLTASTTGNQMPEDK
ELVQNCNTTTTTATTTITTTVTTTITTITTTITTTTTTVTTTITTTTTTITTTVTTTTVT
TTTTTTVTTTTTTVTTTVTTTTTTTVTTTTTTTTTTTITTVTTTTTTTVTTVTTTTVTTV
TTTTTTTITTTTTTTITTTTTTVTTVTTTTTTTTITTTTTTTTTTTTTITTTTTTTITTT
ITTTTTTSITTTTTTITTTTY

>gi568815597r:30613585_30823551|GENSCAN_predicted_CDS_3|966_bp
atggcttgggccgcgaagcgtgagcgtgctcactcagctgggccctctgagtggggcgat
gggacccgagggcagagcgccatcttccctggcgtcctgagccagggggctctcaggctg
tgtgagacagcgcagctggatgggacctaccccgggcttggggacggcagaggaggcatg
cctggagttctcctgggcccagttggcaggagtgtccactacagctcactgtgccctccc
cacagtgcggtcatcgagctccctgcaggcttcgagaacaaggcagggaggagggaacta
ccttcactggacatttgcctcacggccagcaccacaggaaaccaaatgcccgaggataaa
gaactggtgcagaattgtaacaccaccaccaccaccgccaccaccaccatcactaccact
gtcaccaccaccatcaccaccatcaccaccaccatcaccaccaccaccaccaccgtcacc
accaccatcaccaccaccaccaccaccatcaccaccaccgtcaccaccaccaccgtcacc
accaccaccaccaccaccgtcaccaccaccaccaccactgtcaccaccaccgtcaccacc
accaccaccaccaccgtcaccaccaccaccaccaccaccaccaccaccaccatcaccacc
gtcaccaccaccaccaccaccaccgtcaccaccgtcaccaccaccacagtcaccactgtc
accaccaccaccaccaccaccatcaccaccaccaccaccaccaccatcaccaccaccacc
accacagtcaccactgtcaccaccaccaccaccaccaccaccatcaccaccaccaccacc
accaccaccaccaccaccaccaccatcaccaccaccaccaccaccaccatcaccaccacc
atcaccaccaccactaccacaagcatcactaccaccaccaccaccatcaccaccaccact
tactga

>gi568815597r:30613585_30823551|GENSCAN_predicted_peptide_4|149_aa
MQQWLTSRNGLRNYQNYQSTWISQSNHHKQQPGTHNPDLPQTPPVDGSNSPSRAATAITL
AAAFLESPSFWVPLCSPVAAMEAACGVLGPAAALQRAEWQLQLQPHEIKWIKAEAKGMKT
LTPCHFATSSARDKLAHKATEANKEMDRH

>gi568815597r:30613585_30823551|GENSCAN_predicted_CDS_4|450_bp
atgcagcaatggctaactagtagaaatgggctgagaaactaccaaaactaccagtcaacc
tggatcagtcaatccaaccaccacaaacagcaacctggaactcacaacccagatcttccc
cagactcctcctgttgatggcagcaacagtccatctagagcagccactgctattacactg
gctgcagcattcctggagtctccaagcttctgggtgccactgtgttctccagtggccgcc
atggaagctgcttgcggtgtacttggtccagctgcagccttgcagagagctgagtggcag
ctacagctccagccccatgaaatcaagtggataaaagctgaagcaaagggcatgaagacc
ttgactccctgtcattttgccaccagctctgccagagacaagctggcacacaaagccact
gaagccaataaggaaatggatcggcactga

>gi568815597r:30613585_30823551|GENSCAN_predicted_peptide_5|659_aa
MRVLSGTSLMLCSLLLLLQALCSPGLAPQSRGHLCRTRPTDLVFVVDSSRSVRPVEFEKV
KVFLSQVIESLDVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMT
GLAIQFAITKAFGDAEGGRSRSPDISKPPLLDSPVGFREPGVTPPRVSQVVIVVTDGRPQ
DSVQDVSARARASGVELFAIGVGSVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE
AFCVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSATDLVF
LIDGSKSVRPENFELVKKFISQIVDTLDVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDI
KAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPGAQKVGIVFTDGRSQDYINDAAKK
AKDLGFKMFAVGVGNAVEDELREIASEPVAEHYFYTADFKTINQIGKKLQKKICVALTTS
KHVECSLRGWQCSQCSPDITSSDPIWWGGGEAAMLIWTEKAQQCVPKEEERVISGRHGLG
AGGQKQQMSLMSGVRDGVEALPGNRTSFFPARTPLPQALKALVSFSDPGRINPCYLPGPS
VGSTSLGSSLIPQDGLGDPVAPTPSVSPSSDPTVLPSSVPHWTGGLEGSDGFSVPVVTE

>gi568815597r:30613585_30823551|GENSCAN_predicted_CDS_5|1980_bp
atgagggtcctctctggcactagcctcatgctctgcagcctgctgctgctgctccaggcc
ctgtgcagccctggcctcgccccccagtccagaggccatctctgccggacgcggcccaca
gacctggtgtttgttgtcgacagctctcgcagcgttcggcctgttgaatttgagaaagtg
aaggtattcctgtcccaggtcatcgagtcgctggacgtggggcccaatgccacccgggtg
ggcatggtcaactatgccagcaccgtgaagcaggagttctcgctgcgggctcatgtctcc
aaggccgcactgctgcaggctgtgcgccgtatccagccgctgtccacaggcaccatgacc
ggcctggccatccagttcgctatcaccaaagccttcggcgatgcagagggtggtcgttcc
aggtcccctgacatcagcaagcctcctctcctggacagtcccgtgggcttccgggagccc
ggtgtcacaccgccccgcgtgtcgcaggtggtcatcgtggtgacagacgggaggccccag
gacagcgtgcaggacgtgtctgcgcgggcccgggccagcggcgtcgagctgttcgccatc
ggagtgggcagcgtggacaaggccacgctgcggcagatcgccagcgagccgcaggacgaa
cacgtcgattacgtggagagctacagcgtcatcgagaagctgtccaggaagttccaggag
gccttctgcgtggtgtcagacctgtgcgccacaggggaccatgactgtgagcaggtgtgc
atcagctcccccggttcctacacctgcgcctgccacgagggcttcactctgaacagcgac
ggcaagacctgcaatgtctgcagtggtggtggtggcagctcggccactgacctggtcttc
ctcattgacggatccaagagtgtgaggccagagaactttgagctggtgaagaagttcatc
agtcagatcgtggatacgctggacgtgtcagacaagctggcccaggtggggctggtgcag
tactcaagctctgtgcgccaggagttccccctgggtcgcttccacaccaagaaggacatc
aaggcggctgtgcggaatatgtcctacatggagaagggcacaatgactggggctgctctc
aagtacctcattgacaattccttcactgtgtccagtggggctaggcccggggcccagaag
gtgggcattgtcttcactgatggccggagccaggactacattaatgatgctgccaagaag
gccaaagacctcggctttaagatgtttgctgtgggtgtgggcaatgccgtggaggatgag
ctgagggaaatagcctcagagcctgtggcagagcactacttctacacggctgacttcaag
accatcaaccagataggcaagaagttgcagaagaagatctgtgtggctcttactaccagc
aaacatgttgaatgctcgttacgtggctggcagtgttctcagtgctctccggatattaca
tcatccgatcctatttggtggggagggggagaggcagccatgttgatatggactgagaag
gctcagcagtgtgtccccaaggaggaggagagggtcattagcggacgccatggcctggga
gctggcgggcagaagcaacagatgtccctgatgagtggagtcagagatggggtggaagcc
ctgcctgggaataggacctcgttcttccctgcccggacacccctccctcaggccctcaag
gccctagtttctttctctgatccaggtagaattaacccctgctatcttccaggtcccagt
gtaggcagcacctccttggggagctccctgatcccccaggatgggctaggggaccctgta
gctcccacaccctctgtgtctccctcatcagatcccacggtgctgccatcatctgttcct
cactggactgggggcctggagggcagtgatggcttctctgtccccgtggtgactgaatag

>gi568815597r:30613585_30823551|GENSCAN_predicted_peptide_6|122_aa
MLEVNEENVSWGLLTNCSPHPNPPDGQGCVQGSGHKAPLAGQQLAHETALSAPPLNGSSM
GSSEEHRRLLCVTSPLISRAAEIHYRPHAWSAKLESRVRRRVLSSAGLSGEATEQQHILL
MG

>gi568815597r:30613585_30823551|GENSCAN_predicted_CDS_6|369_bp
atgctggaggtgaacgaggagaacgtgagctggggtctcctcaccaactgctccccccac
cccaacccccctgacggccagggctgtgtccagggctcagggcacaaggctccgttagct
gggcagcagctggcccatgaaacagccctttcagctccacctctgaatggcagttctatg
ggctcctcggaagaacacaggcgccttctctgtgtcacttctccgctcattagcagagct
gcagaaatacactacaggccccacgcatggtcagccaaactggaatccagggtgaggagg
cgtgtgctttcatctgccggcctgtctggggaggccacggagcagcagcacatccttctg
atgggttaa

>gi568815597r:30613585_30823551|GENSCAN_predicted_peptide_7|665_aa
MDIPEPLQVFRSCGSTGDLEWVAFGGAGIFCGSPEVVPSGCHYRLALGSPAKVVAGVDTL
SPDVATESPADLWQVFVRGEACVPSQAAGAPLGHPGSGSWGPVGIPWGLSGRCEGPLSPF
DQALVEPEVASDTHQTKGFFERPGSGELGFICPGICGTIRGPQGRSPGPHKPVSMMSLIF
LPPPPCSALPESPSLFLLWPSKEPVTTVQLSCGAATAGEALGTHQIVCRHPPNRSPGLIN
GSLLDFEPSATVAQPVRTTAGAAAETSFAKVIPVISGVKRLTHLHFMSMATLAVVSVFLP
EASGKQQKSGSFSLKITNFDLQCIRSTNSGPQNRNSLGARAIQIMQDNLPTHLKILNFMA
PVKFLCHCLRGGDGSTMDPRLSTVRQTCCCFNVRIATTALAIYHVATQGNQGGQVEGAVR
ACTVLLVLLPNHIMSVLLFIEHSVEVAHGKASCKLSQMGYLRIADLISSFLLITMLFIIS
LSLLIGVVKNREKYLLPFLSLQIMDYLLCLLTLLGSYIELPAYLKLASRSRASSSKFPLM
TLQLLDFCLSILTLCSSYMEVPTYLNFKSMNHMNYLPSQEDMPHNQFIKMMIIFSIAFIT
VLIFKVYMFKCVWRCYRLIKCMNSVEEKRNSKMLQKVVLPSYEEALSLPSKTPEGGPAPP
PYSEV

>gi568815597r:30613585_30823551|GENSCAN_predicted_CDS_7|1998_bp
atggatattccagagcctttgcaggttttccggagctgcggctccacaggggacctggag
tgggtggcctttggtggagcaggaatattttgtgggtctccagaggttgtccccagcggc
tgccactatcgcctggcccttggttctccagcaaaggtggtggcaggagtagacacatta
tccccagacgtggccactgagagtcctgcagacctgtggcaagtgtttgtcagaggggaa
gcatgtgtcccaagccaggctgcgggggccccccttggccacccaggcagcggaagctgg
gggccagtgggcatcccctggggcctctctggccggtgtgagggtcctttgtctccattt
gatcaggccctcgtggagccggaagtagcctctgatacccaccagacaaagggcttcttt
gagaggccaggaagtggcgagctgggatttatctgcccagggatttgtgggaccatccga
gggccacagggcaggagccctggtcctcacaaacctgtttctatgatgtcactgatcttc
ctccctccacctccttgctcagctctccctgagtcccccagtcttttcctgctctggccc
tccaaggagcctgtcacaacagtgcagctctcatgcggagctgccacagcaggggaggcc
ctggggacccatcagatagtttgccggcaccctcccaacagaagtcccggactgatcaat
ggctccctgctggactttgagccttcagccacagtggcccagcctgtcaggacgaccgca
ggggcagctgcggaaaccagctttgcaaaagttataccagttatatcgggggtcaaacgc
ctcacacacctgcactttatgtccatggcaaccctagcagttgtttctgttttcctccct
gaggcttctgggaagcagcagaagtcaggttccttctccctgaaaattacaaactttgac
cttcagtgcatccgctcaactaattcaggtcctcaaaatcgaaattctttaggggccagg
gccatccagataatgcaggataatctccccacccacctcaagatcctgaatttcatggcg
cctgtgaagtttctttgccattgtctcagaggaggggacggcagcaccatggacccccgc
ttgtccactgtccgccagacctgctgctgcttcaatgtccgcatcgcaaccaccgccctg
gccatctaccatgtggcaacccagggaaatcagggtgggcaggtagagggggcagttaga
gcatgcaccgtgctcctggtactactgccgaaccacatcatgagcgtcttgttgttcatc
gagcactcagtagaggtggcccatggcaaggcgtcctgcaagctctcccagatgggctac
ctcaggatcgctgacctgatctccagcttcctgctcatcaccatgctcttcatcatcagc
ctgagcctactgatcggcgtagtcaagaaccgggagaagtacctgctgcccttcctgtcc
ctgcaaatcatggactatctcctgtgcctgctcaccctgctgggctcctacattgagctg
cccgcctacctcaagttggcctcccggagccgtgctagctcctccaagttccccctgatg
acgctgcagctgctggacttctgcctgagcatcctgaccctctgcagctcctacatggaa
gtgcccacctatctcaacttcaagtccatgaaccacatgaattacctccccagccaggag
gatatgcctcataaccagttcatcaagatgatgatcatcttttccatcgccttcatcact
gtccttatcttcaaggtctacatgttcaagtgcgtgtggcggtgctacagattgatcaag
tgcatgaactcggtggaggagaagagaaactccaagatgctccagaaggtggtcctgccg
tcctacgaggaagccctgtctttgccatcgaagaccccagaggggggcccagcaccaccc
ccatactcagaggtgtga

>gi568815597r:30613585_30823551|GENSCAN_predicted_peptide_8|62_aa
MWLQGSNGQTGKVPAVALTSKAGAPGLRVCSRCLKHGPHQKKVRDDDMTLPGLIIRGLNW
GK

>gi568815597r:30613585_30823551|GENSCAN_predicted_CDS_8|189_bp
atgtggttacaaggttctaatggccaaactggcaaagttcctgctgttgccctcacctcc
aaggctggtgctcctggactcagggtgtgttccaggtgcctgaagcatggcccacaccag
aaaaaggtgagggatgacgacatgacccttcctggtctcatcatccggggactcaactgg
gggaagtga