GENSCAN 1.0 Date run: 5-Nov-116 Time: 19:02:16 Sequence gi568815595f:10041848_10249962 : 208115 bp : 46.41% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 712 816 105 0 0 95 69 30 0.565 2.21 1.02 Intr + 1203 1303 101 2 2 68 98 23 0.471 0.21 1.03 Intr + 1637 1745 109 2 1 52 100 28 0.694 0.69 1.04 Intr + 1982 2017 36 1 0 116 90 16 0.880 3.06 1.05 Intr + 4733 4876 144 1 0 32 83 149 0.977 9.28 1.06 Intr + 6070 6204 135 0 0 80 91 -1 0.480 0.16 1.07 Intr + 7527 7658 132 2 0 89 65 74 0.981 6.04 1.08 Intr + 10540 10650 111 0 0 98 110 80 0.980 11.78 1.09 Intr + 18447 18556 110 2 2 94 86 116 0.166 11.08 1.10 Intr + 20304 20364 61 0 1 73 109 29 0.917 2.24 1.11 Intr + 21945 22064 120 2 0 114 63 48 0.856 5.59 1.12 Intr + 22509 22582 74 2 2 60 73 38 0.905 -2.30 1.13 Intr + 22882 23028 147 1 0 96 32 135 0.435 8.05 1.14 Intr + 23547 23647 101 0 2 39 52 72 0.443 -1.55 1.15 Intr + 24017 24132 116 0 2 55 83 64 0.862 2.77 1.16 Intr + 24372 24443 72 2 0 84 70 80 0.976 5.40 1.17 Intr + 25362 25470 109 2 1 48 110 67 0.952 4.76 1.18 Intr + 31024 31134 111 2 0 46 97 72 0.838 4.25 1.19 Intr + 31406 31515 110 0 2 43 67 58 0.973 -0.80 1.20 Intr + 32683 32826 144 0 0 69 100 61 0.964 5.88 1.21 Intr + 36234 36350 117 2 0 68 96 61 0.971 5.56 1.22 Intr + 39499 39617 119 0 2 46 111 57 0.914 3.06 1.23 Intr + 43965 44075 111 0 0 85 109 12 0.620 2.49 1.24 Intr + 46981 47103 123 1 0 24 110 181 0.965 13.60 1.25 Intr + 48445 48538 94 1 1 88 109 82 0.999 10.27 1.26 Intr + 50334 50405 72 2 0 53 113 101 0.960 8.70 1.27 Intr + 51438 51476 39 2 0 101 100 0 0.608 0.92 1.28 Intr + 56873 56968 96 1 0 51 55 70 0.527 0.21 1.29 Term + 59341 59415 75 0 0 116 41 110 0.991 6.94 1.30 PlyA + 60063 60068 6 1.05 2.03 PlyA - 61081 61076 6 1.05 2.02 Term - 62935 62394 542 2 2 88 43 168 0.791 6.72 2.01 Init - 66714 66660 55 0 1 96 86 -1 0.597 2.01 2.00 Prom - 70339 70300 40 -4.56 3.00 Prom + 71915 71954 40 -6.86 3.01 Init + 73855 73972 118 0 1 117 57 172 0.989 17.36 3.02 Intr + 74398 74495 98 2 2 89 81 70 0.879 6.13 3.03 Term + 77059 77124 66 0 0 116 39 86 0.938 4.44 3.04 PlyA + 77297 77302 6 -0.45 4.00 Prom + 78090 78129 40 -4.86 4.01 Init + 79338 79426 89 2 2 77 84 22 0.160 0.81 4.02 Intr + 92989 93079 91 1 1 103 115 39 0.038 8.10 4.03 Term + 94971 95129 159 2 0 12 39 187 0.563 4.14 4.04 PlyA + 95777 95782 6 1.05 5.00 Prom + 98526 98565 40 -8.56 5.01 Init + 100001 100340 340 1 1 58 80 530 0.991 46.62 5.02 Intr + 104667 104789 123 1 0 103 84 30 0.960 4.66 5.03 Term + 107940 108118 179 1 2 71 49 232 0.991 15.45 5.04 PlyA + 114684 114689 6 1.05 6.00 Prom + 114817 114856 40 -5.36 6.01 Init + 123108 123201 94 2 1 86 109 219 0.404 24.34 6.02 Intr + 131306 131359 54 0 0 68 82 38 0.137 0.15 6.03 Intr + 135991 136173 183 2 0 86 74 358 0.320 33.96 6.04 Intr + 158522 158668 147 0 0 107 68 35 0.000 3.61 6.05 Intr + 167742 167845 104 1 2 101 94 -10 0.178 0.79 6.06 Intr + 171360 171554 195 2 0 81 83 171 0.954 15.51 6.07 Intr + 171637 171701 65 0 2 113 87 -25 0.879 -2.58 6.08 Intr + 175087 175201 115 1 1 62 110 70 0.891 7.05 6.09 Intr + 177833 177942 110 1 2 110 78 124 0.919 12.68 6.10 Intr + 180789 180984 196 0 1 75 55 171 0.736 11.92 6.11 Intr + 184524 184586 63 2 0 119 95 93 0.778 12.01 6.12 Intr + 192612 192812 201 2 0 88 83 245 0.996 23.48 6.13 Intr + 196901 197192 292 1 1 125 103 101 0.800 12.01 6.14 Term + 200269 200381 113 2 2 15 45 152 0.694 2.32 6.15 PlyA + 201708 201713 6 1.05 7.00 Prom + 204255 204294 40 -8.56 7.01 Init + 207354 207767 414 2 0 81 100 293 0.448 26.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 11382 11393 12 2 0 93 44 0 0.808 -5.70 S.002 Sngl - 16440 16150 291 0 0 13 35 317 0.939 14.55 S.003 Init + 93029 93079 51 1 0 89 115 32 0.834 7.17 S.004 Term - 157700 157536 165 2 0 58 53 156 0.909 7.02 S.005 Init - 159560 159414 147 2 0 48 77 108 0.847 5.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:10041848_10249962|GENSCAN_predicted_peptide_1|997_aa VRQLVMDKLSSIRLEDLPVIIKFILHSVTAMDTLEVISELREKLDLQHCVLPSRLQASQV KLKSKGRASSSGNQESSGQSCIILLFDVIKSAIRYEKTISEAWIKAIENTASVSEHKVFD LVMLFIIYSTNTQTKKYIDRVLRNKIRSGCIQEQLLQSTFSVHYLVLKDMCSSILSLAQS LLHSLDQSIISFGSLLYKYAFKFFDTYCQQEVVGALVTHICSGNEAEVDTALDVLLELVV LNPSAMMMNAVFVKGILDYLDNISPQQIRKLFYVLSTLAFSKQNEASSHIQDDMHLVIRK QLSSTVFKYKLIGIIGAVTMAGIMAADRSESPSLTQERANLSDEQCTQVTSLLQLVHSCS EQSPQASALYYDEFANLIQHEKLDPKALEWVGHTICNDFQDAFVVDSCVVPEGDFPFPVK ALYGLEEYDTQDGIAINLLPLLFSQDFAKDGGPVTSQESGQKLVSPLCLAPYFRLLRLCV ERQHNGNLEEIDGLLDCPIFLTDLEPGEKLESMSAKERSFMCSLIFLTLNWFRENDLLTF TVELLFVLAQAYAYCFDSIVNAFCQETSPEMKGKVLTRLKHIVELQIILEKYLAVTPDYV PPLGNFDVETLDITPHTVTAISAKIRKKGKIERKQKTDGSKTSSSDTLSEEKNSECDPTP SHRGQLNKEFTGKEEKTSLLLHNSHAFFRELDIEVFSILHCGLVTKFILDTEMHTEATEV VQLGPPELLFLLEDLSQKLESMLTPPIARRVPFLKCLAAENHGVVDGPGVKVQEYHIMSS CYQRLLQIFHGLFAWSGFSQPENQNLLYSALHVLSSRLKQGEHSQPLEELLSIYLEHTES ILKAIEEIAGVGVPELINSPKDASSSTFPTLTRHTFVVFFRVMMAELEKTVKKIEPGTAA DSQQIHEEKLLYWNMAVRDFSILINLIKVFDSHPVLHVCLKGEEIKSQNSQESTADESED DMSSQASKSKATEDGEEDEVSAGEKEQDSDESYDDSD >gi568815595f:10041848_10249962|GENSCAN_predicted_CDS_1|2994_bp gttcgccagttggtgatggataagttgtcgtctattagattggaggatttacctgtgata ataaagttcattcttcattccgtaacagccatggatacacttgaggtaatttctgagctt cgggagaagttggatctgcagcattgtgttttgccatcacggttacaggcttcccaagta aagttgaaaagtaaaggacgagcaagttcctcaggaaatcaagaaagcagcggtcagagc tgtattattctcctctttgatgtaataaagtcagctattagatatgagaaaaccatttca gaagcctggattaaggcaattgaaaacactgcctcagtatctgaacacaaggtgtttgac ctggtgatgcttttcatcatctatagcaccaatactcagacaaagaagtacattgacagg gtgctaagaaataagattcgatcaggctgcattcaagaacagctgctccagagtacattc tctgttcattacttagttcttaaggatatgtgttcatccattctgtcgctggctcagagt ttgcttcactctctagaccagagtataatttcatttggcagtctcctatacaaatatgca tttaagttttttgacacgtactgccagcaggaagtggttggtgccttagtgacccatatc tgcagtgggaatgaagctgaagttgatactgccttagatgtccttctagagttggtagtg ttaaacccatctgctatgatgatgaatgctgtctttgtaaagggcattttagattatctg gataacatatcccctcagcaaatacgaaaactcttctatgttctcagcacactggcattt agcaaacagaatgaagccagcagccacatccaggatgacatgcacttggtgataagaaag cagctctctagcaccgtattcaagtacaagctcattgggattattggtgctgtgaccatg gctggcatcatggcggcagacagaagtgaatcacctagtttgacccaagagagagccaac ctgagcgatgagcagtgcacacaggtgacctccttgttgcagttggttcattcctgcagt gagcagtctcctcaggcctctgcactttactatgatgaatttgccaacctgatccaacat gaaaagctggatccaaaagccctggaatgggttgggcataccatctgtaatgatttccag gatgccttcgtagtggactcctgtgttgttccggaaggtgactttccatttcctgtgaaa gcactgtacggactggaagaatacgacactcaggatgggattgccataaacctcctgccg ctgctgttttctcaggactttgcaaaagatgggggtccggtgacctcacaggaatcaggc caaaaattggtgtctccgctgtgcctggctccgtatttccggttactgagactttgtgtg gagagacagcataacggaaacttggaggagattgatggtctactagattgtcctatattc ctaactgacctggagcctggagagaagttggagtccatgtctgctaaagagcgttcattc atgtgttctctcatatttcttactctcaactggttccgagagaatgaccttctcaccttc actgtggaactactgttcgtcctggcacaggcatatgcctactgttttgattcaattgta aatgccttctgccaggaaacatcacctgagatgaaggggaaggtgctcactcggttaaag cacattgtagaattgcaaataatcctggaaaagtacttggcagtcaccccagactatgtc cctcctcttggaaactttgatgtggaaactttagatataacacctcatactgttactgct atttcagcaaaaatcagaaagaaaggaaaaatagaaaggaaacaaaaaacagatggcagc aagacatcctcctctgacacactttcagaagagaaaaattcagaatgtgaccctacgcca tctcatagaggccagctaaacaaggagttcacagggaaggaagaaaagacatcattgtta ctacataattcccatgcttttttccgagagctggacattgaggtcttctctattctacat tgtggacttgtgacgaagttcatcttagatactgaaatgcacactgaagctacagaagtt gtgcaacttgggccccctgagctgcttttcttgctggaagatctctcccagaagctggag agtatgctgacacctcctattgccaggagagtcccctttctcaagtgtttagctgctgag aatcacggtgtagttgatggaccaggagtgaaagttcaggagtaccacataatgtcttcc tgctatcagaggctgctgcagatttttcatgggctttttgcttggagtggattttctcaa cctgaaaatcagaatttactgtattcagccctccatgtccttagtagccgactgaaacag ggagaacacagccagcctttggaggaactactcagtatctacctggagcacacagagagc attctgaaggccatagaggagattgctggtgttggtgtcccagaactgatcaactctcct aaagatgcatcttcctccacattccctacactgaccaggcatacttttgttgttttcttc cgtgtgatgatggctgaactagagaagacggtgaaaaaaattgagcctggcacagcagca gactcgcagcagattcatgaagagaaactcctctactggaacatggctgttcgagacttc agtatcctcatcaacttgataaaggtatttgatagtcatcctgttctgcatgtatgtttg aagggtgaagagattaagtcccaaaattcccaggagagcacagcagatgagagtgaggat gacatgtcatcccaggcctccaagagcaaagccactgaggatggtgaagaagacgaagta agtgctggagaaaaggagcaagatagtgatgagagttatgatgactctgattag >gi568815595f:10041848_10249962|GENSCAN_predicted_peptide_2|198_aa MAIEVGVACPLWLDQRHTGLSMAGYQLWSPWTPLDESFQWLRHTTPTPSSKHPFKASPCF PHTPSDLEVQLCFQEVTLVLDSPFLESGVSPKLPCHTSELRTMNNKGLVRKPQPIRLSGV DSVFGRVITAQPPKWTGTFRVSDKSAFCKIISREHQWPIGLKEPQIQMTVTMCKQMLRSI LLLYATYKKCTFALQHSK >gi568815595f:10041848_10249962|GENSCAN_predicted_CDS_2|597_bp atggccattgaagtaggggtcgcctgtcctctgtggcttgatcaaaggcacacaggactg tcaatggcaggataccagctctggtcaccatggaccccactggatgagagtttccaatgg ctgcggcacacgacacctacaccttcctccaagcacccattcaaggcctccccctgcttc ccacacacaccgtccgaccttgaagtgcagctgtgctttcaagaggtcactctagtccta gacagcccattcctggaatctggagtgagtcccaagttaccctgccacacatcagagttg cgcacgatgaacaacaaaggactggtcaggaagccccagcccatccgcctcagtggagta gattctgtctttggcagggttatcacagctcagccaccaaagtggaccgggactttcaga gtttcagacaagtcagccttttgcaaaatcattagcagggagcaccagtggcccattgga ctgaaggagcctcagattcagatgacagtcactatgtgcaaacagatgctgcgctctatc ctcttgctgtatgcaacttacaaaaagtgcacctttgccttgcagcactccaagtaa >gi568815595f:10041848_10249962|GENSCAN_predicted_peptide_3|93_aa MAGQEDPVQREIHQDWANREYIEIITSSIKKIADFLNSFGSGRYPLASCLLVGPVVNHSR LVHSFLFPVSAAPTQQEDNEDEDLYDDPLPFNE >gi568815595f:10041848_10249962|GENSCAN_predicted_CDS_3|282_bp atggcgggacaggaggatccggtgcagcgggagattcaccaggactgggctaaccgggag tacattgagataatcaccagcagcatcaagaaaatcgcagactttctcaactcgttcgga tctggtcgataccctctagcgtcgtgccttttagttggacccgttgtcaaccactcccgt cttgtccattccttcctgtttcctgtgtctgctgcgcctactcagcaggaagacaatgag gatgaagacctttatgatgatccactgccatttaatgaatag >gi568815595f:10041848_10249962|GENSCAN_predicted_peptide_4|112_aa MGSDRTKRYKVSFGDDENILKVDYNYCTTLCTKIGKLKADSGDMPAAAERCTGTDNSPAQ CSRMLDSSSSVARVVAKAKEVPRVSEGREGCQHIVTSQKYKLNVNKIPLYIN >gi568815595f:10041848_10249962|GENSCAN_predicted_CDS_4|339_bp atggggagtgacaggactaagaggtataaggtatcttttggggatgatgaaaatattcta aaagtagattataattattgcacaactctgtgcactaagatagggaagctaaaagcagac tcaggggacatgcctgcagctgcagaaagatgtacgggaacagacaactctcccgcccag tgcagccgcatgctggacagctcctcaagcgtggccagagtggtggccaaggccaaggag gtgccgagagtgagcgagggccgcgagggctgccagcacattgtcacctctcagaagtac aaattaaacgtcaataagataccactatacatcaattag >gi568815595f:10041848_10249962|GENSCAN_predicted_peptide_5|213_aa MPRRAENWDEAEVGAEEAGVEEYGPEEDGGEESGAEESGPEESGPEELGAEEEMEAGRPR PVLRSVNSREPSQVIFCNRSPRVVLPVWLNFDGEPQPYPTLPPGTGRRIHSYRGHLWLFR DAGTHDGLLVNQTELFVPSLNVDGQPIFANITLPVYTLKERCLQVVRSLVKPENYRRLDI VRSLYEDLEDHPNVQKDLERLTQERIAHQRMGD >gi568815595f:10041848_10249962|GENSCAN_predicted_CDS_5|642_bp atgccccggagggcggagaactgggacgaggccgaggtaggcgcggaggaggcaggcgtc gaagagtacggccctgaagaagacggcggggaggagtcgggcgccgaggagtccggcccg gaagagtccggcccggaggaactgggcgccgaggaggagatggaggccgggcggccgcgg cccgtgctgcgctcggtgaactcgcgcgagccctcccaggtcatcttctgcaatcgcagt ccgcgcgtcgtgctgcccgtatggctcaacttcgacggcgagccgcagccctacccaacg ctgccgcctggcacgggccgccgcatccacagctaccgaggtcacctttggctcttcaga gatgcagggacacacgatgggcttctggttaaccaaactgaattatttgtgccatctctc aatgttgacggacagcctatttttgccaatatcacactgccagtgtatactctgaaagag cgatgcctccaggttgtccggagcctagtcaagcctgagaattacaggagactggacatc gtcaggtcgctctacgaagatctggaagaccacccaaatgtgcagaaagacctggagcgg ctgacacaggagcgcattgcacatcaacggatgggagattga >gi568815595f:10041848_10249962|GENSCAN_predicted_peptide_6|643_aa MACYIYQLPSWVLDDLCRNMDALSEWDWMEFDEETEAQGEMNTFPKNTATSYVITDLTQL RKIKSMERVQGVSITRELLWWWGMRQATVQQLVDLLCRLELYRAAQIILNWKPAPEIRCP IPAFPDSVKPEKPLAASVRKAEDEQEEGQPVRMATFPGPGSSPARAHQPAFLQPPEEDAP HSLRSDLPTSSDSKDFSTSIPKQEKLLSLAGDSLFWSEADVVQATDDFNQNRKISQGTFA DVYRGHRHGKPFVFKKLRETACSSPGSIERFFQAELQICLRCCHPNVLPVLGFCAARQFH SFIYPYMANGSLQDRLQGQGGSDPLPWPQRVSICSGLLCAVEYLHGLEIIHSNVKSSNVL LDQNLTPKLAHPMAHLCPVNKRSKYTMMKTHLLRTSAAYLPEDFIRVGQLTKRVDIFSCG IVLAEVLTGIPAMDNNRSPVYLKDLLLSDIPSSTASLCSRKTGVENVMAKEICQKYLEKG AGRLPEDCAEALATAACLCLRRRNTSLQEVCGSVAAVEERLRGRETLLPWSGLSEGTGSS SNTPEETDDVDNSSLDASSSMSVAPWAGAATPLLPTENGEGRLRVIVGREADSSSEACVG LEPPQDVTETSWQIEINEAKRKLMENILLYKEEKVDSIELFGP >gi568815595f:10041848_10249962|GENSCAN_predicted_CDS_6|1932_bp atggcctgctacatctaccagctgccctcctgggtgctggacgacctgtgccgcaacatg gacgcgctcagcgagtgggactggatggagttcgatgaggaaactgaggctcagggggag atgaacaccttccccaaaaacacagcaacctcctacgtgatcacagacctgacccagctg cggaagatcaagtccatggagcgggtgcagggtgtgagcatcacgcgggagctgctgtgg tggtggggcatgcggcaggccaccgtccagcaacttgtggacctcctgtgccgcctggag ctctaccgggctgcccagatcatcctgaactggaaaccggctcctgaaatcaggtgtccc attccagccttccctgactctgtgaagccagaaaagcctttggcagcttctgtaagaaag gctgaggatgaacaggaagaggggcagcctgtgaggatggccacctttccaggcccaggg tcctctccagccagagcccaccagccggcctttctccagcctcctgaagaagatgcccct cattccttgagaagcgacctccccacttcgtctgattcaaaggacttcagcacctccatt cctaagcaggaaaaacttttgagcttggctggagacagccttttctggagtgaggcagac gtggtccaggcaaccgatgacttcaatcaaaaccgcaaaatcagccaggggacctttgct gacgtctacagagggcacaggcacgggaagccattcgtcttcaagaagctcagagagaca gcctgttcaagtccaggatcaatcgaaagattcttccaggcagagttgcagatttgtctt agatgctgccaccccaatgtcttacctgtgctgggcttctgtgctgcaagacagtttcac agcttcatctacccctacatggcaaatggttccctacaggacagactgcagggtcagggt ggctcggaccccctcccctggccccagcgtgtcagcatctgctcagggctgctctgtgcc gtcgagtacctgcatggtctggagatcatccacagcaacgtcaagagctctaatgtcttg ctggaccaaaatctcacccccaaacttgctcacccaatggctcatctgtgtcctgtcaac aaaaggtcaaaatacaccatgatgaagactcacctgctccggacgtcagccgcgtatctg ccagaggatttcatccgggtggggcagctgacaaagcgagtggacatcttcagctgtgga atagtgttggccgaggtcctcacgggcatccctgcaatggataacaaccgaagcccggtt tacctgaaggacttactcctcagtgatattccaagcagcaccgcctcgctctgctccagg aagacgggcgtggagaacgtgatggcaaaggagatctgccagaagtacctggagaagggc gcagggaggcttccggaggactgcgccgaggccctggccacggctgcctgcctgtgcctg cggaggcgtaacaccagcctgcaggaggtgtgtggctctgtggctgctgtggaagagcgg ctccgaggtcgggagacgttgctcccttggagtgggctttctgagggtacaggctcttct tccaacaccccagaggaaacagacgacgttgacaattccagccttgatgcctcctcctcc atgagtgtggcaccctgggcaggggctgccaccccacttctccccacagagaatggggaa ggaaggctgcgggtcatcgtgggaagggaggctgactcctcctctgaggcctgtgttggc ctggagcctccccaggatgttacagaaacttcgtggcaaattgagatcaatgaggccaaa aggaaactgatggagaatattctgctctacaaagaggaaaaagtggacagcattgagctc tttggcccctga >gi568815595f:10041848_10249962|GENSCAN_predicted_peptide_7|138_aa MASERGKVKHNWSSTSEGCPRKRSCLREPCDVAPSSRPAQRSASRSGGPSSPKRLKAQKE DDVACSRRLSWGSSRRRNNSSSSFSPHFLGPGVGGAASKGCLIRNTRGFLSSGGSPLRPA NASLEEMASLEEEACSLK >gi568815595f:10041848_10249962|GENSCAN_predicted_CDS_7|414_bp atggcgtccgagcggggcaaggtcaagcacaactggagcagcacgtcggaagggtgtccc cgcaagcgcagctgcctccgggagccctgtgatgtggccccctccagccggccagctcag aggtctgcgtcgcgttctggagggcccagcagccccaagcgcctgaaagcccagaaggag gacgatgtggcttgctcgcggaggttatcctggggctcatcccgccgcagaaataactcc tcctcctccttctccccacatttcttgggccctggtgtgggcggggccgcctccaaaggc tgcctgattcggaacactcgggggttcctgtcttcagggggatcccctctgcgtcctgcc aacgcctctttggaagaaatggcttctctagaggaggaagcctgcagccttaag