GENSCAN 1.0    Date run: 5-Nov-116    Time: 06:29:21

Sequence gi568815590f:47943031_48160825 : 217795 bp : 44.18% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  16765  17226  462  2  0   58   74   285 0.845  17.23
 1.02 Intr +  17707  17854  148  2  1   32   73    89 0.907   1.09
 1.03 Intr +  18101  18184   84  2  0   91   77    99 0.933   8.04
 1.04 Intr +  18249  18396  148  0  1  107   89   -43 0.573  -2.06
 1.05 Intr +  18521  18650  130  1  1   42   96    58 0.838   2.27
 1.06 Intr +  19023  19186  164  1  2   69   84    45 0.890   1.89
 1.07 Intr +  19275  19376  102  2  0   90   99    49 0.989   6.57
 1.08 Intr +  19734  19829   96  2  0   52   74    90 0.922   4.21
 1.09 Intr +  19915  20010   96  0  0   90  110    14 0.941   4.01
 1.10 Intr +  21544  21682  139  0  1   72   90   154 0.999  14.04
 1.11 Intr +  23157  23377  221  1  2  126   62   312 0.993  30.42
 1.12 Intr +  24335  24455  121  1  1   74  110    11 0.893   1.97
 1.13 Intr +  26768  27027  260  0  2   73   87   277 0.917  23.28
 1.14 Intr +  27496  27846  351  0  0   46  113   336 0.628  27.42
 1.15 Intr +  28311  28438  128  2  2   71   62    73 0.962   2.48
 1.16 Intr +  29827  30034  208  1  1   67   76   340 0.839  29.78
 1.17 Intr +  31704  31932  229  2  1   46   21   255 0.859  12.04
 1.18 Intr +  32685  32818  134  1  2   68   92   109 0.999   9.66
 1.19 Term +  33656  33748   93  1  0  127   48    95 0.996   7.23
 1.20 PlyA +  33937  33942    6                               1.05

 2.00 Prom +  39205  39244   40                              -3.56
 2.01 Init +  48628  48699   72  0  0   51   82    81 0.505   4.87
 2.02 Intr +  49933  50128  196  0  1   60   37    52 0.385  -3.71
 2.03 Term +  51747  51778   32  1  2   94   47    94 0.746   3.92
 2.04 PlyA +  52050  52055    6                              -0.45

 3.04 PlyA -  54005  54000    6                               1.05
 3.03 Term -  55394  55267  128  0  2   78   38   152 0.664   7.64
 3.02 Intr -  65743  65286  458  0  2   90   18   272 0.033  13.26
 3.01 Init -  87254  87169   86  2  2   41  103    52 0.104   2.29
 3.00 Prom -  88922  88883   40                              -4.56

 4.00 Prom +  89379  89418   40                              -4.36
 4.01 Init +  90343  90391   49  0  1   86   28    26 0.349  -4.49
 4.02 Intr +  91912  92031  120  2  0  102  113    23 0.863   6.67
 4.03 Intr + 100003 100151  149  2  2   58   56   105 0.870   4.25
 4.04 Intr + 106823 106948  126  1  0  100  115    24 0.964   7.18
 4.05 Term + 117652 117798  147  0  0  142   28    75 0.962   4.80
 4.06 PlyA + 118322 118327    6                               1.05

 5.08 PlyA - 118641 118636    6                               1.05
 5.07 Term - 125164 124968  197  2  2   44   32   110 0.254  -1.43
 5.06 Intr - 132621 132498  124  0  1   73   74   136 0.566  10.86
 5.05 Intr - 140898 140645  254  1  2   42   33   191 0.117   6.15
 5.04 Intr - 142446 142286  161  2  2   69  106    36 0.303   3.13
 5.03 Intr - 142784 142654  131  2  2   63   45    74 0.427   0.09
 5.02 Intr - 159275 159201   75  2  0   95   86     2 0.266   0.41
 5.01 Init - 162399 162073  327  0  0   92   52   160 0.290   8.82
 5.00 Prom - 175745 175706   40                              -2.46

 6.09 PlyA - 178943 178938    6                               1.05
 6.08 Term - 183884 183724  161  0  2   54   53   113 0.348   2.50
 6.07 Intr - 187816 187770   47  0  2  108   77    62 0.400   5.15
 6.06 Intr - 190695 190592  104  0  2   19   68    54 0.123  -4.53
 6.05 Intr - 194347 194114  234  1  0   91   93    72 0.153   5.89
 6.04 Intr - 194616 194516  101  1  2   72   10    47 0.010  -4.87
 6.03 Intr - 203249 203115  135  0  0   91   49   102 0.177   7.14
 6.02 Intr - 206933 206848   86  1  2   55   61    91 0.000   2.56
 6.01 Init - 216069 215966  104  0  2   39  110    42 0.001   1.21

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 169138 169244  107  2  2  120   55    97 0.956   8.17
S.002 Intr + 208391 208527  137  2  2   44   53   154 0.915   7.71
S.003 Intr + 209615 209786  172  0  1   80   44   146 0.957   8.40
S.004 Intr + 210383 210465   83  2  2   93   76   100 0.906   8.58

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815590f:47943031_48160825|GENSCAN_predicted_peptide_1|1104_aa
XGKKQYDKSLYTHTQKFPQESNCCTSKYRHLNTEKRRAARLQNDSGSQDPPAAQLGPVPT
QHRGAAAQDAFLPQAADQLMTGQGSTAAVRSGQGLLQPQQGATHTGSGARHAAESRSRAC
ARSARTRKCPYARRRGCGARRQELSRGPLREGLARGQTIWTKGGRARPTSGLRTGAERSG
TVPPCRCAVAPPGGGATYERRCDWYSEHYVVPGVDPEPPRQPAWKGHPRPDAWQRWVGAR
DPGAQPRAGRCRLVRTDTHSRLWPGCCLIRLPFASVWFGSVRRRGEDSTSTGELQPMPTS
PGVDLQSPAAQDVLFSSPPQMHSSAIPLDFDVSSPLTYGTPSSRVEGTPRSGVRGTPVRQ
RPDLGSAQKGLQVDLQSDGAAAEDIVASEQSLGQKLVIWGTDVNVAACKENFQRFLQRFI
DPLAKEEENVGIDITEPLYMQRLGEINVIGEPFLNVNCEHIKSFDKNLYRQLISYPQEVI
PTFDMAVNEIFFDRYPDSILEHQIQVRPFNALKTKNMRNLNPEDIDQLITISGMVIRTSQ
LIPEMQEAFFQCQVCAHTTRVEMDRGRIAEPSVCGRCHTTHSMALIHNRSLFSDKQMIKL
QESPEDMPAGQTPHTVILFAHNDLVDKVQPGDRVNVTGIYRAVPIRVNPRVSNVKSVYKT
HIDVIHYRKTDAKRLHGLDEEAEQKLFSEKRVELLKELSRKPDIYERLASALAPSIYEHE
DIKKLFGGTRKDFSHTGRGKFRAEINILLCGDPGTSKSQLLQYVYNLVPRGQYTSGKGSS
AVGLTAYVMKDPETRQLVLQTGALVLSDNGICCIDEFDKMNESTRSVLHEVMEQQTLSIA
KAGIICQLNARTSVLAAANPIESQWNPKKTTIENIQLPHTLLSRFDLIFLLLDPQDEAYD
RRLAHHLVALYYQSEEQAEEELLDMAVLKDYIAYAHSTIMPRLSEEASQALIEAYVDMRK
IGSSRGMVSAYPRQLESLIRLAEAHAKVRLSNKVEAIDVEEAKRLHREALKQSATDPRTG
IVDISILTTGMSATSRKRKEELAEALKKLILSKGKTPALKYQQLFEDIRGQSDIAITKDM
FEEALRALADDDFLTVTGKTVRLL

>gi568815590f:47943031_48160825|GENSCAN_predicted_CDS_1|3315_bp
nggggtaaaaagcagtatgacaaatccctttatacacatacacagaaattcccccaagaa
tctaattgctgtacttcaaaataccgtcatctaaacacagagaagcgccgggctgcccgg
ctccagaacgactcgggaagccaggacccacccgcggcccagctcgggccggtacccacc
cagcaccgcggggctgctgctcaggacgcattcctgccccaggccgcggatcagttgatg
accggccagggcagcaccgcagcggtccgcagcggacaaggtctcctgcagccgcagcag
ggagcaacgcacaccggctccggagcccgccatgccgccgagtcccgctcccgcgcgtgc
gcccgctcggcccggacccggaaatgcccctacgcgcggaggcggggctgcggggcgcgg
cggcaggaactttcccggggacccctgcgggaaggcctggccaggggtcagaccatctgg
accaaggggggccgagcgaggcctacttctggtttacgcacgggcgctgaaagaagcggc
actgtccccccctgccgatgcgcagtggcgcctcccggaggcggagccacgtacgagcgc
cgctgtgattggtactccgagcactatgtcgtccccggcgtcgaccccgagccgccgcgg
cagccggcgtggaagggccacccccgcccagacgcctggcagcgctgggtgggtgcgcgg
gacccgggcgctcagcctcgggctgggcgctgccgcttggtgcgcacagacacccacagc
aggctgtggcctgggtgctgcttaattcgattgccatttgcctctgtttggtttggttca
gtgagacgtagaggcgaggattccacctccacgggggagttgcagccgatgccaacctcg
cctggagtggacctgcagagccctgctgcgcaggacgtgctgttttccagccctccccaa
atgcattcttcagctatccctcttgactttgatgttagttcaccactgacatacggcact
cccagctctcgggtagagggaaccccaagaagtggtgttaggggcacacctgtgagacag
aggcctgacctgggctctgcacagaagggcctgcaagtggatctgcagtctgacggggca
gcagcagaagatatagtggcaagtgagcagtctctaggccaaaaacttgtgatctgggga
acagatgtaaatgtggcagcatgcaaagaaaactttcagagatttcttcagcgttttatt
gaccctctggctaaagaagaagaaaatgttggcatagatattactgaacctctatacatg
caacgacttggggagattaatgttattggtgagccatttttaaatgtgaactgtgaacac
atcaaatcatttgacaaaaatttgtacagacaactcatctcttacccacaggaagttatt
ccaacttttgacatggctgtcaatgaaatcttctttgaccgttaccctgactcaatctta
gaacatcagattcaagtaagaccattcaacgcattgaagactaagaatatgagaaacctg
aatccagaagacattgaccagctcatcaccatcagcggcatggtgatcaggacatcccag
ctgattcccgagatgcaggaggccttcttccagtgccaagtgtgtgcccacacgacccgg
gtggagatggaccgcggccgcattgcagagcccagtgtgtgcgggcgctgccacaccacc
cacagcatggcactcatccacaaccgctccctcttctctgacaagcagatgatcaagctt
caggagtctccggaagacatgcctgcagggcagacaccacacacagttatcctgtttgct
cacaatgatctcgttgacaaggtccagcctggggacagagtgaatgttacaggcatctat
cgagctgtgcctattcgagtcaatccaagagtgagtaatgtgaagtctgtctacaaaacc
cacattgatgtcattcattatcggaaaacggatgcaaaacgtctgcatggccttgatgaa
gaagcagaacagaaacttttttcagagaaacgtgtggaattgcttaaggaactttccagg
aaaccagacatttatgagaggcttgcttcagccttggctccaagcatttatgaacatgaa
gatataaagaagctctttggcgggacaaggaaggattttagtcacactggaaggggcaaa
tttcgggctgagatcaacatcttgctgtgtggcgaccctggtaccagcaagtcccagctg
ctgcagtacgtgtacaacctcgtccccaggggccagtacacgtctgggaagggctccagt
gcagttggcctcactgcgtacgtaatgaaagaccctgagacaaggcagctggtcctgcag
acaggtgctcttgtcctgagtgacaacggcatctgctgtatcgatgagttcgacaagatg
aatgaaagtacaagatcggtattgcatgaagtcatggaacagcagactctgtccattgca
aaggctgggatcatctgtcagctcaatgcgcgcacctctgtcctggcagcagcaaatccc
attgagtctcagtggaatcctaaaaaaacaaccattgaaaacatccagctgcctcatact
ttattatcaaggtttgatttgatcttcctcttgctggaccctcaggacgaagcctatgac
aggcgtctggctcaccacctggtcgcactgtactaccagagcgaggagcaggcagaggag
gagctcctggacatggcggtgctaaaggactacattgcctacgcgcacagcaccatcatg
ccgcggctaagtgaggaagccagccaggctctcatcgaggcttatgtagacatgaggaag
attggcagtagccggggaatggtttctgcataccctcgacagctagagtcattaatccgc
ttagcagaagcccatgctaaagtaagattgtctaacaaagttgaagccattgatgtggaa
gaggccaaacgcctccatcgggaagctctgaagcagtctgcaactgatccccggactggc
atcgtggacatatctattcttactacggggatgagtgccacctctcgtaaacggaaagaa
gaattagctgaagcattgaaaaagcttattttatctaagggcaaaacaccagctctaaaa
taccagcaactttttgaagatattcggggacaatctgacatagcaattactaaagatatg
tttgaagaagcactgcgtgccctggcagatgatgatttcctgacagtgactgggaagacc
gtgcgcttgctctga

>gi568815590f:47943031_48160825|GENSCAN_predicted_peptide_2|99_aa
MDVRQTQFGPRNVLEYRLEVGEVQLGDVSLVSLSNPSGLKELVNKGAVSDRGPFGSEKSR
LLPDGGVGMKNVIGIFGVCVNIDSSAFGNELKRLLLLAL

>gi568815590f:47943031_48160825|GENSCAN_predicted_CDS_2|300_bp
atggatgtgaggcagactcagtttggcccgaggaatgtgctggaataccgattagaagtt
ggtgaagtgcagttgggagatgtaagcctagtaagtctgagtaatcctagtggcctgaag
gagttggtaaataagggagccgttagtgataggggtccctttggcagtgagaaatcccgt
ctcttgccagatggtggcgtgggaatgaagaatgtgataggcatatttggagtctgtgta
aatattgactcgtctgcctttggaaacgagctaaagcggctgctgctgctagcgctctaa

>gi568815590f:47943031_48160825|GENSCAN_predicted_peptide_3|223_aa
MYKLTADVADTEHRILSPGHYYLVGKTTIAFSLQLGPAPAGTYGGARAGATPPPARRRAA
DARARPANPTAAGASAPVPPGPRLRSAESAAFETAHSGFSRTVSARAPAADGGAEPGRSG
KSRSPAREPTCGDRHLLLQPDAAARTHARTVARHFRERAALHASQRTPPPAGAGRPEGQR
ARTPDTAVTFSIPDKFPRKGNALFHVLVTGSLFFFYVRPYFHS

>gi568815590f:47943031_48160825|GENSCAN_predicted_CDS_3|672_bp
atgtataaactgacagcagatgttgctgatactgaacacagaatcctttccccaggacat
tactatctcgtggggaagacaaccatagccttcagcctacaactcggacccgcgcccgcc
ggaacctacgggggcgcccgcgcgggagccaccccgcccccagcccgccggcgcgcggcc
gacgcgcgagcacgacccgccaacccgacggccgccggggcctcggcgccggtcccaccc
ggtcccaggctacgctctgcggagagcgcggcattcgagacagcgcactcgggcttctcc
cgcacggtcagcgctcgggcgccagcagcagacggaggggcagagccgggtcggagcgga
aaatcacgcagcccggcccgggaaccgacctgtggagaccgccatcttctcctgcagccc
gacgcagccgcccgcacgcacgcacgaaccgtcgcgcgtcacttccgggagcgcgccgcc
ctgcacgcgtcacagcggactccgcccccggcgggcgcggggcgacccgagggacagcgc
gcacggacacctgacacggcggttaccttctctattccagacaagttccccaggaaaggg
aacgccctcttccacgtccttgtcaccggcagtctcttcttcttctacgtgcgcccctac
ttccattcataa

>gi568815590f:47943031_48160825|GENSCAN_predicted_peptide_4|196_aa
MGFHRVGQAGLELLDSTSPGGQLGLPHIPVVSGCSEFLHDGWLPKERKQKLAFLLQGVKV
PRNFRLLEELEEGQKGVGDGTVSWGLEDDEDMTLTRWTGMIIGPPRTNYENRIYSLKVEC
GPKYPEAPPSVRFVTKINMNGINNSSGMVDARSIPVLAKWQNSYSIKVVLQELRRLMMSK
ENMKLPQPPEGQTYNN

>gi568815590f:47943031_48160825|GENSCAN_predicted_CDS_4|591_bp
atggggtttcaccgtgttggccaggctggtctcgaactcctggactcaacctctccaggt
ggccagttaggacttcctcacatcccggtggtgtcagggtgttcagaattcttacatgat
ggttggcttccaaaagaacgaaagcagaagttagcattcctcttacaaggagttaaagtt
cctcgtaattttcgcttgttggaagaacttgaagaaggacaaaaaggagtaggcgacggt
acagttagctggggccttgaagatgatgaagatatgacacttacaaggtggacaggcatg
attattgggccaccaaggacaaattatgaaaacagaatatatagcctgaaagtagaatgt
ggacctaaatacccagaagctcctccgtcagttagatttgtaacaaaaattaatatgaac
ggaataaataattccagtgggatggtggatgcccggagcataccagtgttagcaaaatgg
caaaattcatatagcattaaagttgtacttcaagagctaagacgtctaatgatgtccaaa
gaaaatatgaagcttccacagccaccagaaggacaaacatacaacaattaa

>gi568815590f:47943031_48160825|GENSCAN_predicted_peptide_5|422_aa
MEDKGRGRGGPVGVAQSTMPTTEETSNTQASRVAQQSTSARLSSATLGPARQAHAESVNG
RRKGGGQRATGPFFQPPLFSVPPTCDSNRHRALYTTTPEMVNQIPGAQKPGWLHLAACLP
PTDSELQGKGVVLKVFWRTHVVLKLPCIRGRGKASYNTDLQISGFEGYKDAAGLRATMPC
PRLLKPTLACWCSDPCRGQQWLERAAGSGCTRELTASYDLGADSEDRTEHLGSHIQPQKS
PSCVPRRPVLLATVTHLPAEEAPLMADITTQVIITPQLGIWTLLPADVAASAHGSVNPSL
SAALRVTLLSVSDTIGGASDTSKEDLSSPYMVTLETERSIFLVGRTNQQGMCQYLGPISY
LFPMDLILMACCWYQSNCAQPEMHPSGPLRQDVQGRPLSSVGASRITNIQKHICSPLTTG
RN

>gi568815590f:47943031_48160825|GENSCAN_predicted_CDS_5|1269_bp
atggaggacaaaggaaggggccgcggcggccctgtgggagtggcacagagcaccatgccc
accacagaagagacatccaacacccaggcaagcagagtggctcagcagtcgacatcagcg
cgtctgtcatcagccaccttggggccggcacggcaggcacatgcagagagtgtcaacggt
agaaggaaaggtggaggccaaagggccacagggccattcttccaaccaccgctgttcagt
gtgccacccacttgtgacagcaaccgacacagggccctgtacaccaccacgccagagatg
gtcaaccagatccctggggcacagaagccaggctggctccacctcgccgcatgtcttccg
ccaacagactcagaactacaagggaagggagtcgtactcaaggtcttttggcgaacacac
gtggttcttaaactgccatgcatcaggggcaggggcaaggcttcttacaacacagatctg
cagatctccgggtttgagggttacaaggatgctgctggtctcagggccacaatgccctgc
ccacggctacttaagcccaccctggcctgctggtgctcagatccttgcaggggacagcag
tggttagagcgtgctgctggttctgggtgcacccgggagctcaccgcatcctatgacctg
ggagcggattcagaggacaggacagaacacttggggtcccacatccagccacagaagagc
ccctcctgcgttccacgcaggcctgtccttcttgctactgtcacacacttgccagcagaa
gaggcccctctaatggccgacatcaccacccaggttatcattacccctcagctgggcatc
tggaccctcctgcctgccgatgtggctgccagcgcccatggcagtgtgaaccccagtctg
tctgcagccctgcgggtgaccctcctgtctgtcagtgacactattggtggtgcatctgac
acatccaaagaggacctgtctagcccctacatggttaccctggagactgaaaggtctata
tttctggtgggccgcaccaaccagcagggcatgtgccaatacctgggccccataagttac
ctctttccaatggacctcatcttaatggcctgctgctggtaccaaagcaattgtgcccaa
cctgagatgcatccatcagggcccctgcggcaagacgtgcaggggcgtcctctcagcagt
gtgggggcatcaagaatcacaaacatccagaagcatatctgttcacctttgaccacgggc
agaaattaa

>gi568815590f:47943031_48160825|GENSCAN_predicted_peptide_6|323_aa
MCHEVGPKGRRQELSDHIAWEILHIIAFCGDSTCSSVTLSKSYGHGHRQHESNTTYPAYL
AVQSLSIVYPEHKSRSCMRALHIGSFDATTCADASHREGQIKEETCRPGDVAQLSMNKGA
FSITLSQVAQSLVYTGAHGETSVAASNSLHLPRFALHLFAQIFWSPYQGGTYDEMANLLS
LLKRQQCMESSCFESLSLLPQLFLELPAVTGSCDNSSPAQKNEDVYPHKRFYTIVHGSLI
CDSQKLETTQMSYGRPHPDAAADTLIFSIHRNKVLCRQKHTASRTLPIFLQDCLQKHMNM
LPLRFINVHIERNSNDLPEMLNG

>gi568815590f:47943031_48160825|GENSCAN_predicted_CDS_6|972_bp
atgtgtcatgaagtgggcccaaaaggaaggcgacaagagctcagcgaccacatagcctgg
gaaatactgcatattatagccttctgtggagattccacctgcagctccgtgaccttgagc
aagtcatatggccatggccaccgacaacacgagagcaacaccacctaccctgcctatctg
gctgtacagagtctctccatagtttacccagagcacaaatctcgaagctgcatgagagca
ttacacattggcagttttgatgccacaacctgtgctgatgcttcacaccgagaaggacag
atcaaagaagaaacatgccggccgggtgatgtggcccaactttccatgaataaaggagct
ttctctattacactgagtcaagtagctcagagcttggtttacaccggagctcatggagag
actagtgttgctgccagcaatagcctccaccttcctagatttgccctacatttgtttgcg
cagattttctggtcaccatatcagggaggcacatatgatgaaatggcaaaccttctcagc
ttactgaagcggcagcagtgcatggaatcctcatgctttgaatctctctcacttctgccg
cagctctttcttgagctccctgctgttacgggctcctgtgataacagtagccccgcccag
aaaaatgaagatgtatacccacacaaaagattctacacaattgttcatggtagccttatt
tgtgatagccaaaaactagaaacaactcaaatgtcctacggcaggccacatcctgacgca
gctgctgacaccctcatattcagcatccaccgaaataaggttctgtgccgccagaaacat
acagcttcccgaaccctgcctattttccttcaagactgtcttcagaaacacatgaatatg
cttcctctgcgctttattaatgttcatattgagagaaatagcaacgacctgccagagatg
ctaaatgggtga