GENSCAN 1.0    Date run: 3-Nov-116    Time: 12:15:01

Sequence gi568815582r:8695955_8896591 : 200637 bp : 48.44% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4157   4170   14  1  2  114   73    15 0.037   0.81
 1.02 Intr +  19108  19294  187  1  1   92   37    78 0.171   2.79
 1.03 Intr +  23508  23579   72  2  0   81   64    42 0.100   0.70
 1.04 Intr +  34145  34245  101  1  2   98   86    28 0.317   2.51
 1.05 Term +  35487  35622  136  0  1   40   33   132 0.484   0.39
 1.06 PlyA +  38448  38453    6                               1.05

 2.00 Prom +  39611  39650   40                              -5.46
 2.01 Init +  39786  39855   70  2  1   96  107   133 0.798  15.29
 2.02 Intr +  50047  50144   98  2  2   87  109    87 0.732  10.43
 2.03 Intr +  52154  52183   30  1  0  105  111     7 0.784   3.03
 2.04 Intr +  54468  54585  118  2  1   71   82   121 0.995   9.84
 2.05 Intr +  61803  61852   50  1  2  100   78    22 0.656   0.80
 2.06 Intr +  68115  68195   81  2  0  106  109    78 0.913  11.53
 2.07 Intr +  68784  68876   93  2  0  136   76   115 0.999  15.26
 2.08 Intr +  70254  70316   63  2  0   88  109    27 0.929   3.71
 2.09 Intr +  72239  72302   64  1  1  117   82    69 0.876   7.49
 2.10 Intr +  72871  73019  149  2  2   52   80   174 0.943  12.95
 2.11 Intr +  76826  76963  138  1  0   54   86   325 0.972  29.56
 2.12 Intr +  78936  79103  168  2  0   86   68   276 0.999  25.54
 2.13 Intr +  80390  80536  147  1  0   80   92   258 0.991  25.83
 2.14 Intr +  83525  83636  112  1  1  106  116    79 0.999  12.45
 2.15 Term +  85355  85476  122  0  2  124   43    41 0.660   1.84
 2.16 PlyA +  87422  87427    6                               1.05

 3.02 PlyA -  87713  87708    6                               1.05
 3.01 Sngl - 100468  99998  471  1  0    1   47   546 0.639  37.92
 3.00 Prom - 101115 101076   40                              -9.75

 4.00 Prom + 101258 101297   40                             -14.13
 4.01 Init + 101659 101994  336  0  0   84  111   150 0.117  14.36
 4.02 Intr + 105845 105956  112  1  1   60   54    74 0.207   1.15
 4.03 Intr + 106336 106407   72  2  0   74   48    92 0.680   3.18
 4.04 Intr + 108813 108889   77  1  2   46   96    56 0.778   1.43
 4.05 Intr + 110362 110453   92  0  2   67   90    30 0.810  -0.11
 4.06 Intr + 115125 115285  161  0  2   59   46   156 0.840   8.13
 4.07 Intr + 117037 117152  116  2  2   65  100    97 0.996   8.77
 4.08 Intr + 118386 118455   70  2  1   95   57    64 0.269   2.75
 4.09 Intr + 135849 135924   76  1  1  100   86   -35 0.022  -3.93
 4.10 Intr + 136167 136346  180  0  0   86   16    96 0.247   1.18
 4.11 Term + 136875 137082  208  0  1  120   44   119 0.393   7.51
 4.12 PlyA + 137542 137547    6                               1.05

 5.09 PlyA - 137912 137907    6                               1.05
 5.08 Term - 139498 139322  177  1  0   75   33   124 0.156   3.29
 5.07 Intr - 145309 145175  135  1  0   60   81    52 0.155   2.46
 5.06 Intr - 148543 148400  144  1  0   74   90    11 0.113   0.38
 5.05 Intr - 152991 152682  310  2  1   44   86   144 0.176   6.12
 5.04 Intr - 159372 159235  138  2  0  106    6   186 0.025  11.78
 5.03 Intr - 162518 162396  123  1  0  101   61   145 0.992  12.80
 5.02 Intr - 163344 163217  128  0  2   68   69    70 0.601   2.58
 5.01 Init - 163447 163442    6  1  0   95   86     0 0.536   1.55
 5.00 Prom - 170014 169975   40                              -8.56

 6.00 Prom + 172228 172267   40                             -10.05
 6.01 Init + 172918 173092  175  0  1   99   23   108 0.584   2.96
 6.02 Intr + 173659 173832  174  2  0   85   77   107 0.942   9.21
 6.03 Intr + 187256 187332   77  0  2   84   96    56 0.538   5.23
 6.04 Intr + 188369 188511  143  1  2  113  100   230 0.746  25.85
 6.05 Intr + 189233 189337  105  2  0   60   50   140 0.527   6.73
 6.06 Term + 193316 193445  130  2  1   89   49    68 0.372   0.65
 6.07 PlyA + 194580 194585    6                               1.05

 7.06 PlyA - 196843 196838    6                               1.05
 7.05 Term - 198150 198044  107  1  2  113   34    51 0.584   0.77
 7.04 Intr - 198686 198596   91  2  1   50  107   112 0.998   8.87
 7.03 Intr - 198901 198830   72  1  0   97   75   154 0.999  14.60
 7.02 Intr - 199196 199077  120  2  0   98   85    94 0.915  10.79
 7.01 Intr - 199787 199688  100  1  1  104   55    64 0.964   4.81

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 159372 159210 163 2 1 106 49 198 0.974 15.11 S.002 Term + 173862 174064 203 1 2 70 42 95 0.961 0.65 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582r:8695955_8896591|GENSCAN_predicted_peptide_1|169_aa MPSLQNHLDPLTSHVQPTSFSTFARELHFILQGHKEDYSRNYMQNDWLGFLKRQCLLKGK RAEKWFQFFHRLGNYLPASNDVFVLPGFLIEELPPRALGMKRDGSTAPAESLHIASAQYL MMLRGHSKLIESCNSQGYNDCSNNVKWQTLDQKATHLSFVKALEAVVVV >gi568815582r:8695955_8896591|GENSCAN_predicted_CDS_1|510_bp atgcccagcctgcaaaatcacttggaccctctgacatcacatgtccaaccgacctctttc tccacatttgcccgtgagctgcacttcattttgcaaggacataaagaagactattcacgc aactacatgcaaaatgactggctgggattcctaaaacgccaatgtcttttaaaaggcaaa agggctgagaaatggttccagttttttcacaggctgggtaactacctgcctgctagcaac gatgtgttcgttctccctggattcctgattgaggagctgcctcctagggctctggggatg aaacgagatggttctacagcaccagctgagagcctgcacatagcaagtgcccagtacttg atgatgcttagaggccacagcaaacttatagagagctgcaactctcagggatacaatgac tgctccaacaatgtgaaatggcagactctggaccagaaggccacacacctgagttttgtc aaagcactggaagccgtggtagttgtctag >gi568815582r:8695955_8896591|GENSCAN_predicted_peptide_2|500_aa MASMLLAQRLACSFQHSYRLLVPGSRHISQAAAKVDVEFDYDGPLMKTEVPGPRSQELMK QLNIIQNAEAVHFFCNYEESRGNYLVDVDGNRMLDLYSQISSVPIGYSHPALLKLIQQPQ NASMFVNRPALGILPPENFVEKLRQSLLSVAPKGMSQLITMACGSCSNENALKTIFMWYR SKERGQRGFSQEELETCMINQAPGCPDYSILSFMGAFHGRTMGCLATTHSKAIHKIDIPS FDWPIAPFPRLKYPLEEFVKENQQEEARCLEEVEDLIVKYRKKKKTVAGIIVEPIQSEGG DNHASDDFFRKLRDIARKHGCAFLVDEVQTGGGCTGKFWAHEHWGLDDPADVMTFSKKMM TGGFFHKEEFRPNAPYRIFNTWLGDPSKNLLLAEVINIIKREDLLNNAAHAGKALLTGLL DLQARYPQFISRVRGRGTFCSFDTPDDSIRNKLILIARNKGVVLGGCGDKSIRFRPTLVF RDHHAHLFLNIFSDILADFK >gi568815582r:8695955_8896591|GENSCAN_predicted_CDS_2|1503_bp atggcctccatgttgctcgcccagcgcctggcctgcagcttccagcacagctaccgcctg ctggtgcctggatccagacacattagtcaagctgcagccaaagtcgacgttgaatttgat tatgatgggcctctgatgaagacggaagtcccagggcctagatctcaggagttaatgaaa cagctgaatataattcagaatgcagaggctgtgcattttttctgcaattacgaagagagc cgaggcaattacctggttgatgtggacggcaaccgaatgctggatctttattcccagatc 
tcctctgtccccataggttacagccaccccgccctgctgaaactcatccaacagcctcaa aatgcgagcatgtttgtcaacagacccgccctcggaatcctgcctccggagaactttgtg gagaagctccggcagtccttgctctcggtggctcccaaagggatgtcccagctcatcacc atggcctgcggctcctgctccaatgaaaacgccttaaagaccatcttcatgtggtaccgg agcaaggaaagagggcagaggggcttctcccaggaggagctggagacgtgcatgattaac caggcccctggctgccccgactacagcatcctctccttcatgggcgcgttccatgggagg accatgggttgcttagcgaccacgcactctaaagccattcacaagatcgacatcccttcc tttgactggcccatcgcaccgttcccacggctgaaataccctctggaagagtttgtgaaa gagaaccaacaggaggaggcccgctgtctggaagaggtggaggatctgattgtgaaatat cggaaaaagaagaagacggtggccgggatcatcgtggagcccatccagtccgagggtgga gacaaccacgcatccgatgacttctttcggaagctgagagacatcgccaggaagcatggc tgcgccttcttggtggacgaggtccagaccggaggaggctgcacgggcaagttctgggcc catgagcactggggcctggatgacccagcagacgtgatgaccttcagcaagaagatgatg actgggggcttcttccacaaggaggagttcaggcctaatgctccctaccggatcttcaac acctggctgggggacccgtccaagaacctgttgctggctgaggtcatcaacatcatcaag cgggaggacctgctaaataatgcagcccatgccgggaaggccctgctcacaggactgctg gacctccaggcccggtacccccagttcatcagcagggtgagaggacgaggcaccttttgc tccttcgatactcccgatgattccatacggaataagctcattttaattgccagaaacaaa ggtgtggtgttgggtggctgtggtgacaaatccattcgtttccgtcccacgctggtcttc agggatcaccacgctcacctgttcctcaatattttcagtgacatcttagcagacttcaag taa >gi568815582r:8695955_8896591|GENSCAN_predicted_peptide_3|156_aa MFYRFDAIRTFGFLSRLKLAQTALTVVALPPGYYLYSQGLLTLNTVCLMSGISGFALTML CWMSYFLRRLVGILYLNESGTMLRVAHLNFWGWRQDTYCPMADVIPLTETKDRPQEMFVR IQRYSGKQTFYVTLRYGRILDRERFTQVFGVHQMLK >gi568815582r:8695955_8896591|GENSCAN_predicted_CDS_3|471_bp atgttttaccgttttgatgccatcagaaccttcgggttcctgtcacgactgaagttggca cagactgccctgacagtggtagctttgccaccaggctattacttgtactcccagggcctc ctcactctcaacaccgtgtgcctcatgagtgggatatcgggctttgccctgaccatgctg tgctggatgagctatttcttacggagactggttggtatcctgtatctgaatgagtctggc accatgctgcgggtggcccatctgaacttctggggctggcggcaggacacatactgtccc atggcagatgtgattcccctgacagaaaccaaggaccggcctcaggagatgtttgtgcgt atccagcggtacagtgggaaacagaccttctacgtcaccctgcgctatggacgcatcctg gacagagagcgtttcacacaggtgtttggggtacatcagatgctcaagtga 
>gi568815582r:8695955_8896591|GENSCAN_predicted_peptide_4|499_aa MAPPPAAGSKSHRPPEPLRAHARMYKAGVICVAPWELRSRNRGCRYSQALWERSPLLFPT CPATQRPNPEVPGRVPRANVSCKVRLETGDMAAPGPALCLFDVDGTLTAPRQKITKEMDD FLQKLRQKIKIGVVGGSDFEKVQEQLGNDESPAQDPEKPERLIDPFTGESDLIVVEKYDY VFPENGLVAYKDGKLLCRQNIQSHLGEALIQDLINYCLSYIAKIKLPKKRGTFIEFRNGM LNVSPIGRSCSQEERIEFYELDKVRLSEISLVNGWVYGNKIWPGGQISFDVFPDGWDKRY CLRHVENDGYKTIYFFGDKTMPAGGVGYLAISRPVSGFLAILHLADFGTIRDLNINARIS LSPGLSLVPVSYSGMQARHSPKKAAPQEGAHHALRVTKKEADPWRERKEGLRQVVFVMQM LLRAVRLSESGVFSDGHMACALTSLRCQGSLTLQAWQPFPPFAQQGAQELSVDDWSPQGS CDVVLVCAVSPAAKAMPGG >gi568815582r:8695955_8896591|GENSCAN_predicted_CDS_4|1500_bp atggccccaccacctgctgccggaagtaaatcccaccggcccccggaacccctaagagcg catgctcgaatgtacaaggcgggcgtgatctgcgttgcaccctgggagttgcggtccagg aatcgtggctgccgctactcccaggcgttatgggaacggagtcccctcctcttcccgacg tgccctgcgactcagcggccgaacccggaagttccgggccgagttcctcgtgccaacgtg tcttgtaaggtgcggctagaaactggggacatggcagcgcctggcccagcgctctgcctc ttcgacgtggatgggaccctcaccgccccgcggcagaaaattaccaaagaaatggatgac ttcctacaaaaattgaggcagaagatcaaaatcggagtggtaggcggatcggactttgag aaagtgcaggagcaactgggaaatgatgaatctccagcacaggaccccgaaaaaccagag aggcttattgaccccttcactggagagtctgacttgatagtggttgaaaaatacgattat gtgtttccagaaaatggcttggtagcatacaaagatgggaaactcttgtgtagacagaat attcaaagtcatctgggtgaggccctaatccaagatttaatcaactactgtctgagctac attgcgaaaattaaactcccgaagaagaggggtactttcattgaattccgaaatgggatg ttaaacgtgtcccctattggaagaagctgcagccaagaagaacgcattgagttctacgaa ctcgataaagtacgtctttctgaaatatctttggtgaatggctgggtttatggaaataag atatggcctggaggccagatcagctttgatgtctttcctgatggatgggacaagagatac tgtctgcgacatgtggaaaatgacggttataagaccatttatttctttggagacaaaact atgccagctgggggtgttggttaccttgccatcagcaggcccgtgtcaggctttctcgcc atcctgcacctagcagactttgggactattagggatttaaatattaatgcgcggatttca ctgagccctggcctctctttggtgcctgtcagctactctggtatgcaagctcgacacagc cccaagaaggctgcaccccaggaaggagcccaccatgctctgagagtcacaaagaaagaa gcagacccttggcgggagagaaaggaaggcctgaggcaggttgtgtttgtgatgcagatg ctcctgcgagccgtgcggctctctgaaagcggcgtcttctctgatggccacatggcctgt 
gccctcacgtccctcagatgccagggttccctcaccctccaggcctggcagcccttccct ccctttgctcaacaaggcgcccaagagctgtctgttgatgactggtctccccagggcagc tgtgatgtggttttggtttgcgccgtatccccagctgctaaagcaatgcctggcgggtag >gi568815582r:8695955_8896591|GENSCAN_predicted_peptide_5|386_aa MGPPTHQASVGLLDTPRSRERSPSPLRGNVVPSPLPTRRTRTFSATVRASQGPVYKGVCK CFCRSKGHGFITPADGGPDIFLHISDVEGEYVPVEGDEVTYKMCSIPPKNEKLQAVEVVI THLAPGTKHETWKVLFHRVLAETRDPSSFTKNVQKMKRVRDADLAVAALSRLRGKVWVKA WVSADRRAAPLTVLWLGSRSIPEKLEANASRKPARGNQKVLLVKSDIPRWFLWTWGQEPP NPFSFTLSGKSRFSGGEASTPNSYLCAPIPYFHAPTPFPLFWRSDNGPAFTSQITQAVSQ ALSIQWNLHIPYSPQSSGKVERTNGLLKLDDPDTHQAQQITWAVRPQGFTDSPHYFSQAQ ISSPSVTYLGIILIKTHVLSLLIVSD >gi568815582r:8695955_8896591|GENSCAN_predicted_CDS_5|1161_bp atggggccccccacccatcaagcttcagtcgggctgctggacacccctcggagccgtgag cgctcaccatcccctctgcggggcaacgtggtcccaagcccactgcccactcgccggacg aggaccttctcggcgacggtgcgggcttcacagggccccgtctacaaaggagtctgcaaa tgcttctgccggtccaagggccatggcttcattactccagctgatggcggccccgacatc ttcctgcacatctctgatgtggaaggggagtatgtcccagtggaaggcgacgaggtcacc tataaaatgtgctccatcccacccaagaatgagaagctgcaggccgtggaggtcgtcatc actcacctggcaccaggcaccaagcatgagacctggaaagttctcttccaccgtgttcta gctgagacccgggatccttccagtttcacgaagaacgtgcagaagatgaaaagagtgagg gacgcggacctagctgtagctgcactcagccgcctccgtggtaaagtctgggtgaaggcc tgggtgagtgcagataggagagcagctcctcttactgttctctggctgggctctcgctcc atccctgagaaactagaggcgaacgcctccaggaagccagccaggggcaaccagaaagtc cttttggtgaagtctgacatcccgcgttggtttctctggacctgggggcaagaacccccc aaccccttctccttcacccttagcggcaagtcccgcttttctgggggagaggcaagtacc ccaaactcgtatctctgtgccccaatcccttatttccatgccccaacccctttcccgctt ttctggaggtctgataacggaccagcctttactagtcaaatcacccaagcagtttctcag gctcttagtattcagtggaaccttcatatcccttacagtcctcaatcttcaggaaaggta gaacggactaatggtcttttaaagttagatgaccctgacacccatcaggctcagcaaatt acctgggctgtacggccacaaggcttcacagacagcccccactacttcagtcaagcccaa atttcatccccatctgttacctatctcggcataattctcataaaaacacacgtgctctcc ctgttgatcgtgtctgattaa >gi568815582r:8695955_8896591|GENSCAN_predicted_peptide_6|267_aa 
MEAEEAEAPPLRLPGPRARGAYPGVPEKSSLPCNPFCSASRFAPTESAELGRGRGPRGGE ETEAQRGDQPMGTASREQPLKASLDPGTRVSLVFEGRGEGERSPELARDNRAPGGGDPYP AGPPPPYGPPHQMYFEGPQVVQLYAGMSVVGTSMPVQAVCPYCGNRIITVTTFVPGALTW LLCTTLFLFGYVLGCCFLAFCIRSLMDVKHSCPVCQRELFYYHRLAGLFPSFGACQRGGP PLADELGKLLNPHLKLGTAPILPRMEG >gi568815582r:8695955_8896591|GENSCAN_predicted_CDS_6|804_bp atggaggcagaggaggccgaggccccacccctgcggctcccggggccgcgggccaggggc gcctatcctggggtccccgagaagagctccctaccctgcaatccgttctgctccgccagc cgcttcgctccgactgagagcgcggagctggggagggggcgtggcccgagggggggagag gaaactgaggcacagagaggcgaccagcccatgggcacggccagtcgagagcagcccctg aaggcgagcctagatcccgggaccagggtcagcttggtatttgaggggcgcggggaagga gagaggtccccggagctcgcccgggacaacagggcccccggaggcggcgatccttatcct gctggtccccctcctccctatggtcctcctcaccagatgtactttgaaggcccccaagtt gttcagctgtatgccggcatgtccgtggtggggacgtccatgccggtgcaggccgtgtgt ccctactgtggaaaccgcatcatcacggtgacgacctttgtcccgggtgccctcacctgg ctgctgtgtaccaccctcttcctgttcgggtacgtcctgggctgctgcttcctggcattc tgcataaggagcctaatggacgtgaagcactcgtgtcccgtgtgtcagcgcgagctcttc tactaccaccgcctggccggcctcttccccagcttcggggcctgtcagaggggcggccca ccactggcagatgagcttggcaagttgctgaaccctcatttgaaactggggacagccccc atcctaccccgcatggagggctga >gi568815582r:8695955_8896591|GENSCAN_predicted_peptide_7|163_aa XLLEIVSYKIIGVHQEDELLECLSPATSRTFRIEEIPLDQVDIDKENEMLVTVAHFHKEV FGTFGIPFLLRIHQGEHFREVMKRIQSLLDIQEKEFEKFKFAIVMMGRHQYINEDEYEVN LKDFEPQPGNMSHPRPWLGLDHFNKAPKRSRYTYLEKAIKIHN >gi568815582r:8695955_8896591|GENSCAN_predicted_CDS_7|492_bp nngctgctagaaattgtaagctacaaaatcattggtgttcatcaagaagatgaactatta gaatgtttatctcctgcaacgagccggacgtttcgaatagaggaaatccctttggaccag gtggacatagacaaagagaatgagatgcttgtcacagtggcgcatttccacaaagaggtc ttcggaacgttcggaatcccgtttttgctgaggatacaccagggcgagcattttcgagaa gtgatgaagcgaatccagagcctgctggacatccaggagaaggagtttgagaagtttaaa tttgcaattgtaatgatgggccgacaccagtacataaatgaagacgagtatgaagtaaat ttgaaagactttgagccacagcccggtaatatgtctcatcctcggccttggctagggctc gaccacttcaacaaagccccaaagaggagtcgctacacttaccttgaaaaggccattaaa atccataactga