GENSCAN 1.0 Date run: 3-Nov-116 Time: 15:31:00 Sequence gi568815589f:124192941_124450944 : 258004 bp : 49.52% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 23650 23688 39 0 0 104 98 -19 0.217 0.98 1.02 Intr + 25785 25943 159 2 0 81 92 81 0.698 7.98 1.03 Term + 34975 35100 126 0 0 -27 43 282 0.518 10.48 1.04 PlyA + 37228 37233 6 1.05 2.00 Prom + 40869 40908 40 -5.56 2.01 Init + 49433 49505 73 1 1 45 50 140 0.070 7.13 2.02 Intr + 64150 64347 198 2 0 46 58 87 0.202 0.82 2.03 Intr + 65009 65145 137 0 2 3 117 178 0.818 12.49 2.04 Term + 66014 66031 18 1 0 121 46 5 0.828 -2.08 2.05 PlyA + 66697 66702 6 1.05 3.07 PlyA - 67353 67348 6 1.05 3.06 Term - 70184 69967 218 0 2 82 55 110 0.422 4.41 3.05 Intr - 72009 71921 89 2 2 59 50 81 0.254 1.01 3.04 Intr - 74829 74712 118 1 1 125 58 -4 0.193 -0.18 3.03 Intr - 81812 81681 132 0 0 114 49 84 0.852 7.72 3.02 Intr - 82000 81845 156 2 0 46 65 72 0.500 0.68 3.01 Init - 83344 83248 97 1 1 65 91 20 0.334 0.47 3.00 Prom - 88344 88305 40 -4.96 4.08 PlyA - 88905 88900 6 1.05 4.07 Term - 92742 92626 117 0 0 126 34 106 0.633 7.54 4.06 Intr - 94108 94035 74 2 2 113 99 12 0.463 3.93 4.05 Intr - 97648 97545 104 0 2 82 77 45 0.389 2.62 4.04 Intr - 97984 97862 123 0 0 99 75 99 0.396 9.40 4.03 Intr - 102317 102244 74 2 2 71 56 71 0.469 0.40 4.02 Intr - 103885 103756 130 0 1 52 36 113 0.675 3.20 4.01 Init - 104283 104123 161 0 2 95 57 50 0.590 1.76 4.00 Prom - 106554 106515 40 -1.86 5.00 Prom + 106609 106648 40 -10.84 5.01 Init + 109053 109114 62 2 2 73 115 47 0.268 4.74 5.02 Intr + 118142 118286 145 2 1 65 72 32 0.137 -0.42 5.03 Intr + 119569 119709 141 0 0 70 99 253 0.442 25.15 5.04 Intr + 120983 121045 63 1 0 75 105 95 0.946 8.81 5.05 Intr + 128519 128629 111 1 0 91 105 185 0.984 21.08 5.06 Intr + 133390 133498 109 0 1 86 94 153 0.966 15.56 5.07 Intr + 134398 134593 196 2 1 112 57 165 0.129 14.27 5.08 Intr + 143611 143696 86 1 2 106 81 45 0.860 5.06 5.09 Intr + 146631 146725 95 1 2 71 105 201 0.960 19.78 5.10 Intr + 152885 152998 114 1 0 58 76 43 0.598 0.74 5.11 Intr + 153881 154018 138 1 0 61 31 109 0.785 3.16 5.12 Intr + 154769 154882 114 1 0 100 113 164 0.999 20.74 5.13 Term + 157897 158007 111 0 0 93 53 151 0.993 10.66 5.14 PlyA + 158938 158943 6 1.05 6.08 PlyA - 160082 160077 6 1.05 6.07 Term - 160769 160658 112 1 1 120 42 62 0.499 2.83 6.06 Intr - 161613 161479 135 2 0 106 -28 134 0.390 3.28 6.05 Intr - 162031 161943 89 1 2 102 65 29 0.798 0.77 6.04 Intr - 163033 162652 382 0 1 -5 96 152 0.409 1.61 6.03 Intr - 163975 163824 152 1 2 90 109 302 0.999 31.46 6.02 Intr - 168479 168346 134 0 2 59 6 106 0.848 -0.14 6.01 Init - 168933 168876 58 0 1 58 74 48 0.794 1.88 6.00 Prom - 169192 169153 40 -4.26 7.09 PlyA - 169903 169898 6 1.05 7.08 Term - 171062 170919 144 2 0 61 48 142 0.571 5.41 7.07 Intr - 191716 191658 59 2 2 97 87 56 0.001 5.00 7.06 Intr - 212492 212377 116 1 2 48 92 98 0.776 6.29 7.05 Intr - 218914 218840 75 0 0 102 78 14 0.437 0.43 7.04 Intr - 219552 219412 141 2 0 41 111 148 0.957 11.87 7.03 Intr - 221065 220968 98 1 2 73 83 35 0.982 0.31 7.02 Intr - 221995 221902 94 0 1 91 121 111 0.996 14.67 7.01 Init - 222485 222424 62 2 2 101 55 102 0.976 8.92 7.00 Prom - 224236 224197 40 -8.56 8.00 Prom + 224853 224892 40 -6.06 8.01 Init + 225661 225851 191 0 2 82 81 86 0.104 5.78 8.02 Intr + 235819 236139 321 1 0 71 56 162 0.032 6.28 8.03 Intr + 245210 245399 190 2 1 61 76 72 0.031 2.89 8.04 Term + 255220 255477 258 0 0 73 41 110 0.076 0.35 8.05 PlyA + 256101 256106 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 199215 199538 324 2 0 50 43 188 0.815 6.50 S.002 Term + 239673 239706 34 0 1 119 43 62 0.808 1.86 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:124192941_124450944|GENSCAN_predicted_peptide_1|107_aa MAWQGEGTSGDHTRQHSRPQAWTSRLQQGPWPPPIALEAPEKRPFSAAAAGQTPPLEKKE VKVLWWKKEEEEEEEEKEKEKKKKKKKKKKKKKKKKRRRSTRRANVL >gi568815589f:124192941_124450944|GENSCAN_predicted_CDS_1|324_bp atggcctggcagggcgaagggacttcaggagatcatacaaggcagcactctaggccccag gcctggacctcaaggctgcagcagggcccctggccgccccctatagccttggaagcccct gagaagaggcccttcagtgcagctgcagcaggtcagacgcctcctctggagaagaaggaa gtgaaagtgctgtggtggaagaaggaggaggaggaggaggaggaggagaaggagaaggag aagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaggaggaggagc actagaagagcaaatgtattatga >gi568815589f:124192941_124450944|GENSCAN_predicted_peptide_2|141_aa MPLVMSGDSEIEMERRKRDTIEEGYHGQLKHFHSFGKQKPDGHSPCAQYSAAPKQAVSVP GSSPPGVQHSFECGGEAAGVETAQPGAAALGGGGGGGTELTGVRPLRRKLVWDAPLQPPA GQRTGPPAAAEPARAPPGLGG >gi568815589f:124192941_124450944|GENSCAN_predicted_CDS_2|426_bp atgcccctggtgatgagtggagattcagaaattgaaatggagaggcggaagcgggacacg attgaggagggatatcacggccagttgaagcacttccactcatttggcaaacagaagcca gacgggcattcaccctgtgcccagtacagtgctgcgcccaagcaggcagtcagtgtccct ggctccagccctcctggggtgcagcacagttttgaatgtggaggggaggcagcaggggtg gaaacagcccagccaggagcggccgctctcggcggtggcggcggcggcggaaccgagctg acgggcgtgcggccgctgcgccgcaaactcgtgtgggacgcaccgctccagccgcccgcg ggccagcgcaccggtcccccagcggcagccgagcccgcccgcgcgccgccaggcctgggg ggctga >gi568815589f:124192941_124450944|GENSCAN_predicted_peptide_3|269_aa MIPETLEPLKPLGSISSDLQPRILYSLIQGSPADIANQCQSPKPSQHGISGSPYQSIRAG TGEDNDCQPCYKWTCRMASTVRQTGSQTPEILSDVQVEYMIFSIVLDQHSYMKAKTLQMS AHFLGELGGAKKACGAQPSLAEWHHETWSQGFLISGSQWPAQGRPSTWQQAGGFAPFASP ELTFLILSGLSGKALMPGVFPTSPSFKDVEDKEQERAGIREGACGLIPILPLTCSKNAPL EHGEVLLKGISVHPGQHDPPRKPQPNSEA >gi568815589f:124192941_124450944|GENSCAN_predicted_CDS_3|810_bp atgattcctgagactctggagccactgaagcccctggggagcatttcttctgacttgcag cccagaattctttactccctgatccaggggtctccagcagacattgctaatcaatgccag tctccaaagccttctcaacatggcatctcgggcagcccctatcaatctatcagagccggc actggagaggacaatgattgccagccttgctacaaatggacatgcagaatggcaagcaca gtgcgtcagaccggatcccagacccctgaaattctatcagatgtccaggtggaatacatg atattcagcattgtcctcgatcagcattcttacatgaaagcaaaaaccctccagatgtct gctcactttcttggggaattgggaggggccaagaaggcatgtggtgctcagccgtccctt gctgaatggcatcatgaaacctggtcacagggcttcctcatttcagggtcccagtggcca gctcaggggcggcccagcacctggcaacaggcaggtggctttgcccccttcgcctccccg gagctcacgttcctcatccttagtggcctttcagggaaagctctgatgccaggtgtcttc ccaacaagcccaagtttcaaagacgttgaggacaaggaacaagagagggcaggcattcga gaaggggcctgtggactaattcccatactgccactcacctgcagtaagaacgcgcctctg gaacatggagaagttcttcttaaagggatctctgttcacccaggtcaacatgacccaccc cgcaagcctcagcctaattctgaagcatga >gi568815589f:124192941_124450944|GENSCAN_predicted_peptide_4|260_aa MGPEAGTCPGSSLPQVTQLVNDTAGAVGPKIAALCPTQHGLFRTALEQTSSSSTGKSFLC LLSNATLHGDWWLIVARPVKHDLRHQEGSVQGPCLEQLSTCKAKHPARILQLHDSGSLAA LRSPGAGPGTATHKLKRARSCSGKGPMAAASGGARSRGALDNRTSHFCPGQTTPVPHLQL IATGTDTGIQQLRPWRTPRGAYHSRGAAGGHINTCCAPAGSQALHLRLGHSGASGPVFGD PEPSSSISAVARICCLFHEL >gi568815589f:124192941_124450944|GENSCAN_predicted_CDS_4|783_bp atggggcctgaggcagggacttgcccagggtcatccctgccccaggtaacacagctggta aatgacacagctggggctgttggacccaagattgctgctctatgtcctacgcagcatggc ctctttaggactgctctggagcaaacgtcttcatcatctacggggaagtcattcctgtgc ctcctgtcaaatgccacacttcatggagactggtggctcatcgtggcccggccagtgaag cacgacctcagacaccaggagggctctgtccaaggaccctgcttggagcagctctccacc tgcaaagccaagcatcctgcccggattctgcagctccatgacagcggaagcctggctgct ctaaggtccccaggagcaggccctggaactgctacccacaagctaaaaagagcacgaagc tgctcaggaaaaggacccatggctgctgcttctggcggggccaggtcacggggagctttg gacaacagaacgagccacttctgccctgggcaaaccacccctgtgccccatctgcagctc atcgccactgggactgacacaggcatacagcagctgaggccctggaggacgcccaggggg gcatatcacagccggggggcagcaggtggacatatcaacacctgctgtgccccagctgga tcccaggctttgcatctgcggcttggacacagtggagccagcgggcctgtgtttggagat cctgaaccatcctcctcaatctcggctgttgctcggatctgctgcctcttccatgaactc tga >gi568815589f:124192941_124450944|GENSCAN_predicted_peptide_5|494_aa MEGVPTTSATPWGLCILLTHRHLRLRRSWSGQQFLVFPGGEAAVRRASPVVLGLVLMPSA RVLAPKDASRHPNTLSFRCSLADFQIEKKIGRGQFSEVYKATCLLDRKTVALKKVQIFEM MDAKARQDCVKEIGLLKQLNHPNIIKYLDSFIEDNELNIVLELADAGDLSQMIKYFKKQK RLIPERTVWKYFVQLCSAVEHMHSRRVMHRDIKPANVFITATGVVKLGDLGLGRFFSSET TAAHSLGKGDLSVPQQPPAVLVTMQGDANILPTCVWVLDPLFLMSQATSQPAFATLGLQL TTETVGTPYYMSPERIHENGYNFKSDIWSLGCLLYEMSFENARLTLCTRESHFNSWTCVT WVVHLAGSQPAPGQAFRSPVAVAGLQGRTGAAGLTSSKAVLALLVVPPPCASPSLATGLW MAALQSPFYGDKMNLFSLCQKIEQCDYPPLPGEHYSEKLRELVSMCICPDPHQRPDIGYV HQVAKQMHIWMSST >gi568815589f:124192941_124450944|GENSCAN_predicted_CDS_5|1485_bp atggagggagttccaacaacctctgccacaccctggggcctgtgcatcctcctgacccac aggcatctcaggctgagacgcagctggtccggccagcagttcctggtgtttccaggtggg gaagcagctgtcagaagggccagtcctgtagttctcggtttggtcttgatgccttccgcc agggtgctggctcctaaagacgcctcgaggcatcccaacacgctgtcttttcgctgctcg ctggcggacttccagatcgaaaagaagataggccgaggacagttcagcgaggtgtacaag gccacctgcctgctggacaggaagacagtggctctgaagaaggtgcagatctttgagatg atggacgccaaggcgaggcaggactgtgtcaaggagatcggcctcttgaagcaactgaac cacccaaatatcatcaagtatttggactcgtttatcgaagacaacgagctgaacattgtg ctggagttggctgacgcaggggacctctcgcagatgatcaagtactttaagaagcagaag cggctcatcccggagaggacagtatggaagtactttgtgcagctgtgcagcgccgtggag cacatgcattcacgccgggtgatgcaccgagacatcaagcctgccaacgtgttcatcaca gccacgggcgtcgtgaagctcggtgaccttggtctgggccgcttcttcagctctgagacc accgcagcccactccctaggtaagggggacctgtctgtgccccagcagcccccagcggtc ctggtgaccatgcagggagacgcaaacattctccccacgtgtgtttgggtcctggatccc ctcttcctcatgtcacaggccacatctcagccagcctttgccacgctgggactccagctc accacagagactgtggggacgccctactacatgtcaccggagaggatccatgagaacggc tacaacttcaagtccgacatctggtccctgggctgtctgctgtacgagatgtcctttgaa aatgcccgattaaccctttgcacccgagagtctcattttaattcctggacatgcgtcacc tgggtggtgcatttggcaggctcccagcccgctcctggtcaggcctttcgctcccccgtg gctgtggccggccttcagggcagaaccggtgccgcaggcctcacctcatcaaaggcggtc ttggccctgctggtggtgcctcctccgtgtgccagcccaagcctcgccacaggcctgtgg atggcagccctccagagccccttctatggagataagatgaatctcttctccctgtgccag aagatcgagcagtgtgactaccccccactccccggggagcactactccgagaagttacga gaactggtcagcatgtgcatctgccctgacccccaccagagacctgacatcggatacgtg caccaggtggccaagcagatgcacatctggatgtccagcacctga >gi568815589f:124192941_124450944|GENSCAN_predicted_peptide_6|353_aa MRKKFLGSTILDLHYSGCRALGYSCGDHYVADVDLFVPFTSAAHLSCSGKWNCTLESRDT LCDAEEEAKNLVSEAIAAGIFNDLGSGSNIDLCVISKNKLDFLRPYTVPNKKGTSPEEVS TTFCRLASISHCTHNSTSGVIHKEGMERSEPAGIHCGLHSTLRKLSGLLMGWGSTGPENA GTGVGECWWLAWWLPANQSVLEMELNQMTSNEPTKSLGPLAGRQATFQAQSCVGEPGFAE LQPSSTRSCSPLARKAEALNACRKNRALSPQCPCVQGRGWVFREHKAMAQPAAFWQFLKD SKFDRDITPAGARAEEQLGRYRCEKGTTAVLTEKITPLEIEVLEETVQTMDTS >gi568815589f:124192941_124450944|GENSCAN_predicted_CDS_6|1062_bp atgcggaaaaagtttcttggcagtactattttggacctgcactatagtggatgccgagca ctcgggtacagctgtggtgaccattacgtggccgacgtggacctctttgtgccttttaca agtgctgcccatttgtcatgtagcggtaaatggaactgcacattggagtccagggacaca ttgtgcgatgcggaggaggaagccaagaatctggtgagcgaagccatcgcagctggcatc ttcaacgacctgggctccggaagcaacattgacctctgcgtcatcagcaagaacaagctg gattttctccgcccatacacagtgcccaacaagaaggggaccagccccgaggaagtgagc accaccttctgcaggcttgcttccatctcccactgcacacacaactccacatcaggagta attcacaaggaaggaatggaacgttcagaaccagcaggtattcactgtgggctgcactca accttgcggaagctcagtgggctcctgatggggtggggcagcacggggccagagaacgct ggcactggggttggggagtgttggtggctggcttggtggcttccagcaaatcagtctgtc ttggaaatggagctaaaccagatgacctctaatgagcccacgaagtccctcgggccttta gcaggaaggcaagccacctttcaggcccagagctgtgttggggaaccgggctttgccgag ctccagccctccagcacacggtcttgcagtccactggcacgtaaggcggaagctctgaat gcatgccgcaaaaatcgggcactgtcaccacagtgcccgtgtgtccaggggagaggatgg gtgttccgggagcacaaggccatggcacagccagcagccttctggcagttcctgaaggac agcaagtttgacagagatattaccccagctggggctagagcagaggaacagcttggccgg tacaggtgtgagaaagggactactgcagtcctcactgagaaaatcactcctctggagatt gaggtgctggaagaaacagtccaaacaatggacacttcctga >gi568815589f:124192941_124450944|GENSCAN_predicted_peptide_7|262_aa MAAVSVYAPPVGGFSFDNCRRNAVLEADFAKRGYKLPKVRKTGTTIAGVVYKDGIVLGAD TRATEGMVVADKNCSKIHFISPNIYCCGAGTAADTDMTTQLISSNLELHSLSTGRLPRVV TANRMLKQMLFRTYAKALYARFHPFNSEIGTISVSYLYQGYIGAALVLGGVDVTGPHLYS IYPHGSTDKLPYVTMGSGSLAAMAVFEDKFRPDMERHYSHTYEGHSLAQRVTRCPGLSSF WATLFQPIVAVQPLTSLPGCEF >gi568815589f:124192941_124450944|GENSCAN_predicted_CDS_7|789_bp atggcggctgtgtcggtgtatgctccaccagttggaggcttctcttttgataactgccgc aggaatgccgtcttggaagccgattttgcaaagaggggatacaagcttccaaaggtccgg aaaactggcacgaccatcgctggggtggtctataaggatggcatagttcttggagcagat acaagagcaactgaagggatggttgttgctgacaagaactgttcaaaaatacacttcata tctcctaatatttattgttgtggtgctgggacagctgcagacacagacatgacaacccag ctcatttcttccaacctggagctccactccctctccactggccgtcttcccagagttgtg acagccaatcggatgctgaagcagatgcttttcaggacctatgctaaagcgctttatgca cgatttcatccttttaactctgagatagggactattagcgtctcctatttgtatcaaggt tacattggtgcagccctagttttagggggagtagatgttactggacctcacctctacagc atctatcctcatggatcaactgataagttgccttatgtcaccatgggttctggctccttg gcagcaatggctgtatttgaagataagtttaggccagacatggagaggcattattctcat acttacgaagggcactctttggctcagagagtgacccgctgtccagggctgtcctcattc tgggctacccttttccagcccattgtggccgtgcagccgctgacatctcttcctggctgc gagttctga >gi568815589f:124192941_124450944|GENSCAN_predicted_peptide_8|319_aa MHTGRTPYEDEGRHWSDDPMGQGMPKVAIKPPEARRQVWNSLPHGSQKKQLCRHLALDLH PPEWSSEQHQRPPDVLYLRCQSARCPGLFTGSWPPCTGSKPWPWARLRQEAGRAVIGGAA MLSGQHRPGILPQQSPLSGSMLTFQACGDFLPNAEPAPGTNYLFPKKSIKRNPSSSPPTL LAIFGFFSFHPTTQVSRKFPTPGQLPGPDRVQVPCKLSAIMEQNPWKWRQEEKQAMSPRS AVAIAYPQPAGFVLGNKQPVKAARCKVDRWLLAANWTMPRLCGGEIRPAVAQTSPNCSGA SNQGPWMPCPSMVLDHLQE >gi568815589f:124192941_124450944|GENSCAN_predicted_CDS_8|960_bp atgcacacagggagaaccccatatgaagatgaaggtagacactggagtgatgatcctatg ggccaaggaatgccaaaagttgccatcaaaccaccagaagctaggcgccaggtctggaac agtctgcctcacggctctcagaagaagcaactttgcagacaccttgctttggacttgcac cctccagaatggagcagcgagcagcaccagcggccacccgacgtcctgtatctccgctgt caatctgcccgctgccctgggttgtttactggaagctggcctccctgcacaggttcaaag ccctggccctgggcacgtctgcggcaggaagcaggccgggcagtaattggcggtgcggcc atgctaagtggccagcaccgacccggtatcctcccccagcagtcgcctttgagtggctcc atgctgaccttccaggcctgcggggacttcctgccgaacgccgagccagctccaggaaca aattacttattccccaagaaatctattaaaaggaatccctccagctctcccccaactctt ctcgccatcttcggcttcttcagttttcaccccactactcaggtgtccagaaaattccct acacctggccagctcccggggcctgacagggtgcaggttccgtgtaaactttcagcaatc atggagcagaatccctggaagtggcggcaggaggagaagcaggcaatgagcccgaggtcc gcagtggctatcgcctacccacagcctgcaggctttgtccttggaaacaagcagcctgtc aaagctgcccgctgtaaagtggatagatggctcttggctgccaactggacaatgcccagg ctctgtggaggggagattaggccagccgtggcccaaaccagtcccaactgctccggggcc tcaaatcagggtccatggatgccctgcccctccatggtacttgaccatcttcaggagtga