GENSCAN 1.0 Date run: 5-Nov-116 Time: 13:06:19 Sequence gi568815575r:38049714_38278346 : 228633 bp : 39.43% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4500 4709 210 0 0 123 77 151 0.439 14.61 1.02 Intr + 22334 22449 116 2 2 54 91 89 0.583 4.97 1.03 Intr + 23877 23985 109 1 1 -10 78 122 0.938 -0.18 1.04 Intr + 26854 26988 135 1 0 47 110 85 0.599 5.36 1.05 Intr + 39733 39874 142 1 1 58 89 148 0.963 11.33 1.06 Intr + 44582 44711 130 1 1 17 94 107 0.765 3.55 1.07 Intr + 46421 46521 101 0 2 109 115 40 0.989 7.71 1.08 Intr + 52629 52721 93 2 0 127 67 104 0.998 11.44 1.09 Intr + 56880 57058 179 2 2 68 59 107 0.913 3.60 1.10 Intr + 58887 58986 100 0 1 60 94 95 0.985 6.49 1.11 Intr + 60608 60769 162 1 0 58 97 132 0.969 10.35 1.12 Intr + 70645 70753 109 0 1 92 75 120 0.984 10.04 1.13 Intr + 72367 72502 136 2 1 53 45 68 0.942 -2.29 1.14 Intr + 75585 75793 209 0 2 103 65 125 0.967 9.50 1.15 Intr + 76875 77006 132 1 0 125 25 119 0.409 9.00 1.16 Intr + 98913 99164 252 1 0 71 74 169 0.283 10.28 1.17 Term + 99363 99415 53 1 2 84 49 42 0.220 -3.49 1.18 PlyA + 99652 99657 6 1.05 2.10 PlyA - 99817 99812 6 -0.45 2.09 Term - 100181 99998 184 1 1 110 44 207 0.939 14.53 2.08 Intr - 104870 104749 122 2 2 78 91 108 0.787 8.47 2.07 Intr - 107316 107159 158 1 2 64 100 173 0.508 14.91 2.06 Intr - 110483 110304 180 0 0 88 96 208 0.949 20.52 2.05 Intr - 111341 111220 122 1 2 62 94 81 0.811 5.32 2.04 Intr - 115182 115056 127 1 1 73 80 168 0.914 13.32 2.03 Intr - 122344 122168 177 2 0 86 86 220 0.640 20.57 2.02 Intr - 124638 124447 192 1 0 81 92 201 0.989 18.34 2.01 Init - 127770 127623 148 0 1 95 69 18 0.279 0.90 2.00 Prom - 134189 134150 40 -5.45 3.04 PlyA - 134528 134523 6 1.05 3.03 Term - 143215 143046 170 2 2 63 50 135 0.005 4.26 3.02 Intr - 160928 160807 122 1 2 55 76 79 0.100 2.62 3.01 Init - 171079 170895 185 1 2 98 40 329 0.733 25.74 3.00 Prom - 172590 172551 40 -9.25 4.00 Prom + 174185 174224 40 -4.55 4.01 Init + 174446 174457 12 1 0 73 94 4 0.562 -0.31 4.02 Intr + 175771 175860 90 0 0 88 42 96 0.752 4.27 4.03 Term + 178160 178273 114 1 0 63 37 130 0.581 2.99 4.04 PlyA + 179456 179461 6 1.05 5.04 PlyA - 182315 182310 6 1.05 5.03 Term - 189338 189250 89 0 2 91 49 77 0.484 0.94 5.02 Intr - 189706 189573 134 0 2 60 96 55 0.168 2.87 5.01 Init - 210403 210336 68 1 2 71 90 63 0.204 5.40 5.00 Prom - 219323 219284 40 -5.05 6.04 PlyA - 219435 219430 6 1.05 6.03 Term - 220119 219913 207 0 0 71 36 93 0.379 -1.24 6.02 Intr - 223798 223673 126 1 0 47 115 109 0.621 9.46 6.01 Intr - 227059 226874 186 1 0 78 86 285 0.999 26.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:38049714_38278346|GENSCAN_predicted_peptide_1|789_aa XKLKNELLEAKRRSGKTQQEASRVCVHCHRNLGLIFDRGDPCQACSLRVCRECRVAGPNG SWKCTVCDKIAQLRIITGEWFFEEKAKRFKQVNVLGTDVVRQSILRRSPGAEEVQSQEQT RQDAEKSDTSPVAGKKASHDGPKRKGFLLSKFRSATRGEIITPKTDTGRSYSLDLDGQHF RSLKSPPGSDRGSTGSSDLNDQEPGPRTPKSSRSNGVTPGTQSSPAPSTRTVTSVISREY GFENSMDLAAIEGTSQELTKSHRRNTSGTPSIAVSGTSLSSDQSRSELDLSESFTEDSED TVSIRSKSVPGALDKDSLEETEESIDALVSSQLSTNTHRLASGLSTTSLNSMMSVYSETG DYGNVKVSGEILLHISYCYKTGGLYIFVKNCRNLAIGDEKKQRTDAYVKSYLLPDKSRNN KRKTKIRTGTNPEFNETLKYTISHTQLETRTLQLSVWHYDRFGRNSFLGEVEIPFDSWNF ENPTDEWFVLQPKVEFAPDIGLQYKGELTVVLRYIPPEENLMLPPEQLQGNKTFKKGKKK ESPVISGGILEVFIKEAKNLTAVKSGGTSDSFVKGYLLPDDSKATKHKTLVIKKSVNPQW NHTFMFSGIHPQDIKNVCLELTIWDKEAFSSNIFLGGVRLNSGSGVSHGKNVDWMDSQGE EQRLWQKMANNPGTPFEGVLMLRSSMGKCLIIGDDEDFTWQASLGKLPSTHQTWCTASVW QSLWGVAVKDSDPLANHALSHIYIPSPVPSFLGPLSEALGKRSSGSSAEILWGENGRVKL MLPLVLDCD >gi568815575r:38049714_38278346|GENSCAN_predicted_CDS_1|2370_bp nngaagctgaaaaatgaactcttagaagcaaaacgtagaagtgggaaaactcaacaagag gccagcagagtttgtgttcactgtcacagaaacctgggcctaatctttgaccggggagac ccttgtcaggcttgctcactgagggtatgcagggagtgtcgagttgcaggccccaatggc agctggaagtgcactgtctgtgacaaaatcgcgcagctaaggattataactggtgagtgg ttttttgaagaaaaggcaaaacgtttcaagcaagtcaatgttctcggcactgatgttgtc cgacagtccattttaagaagaagtccaggagctgaagaagtacagagccaagagcaaacc cgccaggatgcagaaaagtcagacacttcacctgttgctgggaagaaggccagccatgat gggcccaagagaaagggatttcttcttagcaagttcagatcggcaaccagaggagaaatc ataactcccaaaactgacactgggcggagctatagcttggacttagacggtcaacatttt cggagtttaaaatcacctcctggttcagacaggggaagcactggctcatcagatctcaat gaccaggaacctggtcctaggaccccgaagagcagtcggagcaatggtgtgaccccaggc actcagagttcaccagccccaagcacacgaactgtgacctcagtcatcagtagagagtat ggttttgaaaattccatggatttggctgctattgaaggtacctctcaggagctcacaaag agtcaccgcagaaacacttctggcacaccttccatagcagtgtctggaacctctctctcc tcagatcagagtcgatctgagttagatttgagtgagtcatttacagaagactcagaggat actgtaagcataagaagcaagtctgtccctggggctttagacaaggactccttggaagag actgaagaaagcattgatgccttagtgtcctcgcagttatctacaaacactcaccgtctg gcaagtggcctatcaactaccagccttaacagcatgatgagcgtttacagtgaaacggga gactatggcaacgtgaaagtcagtggtgaaatccttctccatatcagctactgctacaaa actggtgggctgtacatttttgtcaagaattgcagaaatctggccataggagatgaaaag aaacagaggacagatgcttatgtcaagtcatatcttcttcctgacaagtcccggaacaac aagcgtaagaccaaaatcagaacaggcaccaatccagaattcaatgaaacactaaagtac actatcagccatacccagctggaaacaagaactctgcagctctcagtctggcactatgat cgatttggacgtaatagcttcctcggggaagtagagattccttttgactcatggaacttt gaaaatccaactgatgagtggtttgtgcttcaacccaaggtggagtttgctcctgatatt ggccttcaatacaaaggagagctgacagttgttttacgttacattcccccagaagagaac ctgatgcttccaccagaacaactccaaggaaataagacttttaaaaagggaaagaagaag gagtcacctgtaatctctggaggaatactagaagtgttcatcaaagaggcaaagaatttg acagcagtgaagtcaggaggcacttctgatagctttgtgaagggctacctgctccctgat gatagcaaagccaccaagcacaaaactctggtaataaaaaagagtgttaaccctcagtgg aatcatacattcatgttcagtggcatccatccccaggatataaagaatgtttgcctagaa cttactatctgggacaaggaggccttttccagcaacatctttctgggaggagttcgtttg aattctggaagtggtgtgagccatgggaagaacgtggattggatggactctcagggggaa gagcagcgcctttggcagaagatggccaacaaccctggaactccctttgagggtgtactc atgcttcgttccagcatgggaaaatgcctgatcataggcgatgacgaagactttacctgg caggcttcactgggaaagctgccctccacacaccagacttggtgcacagccagcgtatgg cagtctctgtggggtgttgcagtcaaagactcagaccccctggccaatcatgctctttca catatttacattcctagccctgtgccttcatttctaggccctttatctgaggccttgggc aagaggtcttcaggttcctctgctgagatcttatggggggaaaatggaagagtgaagttg atgcttcctctggttttagactgtgactga >gi568815575r:38049714_38278346|GENSCAN_predicted_peptide_2|469_aa MALVCCDVSQIQENQGRTGQLLTGDVLQLFSTQVNNISNIMTGAWCLCADTPWCSPIKVK YGDVYCRAPQGGYYKTALGTRCDIRCQKGYELHGSSLLICQSNKRWSDKVICKQKRCPTL AMPANGGFKCVDGAYFNSRCEYYCSPGYTLKGERTVTCMDNKAWSGRPASCVDMEPPRIK CPSVKERIAEPNKLTVRVSWETPEGRDTADGILTDVILKGLPPGSNFPEGDHKIQYTVYD RAENKGTCKFRVKVRVKRCGKLNAPENGYMKCSSDGDNYGATCEFSCIGGYELQGSPARV CQSNLAWSGTEPTCAAMNVNVGVRTAAALLDQFYEKRRLLIVSTPTARNLLYRLQLGMLQ VSPRGRAKQAQCGLDLRHITVVELVGVFPTLIGRIGAKIMPPALALQLRLLLRIPLYSFS MVLVDKHGMDKERYVSLVMPVALFNLIDTFPLRKEEMVLQAEMSQTCNT >gi568815575r:38049714_38278346|GENSCAN_predicted_CDS_2|1410_bp atggcccttgtttgttgtgatgtgtctcagatacaggagaatcagggtaggactggacaa cttttgactggagatgtccttcagttgttttctacacaagtgaacaatatttccaatata atgactggtgcctggtgcttatgtgcagataccccgtggtgctcccccatcaaggtgaag tatggggatgtgtactgcagggcccctcaaggaggatactacaaaacagccctgggaacc aggtgcgacattcgctgccagaagggctacgagctgcatggctcttccctactgatctgc cagtcaaacaaacgatggtctgacaaggtcatctgcaaacaaaagcgatgtcctaccctt gccatgccagcaaatggagggtttaagtgtgtagatggtgcctactttaactcccggtgt gagtattattgttcaccaggatacacgttgaaaggggagcggaccgtcacatgtatggac aacaaggcctggagcggccggccagcctcctgtgtggatatggaacctcctagaatcaag tgcccaagtgtgaaggaacgcattgcagaacccaacaaactgacagtccgggtgtcctgg gagacacccgaaggaagagacacagcagatggaattcttactgatgtcattctaaaaggc ctccccccaggctccaactttccagaaggagaccacaagatccagtacacagtctatgac agagctgagaataagggcacttgcaaatttcgagttaaagtaagagtcaaacgctgtggc aaactcaatgccccagagaatggttacatgaagtgctccagcgacggtgataattatgga gccacctgtgagttctcctgcatcggcggctatgagctccagggtagccctgcccgagta tgtcaatccaacctggcttggtctggcacggagcccacctgtgcagccatgaacgtcaat gtgggtgtcagaacggcagctgcacttctggatcagttttatgagaaaaggagactcctc attgtgtccacacccacagcccgaaacctcctttaccggctccagctaggaatgctgcag gtgagtcctcgaggcagggccaagcaagcacagtgtggccttgatcttcgacacatcacc gtggtggagctggtgggtgtgttcccgactctcattggcaggataggagcaaagattatg cctccagccctagcgctgcagctcaggctgttgctgcgaatcccactctactccttcagt atggtgctagtggataagcatggcatggacaaagagcgctatgtctccctggtgatgcct gtggccctgttcaacctgattgacacttttcccttgagaaaagaagagatggtcctacaa gccgaaatgagccagacctgtaacacctga >gi568815575r:38049714_38278346|GENSCAN_predicted_peptide_3|158_aa MGSPAHRPALLLLLPPLLLLLLLRVPPSRSFPGRIGGVAASSAGHQRPPESPLGPFARDR GRIQCPNLRLEVDWLLLPADIFLNASWISRQQSPDVLPCQDLGCWKIAQKDSIFGKVPFR AHTAFFTGPWFLRTALIAANEDFTSPSSPFPADSLPWS >gi568815575r:38049714_38278346|GENSCAN_predicted_CDS_3|477_bp atggggagccccgcacatcggcccgcgctgctgctgctgctgccgcctctgctgctgctg ctgctgctgcgcgtcccgcccagccgcagcttcccaggtaggatcggaggcgtggctgcc tcctcagctgggcaccaaagacccccggagtcgccgttgggacccttcgctcgggataga gggaggatccagtgtcctaatctgaggctggaggtggactggcttttgctacctgctgac atatttcttaatgcttcctggatttctagacagcagtctccagatgtgctgccctgccag gatcttggatgctggaaaattgctcagaaagactcaatttttggcaaagttccctttagg gcacatactgcctttttcactggcccatggtttctgagaactgccctcatcgctgcaaat gaggacttcaccagcccctccagtcccttccctgcagacagcctgccttggtcctag >gi568815575r:38049714_38278346|GENSCAN_predicted_peptide_4|71_aa MRVEPSQAVAGKGIPKNGFLDLQQERVKRKSQSTRTEFSGQIATIIIWKDKEVQRVTVKL CLPGAEDMLEP >gi568815575r:38049714_38278346|GENSCAN_predicted_CDS_4|216_bp atgagggtagagccctctcaagctgttgccggaaaagggatccccaagaatgggttcttg gatcttcagcaggaaagagttaaaaggaagtcgcagagtacaagaactgagttttcaggg caaattgccaccatcatcatctggaaagacaaagaagtccagagagtcacagtcaagctc tgcttacctggagcagaagacatgctggaaccataa >gi568815575r:38049714_38278346|GENSCAN_predicted_peptide_5|96_aa MAYLKEHYEDITDFSTETVKAKSLIVCCLKYPSLARMLGVQVREKPNSTDANSNNIFTTY MKESRTPGLKDLFSQLLEVFLAESLQLSVLFGNALY >gi568815575r:38049714_38278346|GENSCAN_predicted_CDS_5|291_bp atggcatatttaaaggaacactatgaagacataactgacttctcaacagaaactgtgaaa gccaaaagccttatcgtctgttgcctgaagtatccatccttggctaggatgcttggggtt caagtaagagaaaagcctaactcaactgatgcaaacagtaacaacatttttaccacctat atgaaggagtccagaaccccaggattgaaggacctattctctcagctgctggaggtgttt ttagcagagagcctgcagctgtcagtcctttttggaaatgccctttactga >gi568815575r:38049714_38278346|GENSCAN_predicted_peptide_6|172_aa DHEFSKTEELKLEDVDEEINAENVESKKKTVGDDESVPTGYHSKTEGAERTNDDSSAETI EKLKLVFLNDPIVGYMLDDADSSSLEILENSETTPSKDMKKTKKIFLFKRVPSINQKIVK NNNEPLPEIKSIGDQIILKSDNKDADQNHMSQNHQNIPPTNTERRSKSCTIL >gi568815575r:38049714_38278346|GENSCAN_predicted_CDS_6|519_bp gatcatgaattttctaaaactgaggaactaaaactagaagatgtggatgaggaaattaat gctgaaaatgtggaaagcaagaagaaaactgtgggagatgatgaaagtgttcctacaggt tatcacagtaaaacagaaggagcagaaagaaccaatgatgatagctcagctgaaactatt gaaaagttaaaactagtatttctgaatgatccaattgtaggatacatgcttgatgatgca gatagcagttcattagaaatcctagaaaacagtgaaacaacaccaagcaaagacatgaaa aaaacaaagaagatttttctgttcaaaagagtcccctcaataaatcaaaagattgtcaag aataacaatgagccgctcccagagataaaatccataggagaccagatcattttaaaaagt gataataaagatgccgaccagaaccacatgagtcagaatcatcagaatatcccaccaaca aatacagagagaagatcaaaatcctgtacaatactataa