GENSCAN 1.0    Date run: 7-Nov-116    Time: 18:06:48

Sequence gi568815591f:112106717_112442726 : 336010 bp : 39.20% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4018   4371  354  0  0   74   64    99 0.038   3.28
 1.02 Term +  14602  14724  123  0  0  100   33    82 0.149   1.30
 1.03 PlyA +  16775  16780    6                               1.05

 2.00 Prom +  20372  20411   40                              -3.65
 2.01 Init +  32129  32272  144  1  0   53   88    66 0.034   3.27
 2.02 Intr +  37125  37185   61  2  1  101   84    26 0.015   0.89
 2.03 Intr +  40825  40972  148  2  1   54   99    89 0.027   4.97
 2.04 Intr +  44087  44282  196  2  1   55  105     2 0.010  -2.90
 2.05 Intr +  48086  48202  117  1  0   83   80    67 0.834   5.14
 2.06 Intr +  48929  49031  103  1  1   87   90    62 0.825   5.13
 2.07 Intr +  50541  50690  150  1  0   62   64    84 0.693   2.61
 2.08 Intr +  73480  73749  270  2  0  -60  100   196 0.025   2.59
 2.09 Intr +  83557  83651   95  2  2   38   65   117 0.039   3.16
 2.10 Term +  98497  99057  561  0  0   70   47   223 0.471   9.78
 2.11 PlyA +  99358  99363    6                               1.05

 3.00 Prom +  99444  99483   40                             -15.66
 3.01 Sngl + 100001 100360  360  1  0   66   53   272 0.948  17.39
 3.02 PlyA + 100854 100859    6                               1.05

 4.03 PlyA - 101050 101045    6                               1.05
 4.02 Term - 104449 104281  169  0  1  111   37    75 0.350   1.07
 4.01 Init - 114421 114126  296  1  2   89   59   151 0.133   9.01
 4.00 Prom - 122017 121978   40                              -4.35

 5.00 Prom + 133058 133097   40                              -5.05
 5.01 Init + 150746 150776   31  1  1   62   92    49 0.102   2.65
 5.02 Intr + 180157 180358  202  2  1   92  115   205 0.982  21.02
 5.03 Intr + 182034 182095   62  0  2   87   36    82 0.038   0.36
 5.04 Intr + 203545 203675  131  2  2   46   70    72 0.061   0.69
 5.05 Intr + 211466 211557   92  1  2   87   41    70 0.127   0.07
 5.06 Intr + 221001 221111  111  0  0   83  108    24 0.188   2.48
 5.07 Intr + 223368 223500  133  0  1   53   87    50 0.397   1.13
 5.08 Intr + 229388 229455   68  1  2   88   52   142 0.802   7.38
 5.09 Intr + 231014 231110   97  2  1   77   94    37 0.824   2.29
 5.10 Intr + 232116 232205   90  2  0   59   81    73 0.844   2.97
 5.11 Intr + 233127 233169   43  2  1   84  111    28 0.998   1.59
 5.12 Intr + 234156 234330  175  1  1  127   45   111 0.972   8.78
 5.13 Intr + 235845 235967  123  0  0  114   14   121 0.016   6.08
 5.14 Intr + 253244 253325   82  2  1   45   53    67 0.007  -2.38
 5.15 Intr + 256912 257008   97  0  1   55   94    84 0.527   4.36
 5.16 Intr + 258241 258387  147  2  0  104   48    87 0.414   5.59
 5.17 Intr + 258605 258786  182  0  2   83   87    43 0.439   2.37
 5.18 Intr + 259500 259568   69  2  0   55   94    74 0.431   3.16
 5.19 Intr + 263603 263822  220  1  1   53   63   117 0.311   2.65
 5.20 Intr + 266232 266345  114  1  0   47   85    76 0.479   2.70
 5.21 Intr + 268176 268395  220  1  1   98   22   119 0.125   2.74
 5.22 Intr + 286701 286799   99  0  0   56   75    79 0.000   1.71
 5.23 Intr + 291486 291531   46  0  1   87   92    33 0.302   1.09
 5.24 Term + 292075 292167   93  0  0  153   42    29 0.781   1.65
 5.25 PlyA + 292297 292302    6                               1.05

 6.00 Prom + 296103 296142   40                              -5.25
 6.01 Init + 301458 301611  154  2  1   75   82    83 0.083   6.59
 6.02 Intr + 302832 303008  177  1  0   40   92    87 0.064   3.27
 6.03 Intr + 310900 311030  131  2  2   91   72    29 0.031   1.09
 6.04 Term + 314029 314121   93  0  0   86   36   133 0.904   4.75
 6.05 PlyA + 314225 314230    6                               1.05

 7.02 PlyA - 316540 316535    6                               1.05
 7.01 Term - 326650 326504  147  1  0   79   48    97 0.089   1.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  33534  33664  131  1  2   68   42   134 0.933   4.16
S.002 Init - 168302 168235   68  2  2   76  119    47 0.918   7.20
S.003 Term + 182034 182157  124  0  1   87   49    94 0.862   2.28
S.004 Term + 235845 236013  169  0  1  114   48   124 0.978   7.37
S.005 Intr - 283668 283498  171  1  0   81   71   111 0.814   7.69

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:112106717_112442726|GENSCAN_predicted_peptide_1|158_aa
MGCSEDPPSDTSSIDTVSTAWGSQTGGQLEGWLTCCRRRLSHRGNSAPHGNVRTLTPCQR
LPHSECTCTSAKRIPAHVLTNPQTPIAPWMSTLEWSIRRAGDLHQKQGVIISEKHPPQGV
VTGCYPVIRFPTFPYVFCGAQAPSASYKGPSPSTKRLT

>gi568815591f:112106717_112442726|GENSCAN_predicted_CDS_1|477_bp
atgggctgctctgaggaccctccttcagacacttccagtatagacacagtatccactgct
tggggaagtcagactggtggtcagctggaggggtggcttacatgctgcaggaggaggctg
tcccacaggggcaactccgctccccacggaaatgtgcggaccctcactccctgccagcgt
cttccccattcagaatgcacctgtacctcagcaaaaagaatccctgcccatgttctaacg
aatccccaaacccccattgccccttggatgtccacccttgagtggagtataaggcgtgcg
ggtgacctacatcagaaacagggtgtgatcatttctgaaaagcacccaccgcagggagta
gtaacaggatgctatcctgttattagattccccacattcccctatgtgttctgcggagca
caggctcctagtgcttcttacaaaggtccatctccatcaacaaaacgtttaacatag

>gi568815591f:112106717_112442726|GENSCAN_predicted_peptide_2|614_aa
MHMHQEKATVRRWLSTSQEVGLHYNFNSAGWPNPNPNPSAGRQESTVQTYTFPLVSITAL
ADVLIRIISFAQYLKIPSKVFPIIEKFAHFPVFSIKSRNPASEDLATTITKIRRLSERTS
IYSDTISYPHSWEHQQQQHNPHSLITRYYLAPARLGMLKGLQRFVCLSHKDLLRMSFLVI
KWKKEQDPALQLQPGYGHAIAGSILQELWPDTIPCCASDLKQLGVPDCGQCGPQSSKDVS
QSPREVFEFVSGSANEAWQPSNESNYRVLIQYPPCSGHGRRHFPLPALDMVHCRPDSTPA
VGLLSREAEPDKEGGSPPQPRNPRQMQNDSGEPCSPKEPNKHQRTKDFRKTEAQCSFTCK
NNLFTSADNFYPDRSTQHNLPSARQIRLQFSFPKFGRLPYTVAKSDSFKTAAAKALREAA
AGHRAGTENSESYPVPFSPNASVRVCFPKSYLPAPKNQAFRSPSPFITQQPCTATPPPLR
SPGVERVARSKNTAEIPVLLQRPKNRAAQMLKFTNAPAPQHTHPINPGSQRGRRVREGSA
CQSGSLQDPATFHTPGMSIRARGKGLRLPTVLGVFSLPECCCLPLHPEEPLQLPVALGKK
GKEGEKPHSPPLPT

>gi568815591f:112106717_112442726|GENSCAN_predicted_CDS_2|1845_bp
atgcacatgcaccaagaaaaagccacagtgagaaggtggctgtctacaagccaggaagtg
ggcctgcattataatttcaactctgcaggttggcctaaccctaaccctaacccctctgca
ggtaggcaagaatcaactgtgcagacttacacattccctttggtctcaatcacagcactt
gcagatgttttgattcggatcatctcctttgcccaatacctcaaaataccctcaaaggta
tttcctataattgaaaagtttgcccactttcctgtgttcagtataaaatctagaaatcct
gcatcagaagacctggccacaaccatcacaaaaataaggcggttatctgagaggacttct
atatattcagacaccatttcctacccacattcttgggaacatcagcaacagcaacataat
ccccatagcttaattacaagatactacttggccccagccagactggggatgctaaaaggc
ttgcagaggtttgtttgtttaagccacaaggatctcttaaggatgagttttctggttatt
aagtggaagaaagaacaagatcctgcactgcaacttcagccaggctacgggcatgcaata
gcggggagcattcttcaggagctctggcctgacacaataccttgctgtgcctcagattta
aaacagttaggagtcccagactgtggacagtgtggaccccagagttcaaaagatgtgagt
cagagtcccagggaagtatttgagtttgtcagtggttctgctaatgaagcctggcaacca
agcaatgagagtaattacagagtgcttatccaatacccaccttgtagcggccatgggaga
cgtcattttccccttcctgccctggacatggtacactgccggccagattctacaccagct
gttggcttactctccagagaggcagaaccagacaaggaaggaggtagcccccctcaaccc
agaaacccccggcagatgcaaaacgactctggagaaccttgctctcccaaagagcccaac
aagcatcagaggaccaaagacttcagaaaaacagaagcacagtgctcctttacatgtaaa
aataacctcttcacttctgcagacaacttttaccccgacaggtccacacagcacaacctg
ccatctgccaggcagatcaggctgcaattctcctttcccaagtttggaaggcttccctac
actgtagcaaagtcggacagtttcaaaacagcagcagcaaaggccctgagagaggcagct
gcagggcacagagcaggaacagagaacagcgagagctatcctgttcctttttctccgaac
gccagtgtcagggtatgctttcctaaaagctatctccctgcccccaagaaccaggctttc
cgatccccctccccatttatcactcagcagccctgtaccgccacaccgcccccactgcgc
tcacccggggtcgagagggtggccagaagtaaaaacactgctgaaatacccgttcttctc
caaagaccaaagaaccgagcggcgcagatgttgaaatttaccaatgcccccgccccgcaa
catacacaccccataaatccgggatcgcagagagggaggagggtcagagagggctcagct
tgtcagagcggctctctccaggaccctgccacattccacactccgggaatgtcgatccga
gccagagggaagggtttgcgcctccctactgttttgggggtgttctctctgcccgagtgt
tgttgtctccctctccacccagaggaaccgttgcagctccctgtcgctttggggaagaag
ggtaaggaaggagagaaaccccattcgcctcctcttccaacttga

>gi568815591f:112106717_112442726|GENSCAN_predicted_peptide_3|119_aa
MAASKTQGAVARMQEDRDGSCSTVGGVGYGGEYGAPEARLMCLPFSMTGCATDLWSDPRS
PSALGPPGFDWAPRVVAQRDGLEPVAATWLGLPGDGGENAMGWRRKCHGMAFPSAGEAP

>gi568815591f:112106717_112442726|GENSCAN_predicted_CDS_3|360_bp
atggctgcttccaagacccagggggctgtcgcccgaatgcaggaagaccgtgatgggagc
tgcagcacagtcgggggtgtaggttatgggggtgagtacggtgccccggaggcgcggctg
atgtgtcttcctttctctatgaccgggtgtgcaacggacctctggtctgaccctaggagc
ccttcagctctggggccacctgggttcgactgggccccacgggtggtggcccagcgggat
gggttggagccggtggcggctacttggttggggctgcccggggatggcggagaaaatgcc
atgggatggcggagaaaatgccatggaatggccttcccctccgccggggaagctccctga

>gi568815591f:112106717_112442726|GENSCAN_predicted_peptide_4|154_aa
MGTSLDQEPSRHPAGSGRVEVSGGSAAGLRWWQTAVVDKEQKLSSSGNKHGPEECAVGRF
NRVKTELPYKGRGPKEGSPCWLECLGLYPDNCPSPYALRDITHEIVSSLEDKHYQNHQII
RSRIQALAKSSTTKHPFGGKKVDFDRILQYGDTD

>gi568815591f:112106717_112442726|GENSCAN_predicted_CDS_4|465_bp
atgggcacctcgctggatcaggagcccagcagacaccctgctggatctggaagggtggaa
gtcagcggcgggtcagcggcgggtctgcgatggtggcaaacagcagtggtggacaaagag
caaaagctcagctcgagcggtaacaaacacggaccagaagagtgtgcagttggaaggttt
aatagagtgaaaacggagctcccatacaaagggaggggacccaaagagggtagcccgtgc
tggctggaatgcctgggtttatatcctgacaattgtccctccccctatgctctcagagac
atcacacatgagatagtaagttctctggaagataaacattaccaaaatcatcagataata
aggtccagaatccaagcattagcaaaatcatcaactactaaacacccattcggagggaaa
aaagttgactttgacagaatattacaatatggagatacagattaa

>gi568815591f:112106717_112442726|GENSCAN_predicted_peptide_5|908_aa
MNSEPARDLPDSKDCILEPLSLPESPGGTTTLEGSPSVPCIFCEEHFPVAEQDKLLKHMI
IEHKIVIADVKLVADFQSCVLSQRGQPSPLEEDQSLVKGVVKGPTMNNTDPPVTQEILRV
NKLSPRNRQQGPSEFSTQTLYLREILEQQQQERNDTNFHGVCMFCNEEFLGNRSVILNHM
AREHAFNIGLPDNIVNCNEFLCTLQKKLDNLQCLYCEKTFRDKNTLKDHMRKKQHRKINP
KNREYDRFYVINYLELGKSWEEVQLEDDRELLDHQEDDWSDWEEHPASAVCLFCEKQAET
IEKLYVHMEVSLTSAGSLSVITSRIPNHPSYGATLNAIPDAHEFDLLKIKSELGLNFYQQ
VKLVNFIRRQVHQCRCYGCHVKFKSKADLRTHMEETKHTSLLPDRKTWDQLEYYFPTYEN
DTLLCTLSDSESDLTAQEQNENVPIISEDTSKLNRKIYPKIHMDSQETRITKTILKKKNK
GIGSECGRRNTKGVGAELCHGSSVLEPDGSLPEHTMGFHSVNALHPRDWHPILLEECYGP
GASAVPSVGCFHAPSGWQLPGGLTGPSVATLLPHWPVGLFGELYHVKGSTLVSINGHSAM
VIVSLRPFLFLLKEKAISRSVLATLSGAEGQLLSVFKLHAHTDPSYTFQIHKVLTTGLPP
CPRAICVPPTTSPHGHPEATDNYSTTLSYCLFSWATPGTNSHSLDSLEANAEMEFEVQGS
ISHNLNDEQDIQKIGGLFKTLPFTSSSLTIGSLALTALATVHLLFLHETESNNPSGVSSD
PDKITSPRQYTTKDVLGLIFLLLLPDLLSDPDNYTLANPLNTPPHIKPERILNSVWNFAG
TSEMSVGTYCPYLKEQVQIQCGRMKVPGESGFGFVLLEVLDVLLLVPEYCMTFYHVPVSC
PLTLVVSV

>gi568815591f:112106717_112442726|GENSCAN_predicted_CDS_5|2727_bp
atgaactcagagcctgcaagggacctgccagacagtaaggattgtatcctggagccgctt
tccctgccagaaagtccaggtggcaccaccactttagaaggttctccatctgtgccttgt
attttctgtgaagaacattttcctgtggctgaacaagacaaacttctgaagcacatgatt
attgagcataagattgtcatagctgatgtcaagttggttgctgatttccaaagctgtgtt
ctcagtcaaagaggacaaccatctccactagaagaggaccagtctttggtcaagggtgtg
gtcaaagggcccaccatgaataacacagaccctccagtcactcaggaaattctcagggtt
aataagttatctcctaggaacagacaacaagggccatccgaattctccacacaaaccctt
tacctgagagaaattctggaacaacagcagcaagaacgaaatgataccaattttcatggc
gtttgtatgttttgcaatgaagaattccttggaaacagatctgttattttgaaccacatg
gccagagaacatgctttcaacattggattgccagacaacattgtaaactgcaatgaattt
ttgtgtacattacagaaaaagcttgacaatttgcagtgcttgtactgtgagaagaccttc
agggacaaaaatacacttaaagatcacatgaggaaaaaacagcatcgtaagattaatcct
aagaacagagaatatgacagattttatgtcatcaattatttggaacttggaaaatcgtgg
gaagaagttcagttggaagatgatcgggagttgctggaccatcaggaagatgactggtct
gattgggaagaacaccctgcctctgcagtctgcttattttgtgaaaagcaagcagaaaca
attgagaagttgtatgtccacatggaggtgtccttaacctctgctgggtcactttcagtg
ataacatcacgaattcctaatcatccttcatatggtgctacactaaatgccatcccagat
gcacacgaatttgatcttctcaaaataaagtcagaacttggattaaatttctatcagcaa
gtgaaactggtcaattttattcggaggcaagttcaccaatgcagatgttatggctgccat
gtgaagttcaaatccaaagcagacttaagaactcacatggaagaaactaaacacacttcg
ctgctccccgatagaaagacgtgggatcaactggagtattattttccaacctatgaaaat
gacactctcctgtgtacactatctgacagtgaaagtgacctgacagctcaggaacaaaat
gaaaatgttcccatcatcagtgaagatacatctaaactaaatagaaaaatctatcctaaa
attcatatggactctcaagagactcgaatcaccaaaacaatcttgaaaaagaagaacaag
gggattggcagtgaatgtggaagacgaaatacaaaaggggtgggggcagagctctgtcat
gggagctctgtcttggagcctgacggttctctgccagagcacacaatgggctttcattca
gtgaatgccttgcatcctagagattggcaccccatccttttagaggaatgctatggccct
ggtgcttcagctgtgccctctgtgggctgtttccatgctccctcaggctggcagcttccg
ggaggattgacggggcccagtgtagccaccttgctgccacactggccagttggtctcttt
ggggaattataccatgttaagggctccacgctggtctctattaatggacattcagcaatg
gtgatagttagccttaggccatttctctttctactcaaagaaaaagccatcagcagatca
gtcttggccaccctgagtggagctgaagggcagctgctgtctgttttcaaattgcatgca
catactgacccatcctacaccttccaaattcacaaagttttaacaactggactgccacct
tgccccagggctatctgtgttcctcctactaccagtcctcatggccacccagaggcaact
gataactactcaacaaccttatcttattgcctgttttcttgggccactcctggtaccaac
tcacatagcttggattctctggaagcaaatgctgagatggagtttgaagtgcaaggctcc
atcagccataacctcaatgacgaacaagacatccaaaaaataggaggactgttcaagact
ttacccttcacttcctcctcccttactattggtagccttgcacttacagctctagcaact
gttcaccttttattcttgcatgaaacagaatctaacaacccttcaggagtttcatcagac
cccgacaaaatcacttccccccgacaatatacaaccaaagatgttctaggtttaattttt
ctcctcctccttcccgacctcctgagcgacccagataattacactttagccaaccccctc
aataccccaccccacattaagccagagcgtatcctgaacagcgtctggaactttgcgggc
accagtgaaatgtcagttggaacgtattgcccatacctaaaagagcaggtgcagatacag
tgtgggaggatgaaagtaccaggtgagagtgggttcggttttgttttgctggaggttctg
gatgtgctcttgcttgttccagaatactgcatgaccttttatcatgtccctgtttcctgt
ccactcactctagttgtttcagtctga

>gi568815591f:112106717_112442726|GENSCAN_predicted_peptide_6|184_aa
MDLNIPGSFHWLVSQGFESPSVYKFYITLRLHASESEAPGVKYGVVRQGQVGSSEDGSGL
MALCDVNLVHGKQTPLSTIGRNAAGSAIVLCSCSGSSIQSLGSHPFGRESELEKTTLNFI
WNQKRACIAKTILSKKNKGGDIMLPDFKLYYKATLAVSGSHDIAQARAMLKVVQTAKDLL
SSPF

>gi568815591f:112106717_112442726|GENSCAN_predicted_CDS_6|555_bp
atggacctcaacatcccaggttccttccactggctcgtttctcagggctttgaatctcct
tctgtttacaagttttatattactcttaggctacatgcctctgaaagtgaggcccctgga
gtgaaatatggtgtggtgagacagggccaggtagggtcctctgaagatggttctggtctg
atggccctctgtgatgtcaatttggtccatggaaagcaaacacccttgtccaccattggg
aggaatgctgctggctcagccatagttctctgttcttgctctggcagctccatacagtcc
ctcggttcacacccttttgggcgagaatcagaactagaaaaaactactttaaatttcata
tggaaccaaaaaagagcctgtatagccaagacaatcctaagcaaaaagaacaaaggtgga
gacatcatgctacctgacttcaaactatactacaaggctacactggcagtctccggcagc
cacgatattgcccaggctcgtgctatgttgaaagtggtccaaactgccaaagatttactc
tcctctcccttttag

>gi568815591f:112106717_112442726|GENSCAN_predicted_peptide_7|48_aa
ELIQHKRTASPSYDFICDPTNQHTPLSNPLLTKLSLKTLIPEFLGRLI

>gi568815591f:112106717_112442726|GENSCAN_predicted_CDS_7|147_bp
gaactgattcagcacaagaggacagcttcaccttcctatgatttcatctgtgaccccacc
aatcagcacaccccactttccaacccactactcaccaaattgtccttaaaaaccctgatc
cctgagtttttggggagactgatttga