GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:49:53 Sequence gi568815589f:88326540_88575274 : 248735 bp : 44.16% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 5452 5447 6 1.05 1.02 Term - 31128 30964 165 0 0 96 47 67 0.532 1.32 1.01 Init - 31888 31838 51 1 0 68 98 13 0.467 1.48 1.00 Prom - 52513 52474 40 -1.86 2.04 PlyA - 52820 52815 6 1.05 2.03 Term - 60060 59889 172 2 1 77 54 169 0.870 9.70 2.02 Intr - 60229 60193 37 2 1 70 100 3 0.806 -2.98 2.01 Init - 62143 61687 457 1 1 75 85 140 0.419 6.37 2.00 Prom - 64065 64026 40 -3.76 3.03 PlyA - 64352 64347 6 1.05 3.02 Term - 84348 83986 363 0 0 -22 43 290 0.930 8.07 3.01 Init - 84971 84690 282 2 0 57 61 230 0.795 14.24 3.00 Prom - 87232 87193 40 -3.46 4.00 Prom + 90652 90691 40 -4.76 4.01 Init + 108233 108290 58 1 1 71 63 63 0.027 1.57 4.02 Intr + 122402 122450 49 0 1 104 82 55 0.076 4.24 4.03 Intr + 135957 136210 254 0 2 46 111 116 0.773 6.78 4.04 Intr + 141833 142066 234 0 0 100 100 140 0.980 14.06 4.05 Term + 148539 148738 200 1 2 86 37 228 0.969 15.06 4.06 PlyA + 150776 150781 6 1.05 5.00 Prom + 155960 155999 40 -3.76 5.01 Init + 170297 170395 99 1 0 54 95 127 0.927 8.27 5.02 Intr + 208842 209197 356 2 2 88 73 680 0.766 60.59 5.03 Intr + 212541 212810 270 0 0 92 34 103 0.590 1.96 5.04 Term + 217840 218008 169 1 1 100 45 217 0.998 15.95 5.05 PlyA + 218407 218412 6 1.05 6.00 Prom + 218716 218755 40 -10.84 6.01 Init + 220516 220585 70 0 1 116 93 0 0.321 4.52 6.02 Intr + 229270 229313 44 2 2 62 93 41 0.087 -0.04 6.03 Term + 232892 233035 144 1 0 38 42 171 0.246 5.41 6.04 PlyA + 234858 234863 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 65122 64967 156 1 0 67 48 111 0.894 2.93 S.002 Init - 108911 108867 45 2 0 120 53 49 0.811 5.30 S.003 Term - 199224 198979 246 0 0 90 48 186 0.931 10.59 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589f:88326540_88575274|GENSCAN_predicted_peptide_1|71_aa MISFNNRSHIQVTLMQKAQNHVEAAQVWGFHPEKPQPELYVGLFSAMFGAAGTQDTKSIG STQHGYPGPGS >gi568815589f:88326540_88575274|GENSCAN_predicted_CDS_1|216_bp atgatctcctttaataacaggtctcacatccaggtcacactgatgcaaaaggctcaaaac catgtggaagctgcccaggtttggggcttccaccctgagaagccacagcctgagctctat gttggcctcttttctgccatgtttggagcagctgggacccaggacaccaagtccataggc agcacacagcatgggtaccctgggcctggctcatga >gi568815589f:88326540_88575274|GENSCAN_predicted_peptide_2|221_aa MRAPPAGQCSAGPRAGRAHLRPRADRRTAGPPAARTPSRPTPALRGPHLLTAEAAVHRPT GSAVARSRSRRPQPEPDGASPPPLPRRGRRPPPNSQGARALRPPNCDGRAAPAPGADGRL RRRSGAGRGRRGRGLAPLTWLAAGRGSSPGLPVGSSPKQRGMGRRMFDSIPGLHPLDASS THTPPKFTENVFKVKYVAPPSSPSPTPDLKFATGDDLIGSY >gi568815589f:88326540_88575274|GENSCAN_predicted_CDS_2|666_bp atgcgggcgccgcccgctggacaatgctccgccgggccccgcgcgggccgcgctcacctg cgcccccgggcagaccgccgcaccgctggcccgcccgcagcccgcactccctcacgaccc acgcccgccctgcgcgggcctcacctgctgaccgccgaggctgcagtccacaggcccacg ggttccgccgtcgctcgttcgcgctcccgccggccccagcctgagccagacggcgcctcc ccgcccccgctccctcgccgcggacgccgcccgccccccaacagtcagggcgcacgcgca ctgcgcccgccgaactgcgacggccgcgccgccccggcgcctggcgccgacggccgactg cgcaggcgcagtggggcggggcggggccggcgagggcggggcctggcccctctcacctgg ctggcggcagggcggggctcgagcccgggactgccagtggggtcatctccgaaacaaaga gggatgggaagaaggatgtttgacagcatccctggcctccacccactagatgccagtagc acacacactcctccaaaattcacggaaaacgtcttcaaagtcaaatatgtggccccacct tcatctccatccccaacccctgatttaaaatttgccactggtgatgatctcattgggagc tattga >gi568815589f:88326540_88575274|GENSCAN_predicted_peptide_3|214_aa MDDAEASLKSLFSMPSCLSHKSPKEPEQLRKLFIGGLSFETTDESLRSHFEQWGTLTDCV VMRDPNTKCSRGFEFVTYATVEVEDAAMNARPHKPKKLSDSGNVVVVEVVSVGMTTLVVE ETSVVMVALVAVGMAIMDLVMVEAILEVVEATMILAITTISLQILDPRRKETLEAEALAP MVVEANTMPNHETTVATMVVPVAAVAMAVAEDFN >gi568815589f:88326540_88575274|GENSCAN_predicted_CDS_3|645_bp atggatgacgctgaagcatcattaaagtctctcttctccatgccatcatgtctaagtcac aagtctcctaaagagcccgaacagctgaggaagctcttcattggagggttgagctttgaa acaaccgatgagagcctgaggagccattttgagcaatgggggacactcacggactgtgtg gtaatgagagatccaaacaccaagtgctccaggggctttgagtttgtcacatatgccact gtggaggtggaggatgcagccatgaatgcaaggccacacaagccaaagaagctaagtgat tctggaaacgtggtggtcgtggaggtggtttcagtgggaatgacaactctggtagtggag gaaacttcagtggtcatggtggctttggtggcagtggggatggctataatggatttggta atggtggaagcaattttggaggtggtggaagctacaatgattttggcaattacaacaatc agtcttcaaattttggacccacgaaggaaggaaactttggaggcagaagctctggcccct atggtggtggaggccaatactatgccaaaccatgaaaccacggtggctactatggtggtt ccagtagcagcagtagctatggcagtagcagaagattttaattag >gi568815589f:88326540_88575274|GENSCAN_predicted_peptide_4|264_aa MSCARLLMPVIPTLWEAEAGHAGVSANMMKKRTSHKKHRSSVGPSKPVSQPRRNIVGCRI QHGWKEGNGPVTQWKGTVLDQVPVNPSLYLIKYDGFDCVYGLELNKDERVSALEVLPDRV ATSRISDAHLADTMIGKAVEHMFETEDGSKDEWRGMVLARAPVMNTWFYITYEKDPVLYM YQLLDDYKEGDLRIMPDSNDSPPAEREPGEVVDSLVGKQVEYAKEDGSKRTGMVIHQVEA KPSVYFIKFDDDFHIYVYDLVKTS >gi568815589f:88326540_88575274|GENSCAN_predicted_CDS_4|795_bp atgagctgcgcgcggttgctcatgcctgtaatcccaacactctgggaggccgaagcgggc catgctggagtatctgccaacatgatgaagaagaggacatcccacaaaaaacatcggagc agtgtgggtccgagcaaacctgtttcccagccccggcggaacatcgtaggctgcaggatt cagcatgggtggaaagaggggaatggccctgttacccagtggaaaggaaccgttctggac caggtgcctgtaaatccttctttgtatcttataaaatacgatggatttgactgtgtttat ggactagaacttaataaagatgaaagagtttctgcgcttgaagtcctccctgatagagtt gcgacatctcgaatcagcgatgcacacttggcagacacaatgattggcaaagcagtggaa catatgtttgagacagaggatggttctaaagatgagtggaggggaatggtcttagcacgt gcacctgtcatgaacacatggttttacattacctatgagaaagaccctgtcttgtacatg taccaactcttagatgattacaaagaaggcgaccttcgcattatgcctgattccaatgat tcacctccagcagaaagggaaccaggagaagttgtggacagcctggtaggcaaacaagtg gaatatgccaaagaagatggctcgaaaaggactggcatggtcattcatcaagtagaagcc aagccctccgtctatttcatcaagtttgatgatgatttccatatttatgtctacgatttg gtgaaaacatcctag >gi568815589f:88326540_88575274|GENSCAN_predicted_peptide_5|297_aa MGLARESGLALHVPLCCLLPVSKMDMDERRKQKVIILLQVSSGLRWLRVCAMVDILGERH LVTCKGATVEAEAALQNKVVALYFAAARCAPSRDFTPLLCDFYTALVAEARRPAPFEVVF VSADGSSQEMLDFMRELHGAWLALPFHDPYRHWCRGCRKPWKTAVGLANLSKHKPPDLGH CPSVRGGEIAFLMSFQVMLVLLLLGPHIVPDSAGQRLVHGPMSVGNLSGVQMLSDLGVLR NLELRKRYNVTAIPKLVIVKQNGEVITNKGRKQIRERGLACFQDWVEAADIFQNFSV >gi568815589f:88326540_88575274|GENSCAN_predicted_CDS_5|894_bp atggggctcgcccgagagtcaggcctggctcttcacgttcccttgtgctgcctccttccc gtgtccaaaatggacatggatgagagaaggaagcaaaaggtgatcatcctcctgcaggtg tcctcgggtctcaggtggctgcgtgtctgcgccatggttgacattctgggcgagcggcac ctggtgacctgtaagggcgcgacggtggaggccgaggcggcgctgcagaacaaggtggtg gcactgtacttcgcggcggcccggtgcgcgccgagccgcgacttcacgccgctgctctgc gacttctatacggcgctggtggccgaggcgcggcggcccgcgcccttcgaagtggtcttc gtgtcagccgacggcagctcccaggagatgctggacttcatgcgcgagctgcatggcgcc tggctggcgctgcccttccacgacccctaccggcattggtgcaggggatgcagaaagccc tggaagactgctgtgggattggccaacttgagcaagcataaaccaccagatcttgggcac tgccccagcgtcagaggtggggagattgcatttctgatgagtttccaggtgatgttggtg ttgctgctcctgggaccccatattgttcctgacagtgctggtcaaaggctggtgcatgga cccatgtcagttgggaatctgtcaggagtgcagatgctgtcggacctgggggtgctcagg aatcttgagctgaggaagaggtacaacgtcacagccatccccaagcttgtgattgtgaaa caaaatggggaggtcatcaccaacaaagggcggaagcagatccgggaacgggggttggcc tgcttccaggactgggtggaggcggccgatatcttccagaatttctccgtttga >gi568815589f:88326540_88575274|GENSCAN_predicted_peptide_6|85_aa MGQGLKDMQHRDPAQAHVSSRLPDPSPSVGKPGAERQKNLSFMTSQHLVLAPPTATTANR IAKGFLATKHVHLLDLIPAIRRPVP >gi568815589f:88326540_88575274|GENSCAN_predicted_CDS_6|258_bp atggggcaggggttgaaagacatgcagcatcgtgacccagcccaggcccacgtctcatcc agactccctgatccttcaccttctgtgggcaagccaggtgcagagcggcaaaagaatctg tccttcatgaccagccagcatctggtacttgcccctccaacagcaacaacagcaaatcga attgcaaagggcttcctcgctaccaagcacgttcacttgctcgacctcatccctgccatc cgtaggcctgtcccttga