GENSCAN 1.0 Date run: 5-Nov-116 Time: 08:25:58 Sequence gi568815585f:49911941_50113161 : 201221 bp : 39.10% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1578 1573 6 1.05 1.05 Term - 3220 3123 98 2 2 102 42 33 0.369 -2.75 1.04 Intr - 9640 9538 103 1 1 105 103 17 0.690 3.73 1.03 Intr - 16145 15979 167 0 2 62 89 148 0.933 11.06 1.02 Intr - 19194 19078 117 1 0 54 77 67 0.718 1.72 1.01 Init - 24295 24190 106 1 1 91 82 131 0.988 11.27 1.00 Prom - 39824 39785 40 -1.85 2.04 PlyA - 40281 40276 6 1.05 2.03 Term - 43582 43011 572 2 2 19 48 248 0.736 7.51 2.02 Intr - 44513 44271 243 0 0 -209 53 428 0.670 6.15 2.01 Init - 45448 45352 97 1 1 78 66 53 0.321 2.62 2.00 Prom - 46496 46457 40 -6.25 3.06 PlyA - 46570 46565 6 1.05 3.05 Term - 66434 66307 128 0 2 133 43 86 0.834 6.06 3.04 Intr - 77120 76976 145 2 1 57 99 44 0.552 1.33 3.03 Intr - 82021 81853 169 0 1 67 108 164 0.980 15.33 3.02 Intr - 85387 85134 254 1 2 87 88 314 0.236 26.61 3.01 Init - 89556 89542 15 0 0 30 111 5 0.311 -2.63 3.00 Prom - 89678 89639 40 -6.45 4.00 Prom + 94938 94977 40 -4.75 4.01 Init + 100001 101122 1122 1 0 63 61 672 0.043 56.26 4.02 Intr + 104034 104131 98 2 2 87 68 49 0.333 0.69 4.03 Term + 108274 108514 241 1 1 109 45 168 0.999 9.11 4.04 PlyA + 108948 108953 6 1.05 5.00 Prom + 111232 111271 40 -5.65 5.01 Init + 139919 140165 247 1 1 114 86 23 0.549 2.51 5.02 Term + 141089 141186 98 0 2 115 39 85 0.626 3.45 5.03 PlyA + 143140 143145 6 1.05 6.00 Prom + 162983 163022 40 -4.15 6.01 Init + 169443 169639 197 2 2 79 38 194 0.607 11.95 6.02 Intr + 170231 170304 74 2 2 87 39 73 0.175 0.43 6.03 Intr + 170440 170617 178 2 1 -12 91 160 0.110 4.36 6.04 Intr + 182699 182792 94 2 1 43 50 89 0.079 -0.35 6.05 Term + 187498 187908 411 0 0 39 45 227 0.618 7.76 6.06 PlyA + 188304 188309 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 100001 101224 1224 1 0 63 38 683 0.870 56.35 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:49911941_50113161|GENSCAN_predicted_peptide_1|196_aa MATSVLCCLRCCRDGGTGHIPLKEMPAVQLDTQHMGTDVVIVKNGRRICGTGGCLASAPL HQNKSYFEFKIQSTGIWGIGVATQKVNLNQIPLGRDMHSLVMRNDGALYHNNEEKNRLPA NSLPQEGDVVGITYDHVELNVYLNGKNMHCPASGIRGTVYPVVYVDDSAILDCQFSEFYH TPPPGFEKILFEQQIF >gi568815585f:49911941_50113161|GENSCAN_predicted_CDS_1|591_bp atggccacctcggtgttgtgctgcctgcggtgctgcagagacggggggactggccacatc cctctgaaggagatgccggccgtgcagctggacacgcagcacatgggaacagatgttgtt attgtaaagaatggaagaagaatatgtggaacaggaggttgtttagccagcgcaccttta catcaaaacaaaagctattttgaattcaaaatccagtccacaggaatctggggtattggt gttgcaactcagaaggttaacttgaatcagattcctcttggccgagatatgcacagtctg gtgatgagaaatgatggagccctttaccacaacaatgaagagaaaaataggctgccagca aacagtcttccgcaggaaggagatgtggtgggtattacttatgaccatgtcgaattaaat gtatacttgaatggaaaaaacatgcattgtccagcatcaggtatacgagggacagtgtat ccagttgtttatgttgatgacagtgcaattttggattgccagttcagtgagttttatcat acgcctccacctggttttgaaaaaatattatttgaacagcaaatcttctga >gi568815585f:49911941_50113161|GENSCAN_predicted_peptide_2|303_aa MISLVDLKLCWSNNGHSLCKEESLSPILPTSDGEERRRKRKKRKRKKRKQKQQQQQEAAA PAEADAEATAAEAAAEEAAARKEEEEEGEEGEGNGEEEEEKAAAAARGDWSQQEKACKSS LQAILFGHAKLEPGRLPRPTQCRVEATVLVGTNSPLDNPLWSISWSLWQKPVGTSQQQPL GLWTREFPLEGHLLTRNETFTEATPLIPEIGMLSQMMSEEHSNGDGSAQKSSMTQWKWFI EDHATQGMQRISFPLGLTLELCEKLLDSTVPNKQLSTDQTRASWFMDGNSKAQNARKNCS VSQ >gi568815585f:49911941_50113161|GENSCAN_predicted_CDS_2|912_bp atgatctccctggttgatcttaagctctgctggtcaaataacggacattccttgtgtaaa gaggagagcctctcaccaatcctccctaccagtgatggagaagaaaggagaagaaaaagg aagaagaggaagaggaagaagaggaagcagaagcagcagcagcagcaagaagcagcagca ccagcagaagcagatgcagaagcaacagcagcagaagcagcagctgaagaagcagcagca agaaaggaggaggaggaggaaggggaggagggggaggggaatggggaggaggaggaggag aaggcagcggcggcagcaagaggggactggtcccagcaggagaaggcatgtaaatcgtcc cttcaagcaatattatttggacatgctaagctggaaccagggagactgcccaggcccaca cagtgcagagtagaagctacggtgctggtagggacaaattctccacttgataaccctcta tggagcatttcctggagcttatggcaaaagcctgtgggcacctcccagcaacaaccactg ggactttggactagagaatttccacttgaagggcatttactgactcgcaatgaaacattt actgaagccaccccactgatacctgaaataggcatgctgtctcagatgatgtcagaggaa cattctaatggggatggcagtgcccagaagagttccatgacacaatggaaatggtttata gaggatcatgctactcaaggcatgcagagaatttcttttcccctaggactgactctggaa ctgtgtgagaaactgctggattctacagtgccaaataaacaactttcaactgaccaaaca agagcttcttggtttatggatggcaattccaaggcacaaaatgctaggaagaactgttct gtttctcagtaa >gi568815585f:49911941_50113161|GENSCAN_predicted_peptide_3|236_aa MFTSLAAPPQAHDSAPLVSAALPDWSSDGLRQPPQPTQIPADLGNSPPSLKVRGPPRFPE RSRAAGTYLRPPASGKGRATARPPTTGTARTATGKLPLEKDDCPTKTCIHSYILPWKSTV ELDPSTDSTGIVNILVTLKFPSTDLQAFGLALVLPEAHTASSSRAPAPLVPSLTRPSPAT SHPHPPTIAQLTVSGRASKTDGSEVLHLRAALALRAGQGLSGPLTGPQDCPFPQHA >gi568815585f:49911941_50113161|GENSCAN_predicted_CDS_3|711_bp atgttcactagcttggccgctcctccccaggcccatgacagtgccccgctggtttctgcc gcgctgccggactggagctcagacggccttcgccagcccccccaacccacgcagatccct gctgacctgggcaactccccaccctcgctgaaggttcgaggaccaccccgctttcccgag aggagccgggcggcgggtacttatctccgacctccggctagtgggaaaggccgcgcgacc gcccgtcctccaactacagggaccgcacgaacagcgactgggaaactgccactagaaaaa gatgactgtccaactaaaacctgcatacacagctacatccttccctggaagagcacagtg gaactagatcctagtacagatagcacagggattgtcaacattcttgtcacactgaaattt ccttctactgatctccaagcatttgggttagcccttgtgttacccgaggctcatactgct tccagcagcagggctcctgcacccttagttccttccctaacgaggccctcaccagccacc tcccatccccatcctccaaccatagcacaactgacagtctcaggtagggcctctaaaacc gatggaagtgaagtgctacacctcagagctgctttagcccttagagctggccagggtctg agtgggcctctcacagggccacaggactgccctttcccccagcatgcctag >gi568815585f:49911941_50113161|GENSCAN_predicted_peptide_4|486_aa MELLEEDLTCPICCSLFDDPRVLPCSHNFCKKCLEGILEGSVRNSLWRPAPFKCPTCRKE TSATGINSLQVNYSLKGIVEKYNKIKISPKMPVCKGHLGQPLNIFCLTDMQLICGICATR GEHTKHVFCSIEDAYAQERDAFESLFQSFETWRRGDALSRLDTLETSKRKSLQLLTKDSD KVKEFFEKLQHTLDQKKNEILSDFETMKLAVMQAYDPEINKLNTILQEQRMAFNIAEAFK DVSEPIVFLQQMQEFREKIKVIKETPLPPSNLPASPLMKNFDTSQWEDIKLVDVDKLSLP QDTGTFISKIPWSFYKLFLLILLLGLVIVFGPTMFLEWSLFDDLATWKGCLSNFSSYLTK TADFIEQSVFYWEQMTLLPLPPQRPSYHDLVFQCGSDSTTDNQTGVRYVSIKPDNRKLAN GTNVLGLLIDTLLKEGFHLVSTRTVSSEDKTECYSFERIKSPEVLITNETPKPETIIIPE QSQIKK >gi568815585f:49911941_50113161|GENSCAN_predicted_CDS_4|1461_bp atggagctgcttgaagaagatctcacatgccctatttgttgtagtctgtttgatgatcca cgggttttgccttgctcccacaacttctgcaaaaaatgcttagaaggtatcttagaaggg agtgtgcggaattccttgtggagaccagctccattcaagtgtcctacatgccgtaaggaa acttcagctactggaattaatagcctgcaggttaattactccctgaagggtattgtggaa aagtataacaagatcaagatctctcccaaaatgccagtatgcaaaggacacttggggcag cctctcaacattttctgcctgactgatatgcagctgatttgtgggatctgtgctactcgt ggggagcacaccaaacatgtcttctgttctattgaagatgcctatgctcaggaaagggat gcctttgagtccctcttccagagctttgagacctggcgtcggggagatgctctttctcgc ttggataccttggaaactagtaagaggaaatccctacagttactgactaaagattcagat aaagtgaaggaattttttgagaagttacaacacacactggatcaaaagaagaatgaaatt ctgtctgactttgagaccatgaaacttgctgttatgcaagcatatgacccagagatcaac aaactcaacaccatcttgcaggagcaacggatggcctttaacattgctgaggctttcaaa gatgtgtcagaacccattgtatttctgcaacagatgcaggagtttagagagaaaatcaaa gtaatcaaggaaactcctttacctccctctaatttgcctgcaagccctttaatgaagaac tttgataccagtcagtgggaagacataaaactagtcgatgtggataaactttctttgcct caagacactggcacattcattagcaagattccctggagcttttataagttatttttgcta atccttctgcttggccttgtcattgtctttggtcctaccatgttcctagaatggtcatta tttgatgacctggcaacttggaaaggctgtctttcaaacttcagttcctatctgactaaa acagccgatttcatagaacaatcagttttttactgggaacagatgaccttacttccactg cctccacaaagaccttcttaccatgacctggttttccagtgtggttctgacagcactact gataaccaaactggagtcaggtatgtttctataaaacctgataaccgaaaattggccaac ggaacaaatgtcctcggcttactgattgacactttattaaaggaaggctttcatttggtc agcactagaacagtatcttctgaagacaaaactgaatgctatagctttgaaaggataaaa agccctgaagtgctcatcacgaatgaaacaccaaaaccagagactatcatcataccagag caatctcagataaagaaatga >gi568815585f:49911941_50113161|GENSCAN_predicted_peptide_5|114_aa MPSPGWVFNADKYALFWKKMPQRTFISKEEKQAPGFKAGRDRQTLLFCANAVGLLPLAIK LLIPEPQREKTPAASLLVVQQEETAKPTPFLPQPAQCENSEDEGLYDDPLPFNQ >gi568815585f:49911941_50113161|GENSCAN_predicted_CDS_5|345_bp atgcccagcccaggatgggtttttaatgctgataaatatgccctattctggaagaaaatg ccacaaaggacatttattagtaaggaagaaaagcaagcaccaggatttaaggcaggaaga gataggcaaactctactgttctgtgcaaatgcagtcggtttgctgcccttagccataaag ctgctaatccctgagcctcaaagggaaaaaacaccagcagccagtcttttggttgtacaa caagaagagacagcaaaaccaaccccttttcttcctcagcctgctcaatgtgaaaacagt gaagatgaaggcctgtatgatgatccacttccatttaatcaatag >gi568815585f:49911941_50113161|GENSCAN_predicted_peptide_6|317_aa MTAITGSRKRREKAARFGNGGGARVQSSCQGAAACRDPAVRRPRLRSRAGGLADRSAGRR LLRRSRHMRRIIVVHGSPFASSVAVLLLLAREKKNVLQPVSKLQTTECINETLYMDTRAF KTALPACRAFAVLQRFTGVIALKPRNNFTRLVYVHSIQDWNPGQWTVLQVLTDAGMLTAT KKKLYLQPSCGEEVIYELHQMAISVSFLDDLEDEASWPGKQNSCGCGDHVQEVPKLKVCA LHMSSWACSQIPKAGDKILTFDQLTLDTLKGCGTILLSGPHKGQEVYWHFSKALGTQHSH TKPCVHSRDQKFKHIRG >gi568815585f:49911941_50113161|GENSCAN_predicted_CDS_6|954_bp atgacggccatcaccgggtcgagaaaacggcgagaaaaggcggcccggttcggaaatggg ggaggggcgcgcgtccagtcctcctgtcaaggagcggccgcctgcagagaccccgcagtg cgccgtccccggctccggtcccgggcggggggtcttgctgacagatccgctgggcggcgg ctgctccgccgcagccggcacatgcgcagaatcatcgtggtgcacggctctccctttgct tcttcggttgcagtcctcttgcttcttgcgcgagaaaagaaaaacgtcttacagccagtg tctaaactccaaacaacggaatgtatcaatgagaccttgtatatggatacacgtgcattt aaaaccgccctgccggcttgtagagcttttgccgttctccagcgctttacaggggttatc gcacttaagcctcggaacaactttaccaggcttgtttatgtgcattccatccaggattgg aatcctggacaatggacagtgttgcaggtattgactgatgcaggcatgctgacagccaca aaaaagaaactctaccttcaaccaagttgtggtgaagaggttatttatgagttgcaccaa atggccatctctgtctctttcctggatgatctggaagatgaagcttcctggccaggaaaa caaaacagctgtggttgtggggaccatgttcaggaggtgcccaaactgaaggtgtgtgca ctgcacatgagcagttgggcctgcagccaaatccccaaggctggggacaagattctcacc tttgaccagctgaccctggacaccctcaaaggctgtggcaccatcctgctctctgggcct cacaagggccaagaagtgtactggcatttcagcaaggccctgggaacccagcatagccac actaagccctgtgtccactccagggaccagaaattcaagcacatcagaggctga