GENSCAN 1.0    Date run: 4-Nov-116    Time: 21:14:54

Sequence gi568815593f:172986499_173233742 : 247244 bp : 44.43% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +    182    262   81  1  0   97   81   101 0.978  11.27
 1.02 Intr +   8277   8324   48  2  0  102   89     0 0.169   0.38
 1.03 Term +  43599  43619   21  2  0  117   42    14 0.143  -1.99
 1.04 PlyA +  45661  45666    6                               1.05

 2.03 PlyA -  45692  45687    6                               1.05
 2.02 Term -  47853  47711  143  1  2   61   41    79 0.574  -1.41
 2.01 Init -  50023  49906  118  1  1   82   49    93 0.795   5.22
 2.00 Prom -  63054  63015   40                              -2.56

 3.06 PlyA -  63063  63058    6                               1.05
 3.05 Term -  69341  69237  105  2  0   43   42   160 0.856   5.31
 3.04 Intr -  70363  70197  167  2  2  127   23    59 0.718   2.98
 3.03 Intr -  92589  92495   95  2  2   52   86    77 0.320   3.51
 3.02 Intr -  99407  99271  137  2  2   24   91   132 0.351   6.47
 3.01 Init - 101712 101695   18  0  0   62   89    31 0.405   0.65
 3.00 Prom - 102049 102010   40                              -9.85

 4.00 Prom + 102261 102300   40                              -4.96
 4.01 Init + 103826 104903 1078  1  1   69   67   688 0.091  59.36
 4.02 Intr + 122126 122320  195  0  0   33   95   129 0.942   7.49
 4.03 Intr + 124024 124213  190  2  1   92   99    86 0.414   8.74
 4.04 Intr + 125808 125881   74  0  2   79   52    66 0.343   1.15
 4.05 Intr + 136582 136704  123  2  0   90  103    72 0.613   9.46
 4.06 Term + 147132 147247  116  1  2  100   37    15 0.184  -3.67
 4.07 PlyA + 147594 147599    6                              -0.45

 5.00 Prom + 151100 151139   40                              -3.36
 5.01 Init + 158048 158131   84  1  0  115   84   115 0.939  14.62
 5.02 Intr + 160368 160460   93  2  0   70   94    27 0.714   1.66
 5.03 Intr + 167824 167915   92  0  2   74   96    73 0.975   5.49
 5.04 Intr + 173435 173553  119  2  2   97   71   156 0.999  14.91
 5.05 Term + 177227 177423  197  0  2   94   50   290 0.982  23.37
 5.06 PlyA + 177838 177843    6                               1.05

 6.05 PlyA - 177855 177850    6                               1.05
 6.04 Term - 188969 188855  115  1  1   40   43    97 0.299  -1.56
 6.03 Intr - 191105 190998  108  1  0   66   99    45 0.494   2.80
 6.02 Intr - 197480 197326  155  2  2   83  102    67 0.172   6.47
 6.01 Init - 207199 207125   75  1  0   80   79    31 0.021   2.49
 6.00 Prom - 218683 218644   40                              -5.36

 7.03 PlyA - 219220 219215    6                              -0.45
 7.02 Term - 219438 219288  151  2  1   84   43   141 0.944   6.58
 7.01 Init - 222288 222218   71  0  2   49   78    63 0.707   1.92
 7.00 Prom - 222452 222413   40                              -2.16

 8.00 Prom + 226228 226267   40                              -7.76
 8.01 Sngl + 227201 227383  183  1  0   97   48   215 0.488  11.66
 8.02 PlyA + 227777 227782    6                               1.05

 9.06 PlyA - 232063 232058    6                               1.05
 9.05 Term - 239520 239338  183  0  0   62   36   226 0.987  12.34
 9.04 Intr - 242481 242089  393  0  0   32   65   135 0.278   0.45
 9.03 Intr - 243466 243264  203  2  2   98   60   116 0.941   8.80
 9.02 Intr - 245133 244966  168  1  0  -21   65   140 0.079   0.72
 9.01 Intr - 246711 246088  624  1  0  103   23  1145 0.123 101.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 120439 120487   49  0  1   77   58    66 0.879   1.61
S.002 Term - 246711 246071  641  1  2  103   50  1143 0.822 106.37

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_1|49_aa
MSIFTDGRKKPLKQPKKQAKEMDGEDKSYHYHVGDLFSLLLSLCPAGKS

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_1|150_bp
atgtccattttcacagatggcaggaagaagcccctgaaacagcccaagaagcaggccaag
gagatggatggggaagataagagttatcattaccatgttggtgacctgttcagtttgctg
ctatctctttgcccagctggcaaatcctag

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_2|86_aa
MTFAWKGSGAQEHFRYPLNGLNGGPSSLGPDKHWMECGNGRQKVNIEKVNQANTLKCTDA
QQNMKHRIINLVQVWPGFAGLWQKVQ

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_2|261_bp
atgacgttcgcctggaagggctctggagcgcaggaacattttcggtatcctctgaacggg
ctaaatggaggtccttccagtctgggccctgacaaacactggatggagtgtggcaacgga
aggcaaaaagttaacatcgagaaagttaaccaagccaacactttaaaatgcacagatgct
caacaaaatatgaaacacaggataatcaacttggtgcaagtttggccaggttttgcaggg
ctgtggcaaaaggtccagtaa

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_3|173_aa
MEEELQTPFLLVGTQTDLRDDPSTIEKLAKNKQKPITPETAKKLTHDLKAISLSLCDLMP
SSYNWYVVIIIALILSPDSYEKEGRLPAAAAPGSEAAGKGRAARQDGRPALPIPGLAYCP
PASPIYRRPALLCNRPGPGHGFRGTGINQTACTFEERDPYTTVQPECHYYRSA

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_3|522_bp
atggaagaagagctgcagactcctttcttgcttgttgggacccaaactgatctcagagat
gacccctctactattgagaaacttgccaagaacaaacagaagcctatcactccagagact
gctaaaaagctgacccatgacctgaaggccatcagtctctccttatgtgacctcatgccg
tcgtcctacaactggtatgtagtcataatcattgccctaattctctcccctgattcctat
gagaaagaaggccgactgccggcagcagcggccccgggctcggaggcagcggggaagggc
cgggcggcccggcaggacggacgcccggcgctgcccatccccggcctagcctactgcccg
cccgcgagtcccatctaccgccgccccgcgcttttatgtaaccgtcccgggccggggcac
ggattccgaggcacagggatcaatcagactgcgtgcacctttgaagagagagacccgtat
acgactgtacaaccagaatgccactactacagaagcgcctag

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_4|591_aa
MNYQQNPRDNFLSLEDCKDIENLESFTDVLDNEGALTSNWEQWDTYCEDLTKYTKLTSCD
IWGTKEVDYLGLDDFSSPYQDEEVISKTPTLAQLNSEDSQSVSDSLYYPDSLFSVKQNPL
PSSFPGKKITSRAAAPVCSSKTLQAEVPLSDCVQKASKPTSSTQIMVKTNMYHNEKVNFH
VECKDYVKKAKVKINPVQQSRPLLSQIHTDAAKENTCYCGAVAKRQEKKGMEPLQGHATP
ALPFKETQELLLSPLPQEGPGSLAAGESSSLSASTSVSDSSQKKEEHNYSLFVSDNLGEQ
PTKCSPEEDEEDEEDVDDEDHDEGFGSEHELSENEEEEEEEEDYEDDKDDDISDTFSEPG
YENDSVEDLKEVTSISSRKRGKRRYFWEYSEQLTPSQQERMLRPSEWNRDTLPSNMYQKN
GLHHGKYAVKKSRRTDVEDLTPNPKKLLQIGNELRKLNKVISDLTPVSELPLTARPRSRK
EKNKLASRACRLKKKAQYEANKVKLWGLNTEYDNLLFVINSIKQEIVNRVQNPRDERGPN
MGQKLEILIKDTLGLPVAGQTSEFVNQVLEKTAEGNPTGGLVGLRIPTSKV

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_4|1776_bp
atgaactaccaacagaatcctagagacaactttctttctttggaggactgcaaagacatt
gaaaatctggagtctttcacagatgtcctggataatgagggtgctttaacctcaaactgg
gaacagtgggatacatactgtgaagacctaacgaaatataccaaactaaccagctgtgac
atctggggaacaaaagaagtggattacttgggtcttgatgacttttctagtccttaccaa
gatgaagaggttataagtaaaactccaactttagctcaacttaatagtgaggactcacag
tctgtttctgattccctttattaccccgattcacttttcagtgtcaaacaaaatccctta
ccctcttcattccctggtaaaaagatcacaagcagagcagctgctcctgtgtgttcttct
aagactctgcaggctgaggtccctttgtcagactgtgtccaaaaagcaagtaaacccact
tcaagcacacaaatcatggtgaagaccaacatgtatcataatgaaaaggtgaactttcat
gttgaatgtaaagactatgtaaaaaaggcaaaggtaaagatcaacccagtgcaacagagc
cggcccttgttgagccagattcacacagatgcagcaaaggagaacacctgctactgtggt
gcagtggcaaagagacaagagaaaaaagggatggagcctcttcaaggtcatgccactccc
gctttgccttttaaagaaacccaggaactattactaagtcccctgccccaggaaggtcct
gggtcacttgcagcaggagagagcagcagtctttctgccagtacatcagtctcagattca
tcccagaaaaaagaagagcacaattattctctttttgtctccgacaacttgggtgaacag
ccaactaaatgcagtcctgaagaagatgaggaggacgaggaggatgttgatgatgaggac
catgatgaaggattcggcagtgagcatgaactgtctgaaaatgaggaggaggaagaagag
gaagaggattatgaagatgacaaggatgatgatattagtgatactttctctgaaccaggc
tatgaaaatgattctgtagaagacctgaaggaggtgacttcaatatcttcacggaagaga
ggtaaaagaagatacttctgggagtatagtgaacaacttacaccatcacagcaagagagg
atgctgagaccatctgagtggaaccgagatactttgccaagtaatatgtatcagaaaaat
ggcttacatcatggaaaatatgcagtaaagaagtcacggagaactgatgtagaagacctg
actccaaatcctaaaaaactcctccagataggcaatgaacttcggaaactgaataaggtg
attagtgacctgactccagtcagtgagcttcccttaacagcccgaccaaggtcaaggaag
gaaaaaaataagctggcttccagagcttgtcggttaaagaagaaagcccagtatgaagct
aataaagtgaaattatggggcctcaacacagaatatgataatttattgtttgtaatcaac
tccatcaagcaagagattgtaaaccgggtacagaatccaagagatgagagaggacccaac
atggggcagaagcttgaaatcctcattaaagatactctcggtctaccagttgctgggcaa
acctcagaatttgttaaccaagtgttagagaagactgcagaagggaatcccactggaggc
cttgtaggattaaggataccaacatcaaaggtgtaa

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_5|194_aa
MAAPQDVHVRICNQEIVKFDLEVKALIQDIRDCSGPLSALTELNTKVKEKFQQLRHRIQD
LEQLAKEQDKESEKQLLLQEVENHKKQMLRKTTKESLAQTSSTITESLMGISRMMAQQVQ
QSEEAMQSLVTSSRTILDANEEFKSMSGTIQLGRKLITKYNRRELTDKLLIFLALALFLA
TVLYIVKKRLFPFL

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_5|585_bp
atggcggctccccaagacgtccacgtccggatctgtaaccaagagattgtcaaatttgac
ctggaggtgaaggcgcttattcaggatatccgtgattgttcaggacccttaagtgctctt
actgaactgaatactaaagtaaaagagaaatttcaacagttgcgtcacagaatacaggac
ctggagcagttggctaaagagcaagacaaagaatcagagaaacaacttctactccaggaa
gtggagaatcacaaaaagcagatgctcaggaaaaccaccaaagagagcctggcccagaca
tccagtaccatcactgagagcctcatggggatcagcaggatgatggcccagcaggtccag
cagagcgaggaggccatgcagtctctagtcacttcttcacgaacgatcctggatgcaaat
gaagaatttaagtccatgtcgggcaccatccagctgggccggaagcttatcacaaaatac
aatcgccgggagctgacggacaagcttctcatcttccttgcgctagccctgtttcttgct
acggtcctctatattgtgaaaaagcggctctttccatttttgtga

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_6|150_aa
MDTDVDLHSIGGMSKDLQPRFKTTTGGHWRILKEAMACASEREGLTQGTNLNEESPAALG
PVRSCQNDPSKAQGGPSHQWPFHMDLLQMTSLAEGPGGRDMTPFSQPSSQSVRNWVALQE
VSRQRALPPELYPLSDQQRHEILIGARTLL

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_6|453_bp
atggacacagacgtagacctgcactccataggaggaatgtcaaaggatttgcagccacgt
tttaaaaccaccacaggaggccactggagaattctcaaggaggccatggcatgtgcctct
gaaagagaggggctgacgcagggcaccaacttgaatgaggaaagtccggctgcattgggc
ccagtcagaagctgccagaatgacccaagcaaggcacagggaggcccaagccaccagtgg
cctttccacatggacttgctgcaaatgaccagtctggctgagggccctgggggccgggac
atgacacccttctctcagccctcatcacagtctgtgaggaactgggtggccctgcaggag
gtcagccgccagcgagcattaccgcctgagctctacccactgtcggatcagcagcggcat
gagattctcataggagcgagaaccctattgtga

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_7|73_aa
MEQLEEDLRREFKTEGAARAKALRNRRGFVLSCASEERHPLRGHWMPHAFDVKVPFENGP
TLYQQVPETPPGY

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_7|222_bp
atggagcagctggaggaagatctgagaagagagtttaagacagagggagcagctcgtgca
aaggccctgagaaacaggagaggtttcgtcctgagctgtgcaagtgaggagcgtcaccct
ctgagagggcactggatgccacacgctttcgatgtcaaggtcccttttgaaaatggccct
actctgtaccaacaagttccagagacaccaccaggctactaa

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_8|60_aa
MPEPAPDAVGSCAAPASPTSAAPCSTAPGPIDCPRAEECRRKHHGTGRQLRLRPGSRTTE

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_8|183_bp
atgcctgagcctgcccctgacgccgtgggctcctgcgcggccccagcctccccgacgagc
gccgccccctgctccacggcgcccggtcccatcgactgcccaagggctgaggagtgccgg
cgcaagcaccacgggactggcaggcagctccgcctgcggcccgggtccaggaccactgag
tga

>gi568815593f:172986499_173233742|GENSCAN_predicted_peptide_9|523_aa
XLCALQKAVELEKTEADNAERPRARRRRKPRVLFSQAQVYELERRFKQQRYLSAPERDQL
ASVLKLTSTQVKIWFQNRRYKCKRQRQDQTLELVGLPPPPPPPARRIAVPVLVRDGKPCL
GDSAPYAPAYGVGLNPYGYNAYPAYPGYGGAACSPGYSCTAAYPAGPSPAQPATAAANNN
FVNFGVGDLNAVQSPGIPQSNSGVSTLHACFQMEKPRTQALGKRSQAPQMPPDATSGPHQ
LPVGLSAEAGPQRGVLRMVLTSSCDRETEEVTWVVKIIQSLSGNGSWETHRTHTGESVPE
ARARHTVDVSPVKMQTEWLQAALSRTQTLASGDARKPQVSPLTRKHHAVPRGSSTVFPAA
PAPRILREAFRAQAQRFYAAGNSLVASKPCRLPYQAELLEKGACGLPLIYHSQSLSRARH
CSELFTSEPIESLFIPLGYREETDEGFILNPERRSTLPRVTQLVNSEDKRALAKLVEAIR
TNYNDKYDEIHRHWGGSVLGPKSVVHIAKLEKAKAKELATKLG

>gi568815593f:172986499_173233742|GENSCAN_predicted_CDS_9|1572_bp
nagctgtgcgcgctgcagaaggcggtggagctggagaagacagaggcggacaacgcggag
cggccccgggcgcgacggcggaggaagccgcgcgtgctcttctcgcaggcgcaggtctat
gagctggagcggcgcttcaagcagcagcggtacctgtcggcccccgaacgcgaccagctg
gccagcgtgctgaaactcacgtccacgcaggtcaagatctggttccagaaccggcgctac
aagtgcaagcggcagcggcaggaccagactctggagctggtggggctgcccccgccgccg
ccgccgcctgcccgcaggatcgcggtgccagtgctggtgcgcgatggcaagccatgccta
ggggactcggcgccctacgcgcctgcctacggcgtgggcctcaatccctacggttataac
gcctaccccgcctatccgggttacggcggcgcggcctgcagccctggctacagctgcact
gccgcttaccccgccgggccttccccagcgcagccggccactgccgccgccaacaacaac
ttcgtgaacttcggcgtcggggacttgaatgcggttcagagccccgggattccgcagagc
aactcgggagtgtccacgctgcatgcctgtttccagatggaaaagccaagaacccaagcc
cttggcaagcgttctcaggctcctcagatgcccccagatgccacgtcggggcctcatcag
ctgcccgtgggactgagtgccgaggctggaccccagagaggtgtcctgcggatggtgctc
acctccagctgtgaccgggaaactgaggaggttacgtgggtggtcaagatcatccagtct
ctaagcgggaacgggagctgggaaacacacaggacacatactggggagagtgttcctgaa
gccagagcccgccacactgtggatgtctctccagtaaagatgcagacagaatggctccag
gccgcgctgagcagaacccagactcttgccagtggggacgcgcggaagccgcaggtttca
ccgctgactcggaaacatcacgcggtcccgcgcgggagtagcaccgtcttccccgcagcg
cccgcccctcgcatcctccgggaagcattccgagctcaggcccagcgcttctacgccgca
ggcaacagccttgtggcctctaagccttgccgactcccctaccaggctgagctcctggag
aaaggggcctgtggcctgcctttaatttatcactcacagagtttatcgcgtgccaggcac
tgttctgagctctttacaagtgaacccattgaatccctttttatacctctgggttatcgt
gaggaaacagatgaaggattcattttaaacccagaaagaaggagtacgttgcccagggtc
acacagctggttaactcagaagacaaaagagctttggctaagctggtggaagctatcagg
accaattacaacgacaaatatgatgagatccaccgtcactggggaggcagtgtcctgggt
cccaagtctgtggttcacatcgccaagcttgaaaaggcaaaggctaaagaacttgccacc
aaactgggttaa