GENSCAN 1.0    Date run: 19-Jun-119    Time: 14:24:05

Sequence gi568815589f:97956809_98183037 : 226229 bp : 46.33% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   4569   4608   40                              -0.86
 1.01 Init +   7519   7593   75  0  0   47  116    -3 0.620  -0.49
 1.02 Term +   9532   9651  120  0  0  116   48    71 0.808   4.37
 1.03 PlyA +  10635  10640    6                               1.05

 2.00 Prom +  24772  24811   40                              -3.46
 2.01 Init +  26748  26801   54  2  0  101   61    74 0.909   7.15
 2.02 Intr +  37823  37972  150  1  0  112  115    53 0.995  10.76
 2.03 Intr +  41748  41870  123  2  0   93  111    41 0.994   7.68
 2.04 Intr +  48156  48345  190  2  1   55  113   317 0.970  30.06
 2.05 Intr +  54463  54581  119  2  2   48   78   391 0.993  34.28
 2.06 Intr +  55613  55664   52  1  1  114   86   152 0.979  16.08
 2.07 Intr +  61481  61681  201  0  0  103  100    61 0.894   8.06
 2.08 Intr +  61740  61882  143  1  2   47   73    93 0.700   3.77
 2.09 Intr +  78522  78709  188  2  2   82   44    61 0.102  -0.41
 2.10 Term +  80159  80270  112  2  1  145   54   -11 0.081  -0.77
 2.11 PlyA +  84360  84365    6                               1.05

 3.00 Prom +  90516  90555   40                              -4.76
 3.01 Init + 100001 100132  132  1  0   78  100   254 0.996  24.14
 3.02 Intr + 103974 104189  216  2  0   56  105   279 0.981  25.10
 3.03 Intr + 104306 104337   32  1  2  101   44    26 0.014  -3.57
 3.04 Intr + 121254 121539  286  0  1  116   97   233 0.162  24.34
 3.05 Intr + 124008 124274  267  2  0   79   93   274 0.999  24.73
 3.06 Term + 126038 126247  210  1  0   89   42   280 0.969  20.79
 3.07 PlyA + 126249 126254    6                               1.05

 4.06 PlyA - 127572 127567    6                               1.05
 4.05 Term - 131197 130662  536  2  2   88   50  1008 0.878  91.41
 4.04 Intr - 135193 135152   42  2  0   79   82    52 0.738   1.91
 4.03 Intr - 138221 138059  163  2  1   86  106   273 0.991  28.45
 4.02 Intr - 143356 143123  234  1  0  108   95   180 0.980  18.59
 4.01 Init - 149666 149580   87  2  0   66   50    49 0.210  -0.45
 4.00 Prom - 150761 150722   40                              -7.16

 5.03 PlyA - 152242 152237    6                               1.05
 5.02 Term - 153176 152997  180  2  0   88   48   147 0.988   8.31
 5.01 Init - 162380 162174  207  2  0   40   75   241 0.988  14.86
 5.00 Prom - 164145 164106   40                              -9.16

 6.13 PlyA - 164193 164188    6                               1.05
 6.12 Term - 168097 167966  132  1  0   89   53   233 0.992  17.99
 6.11 Intr - 169952 169741  212  0  2   85   48    89 0.532   3.23
 6.10 Intr - 171452 171362   91  2  1   92   78    61 0.988   5.07
 6.09 Intr - 171911 171799  113  0  2  103   90   226 0.999  24.40
 6.08 Intr - 173082 172986   97  0  1  145  123   230 0.999  31.78
 6.07 Intr - 174251 174147  105  2  0  136   63   170 0.960  19.71
 6.06 Intr - 175493 175377  117  2  0   69   94   139 0.994  13.26
 6.05 Intr - 176409 176230  180  0  0   76   82   227 0.954  20.96
 6.04 Intr - 178147 177998  150  1  0   92  113   240 0.998  27.26
 6.03 Intr - 180880 180764  117  1  0   78   79   138 0.965  12.56
 6.02 Intr - 200852 200652  201  2  0  132  110   340 0.999  40.08
 6.01 Init - 203966 203895   72  2  0   74  100    47 0.532   5.59

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 104306 104341   36  1  0  101   54    31 0.925  -1.66

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815589f:97956809_98183037|GENSCAN_predicted_peptide_1|64_aa
MFPYFSDFIVLELRGTLGISWCELKVISLIQFLMNHNKNFEINCSDYQNSLEEWRYCDVE
SLSE

>gi568815589f:97956809_98183037|GENSCAN_predicted_CDS_1|195_bp
atgtttccatacttttcagacttcatagtgttagagctgagagggaccttggggatctcc
tggtgcgagttaaaggtaatttctcttattcagtttcttatgaaccataacaagaacttt
gaaatcaattgttctgattaccaaaacagtctggaggagtggcgatactgcgatgtagag
agcctcagtgaatga

>gi568815589f:97956809_98183037|GENSCAN_predicted_peptide_2|443_aa
MDMKRRIHLELRNRTPAAVRELVLDNCKSNDGKIEGLTAEFVNLEFLSLINVGLISVSNL
PKLPKLKKLELSENRIFGGLDMLAEKLPNLTHLNLSGNKLKDISTLEPLKKLECLKSLDL
FNCEVTNLNDYRESVFKLLPQLTYLDGYDREDQEAPDSDAEVDGVDEEEEDEEGEDEEDE
DDEDGEEEEFDEEDDEDEDVEGDEDDDEVSEEEEEFGLDEEDEDEDEDEGVQHLKNDSGQ
RGNAQTTLCKILHPTRSCITTTRTPGTYSRTDTGELSRPVPGIPIWATLPTPSPEKALLH
CAAINFNEPDDTAVLTVWKGNQVTSRLKLQPAKQLLCRKITKIAHLLWYQARKQALETAL
VDAAVGCTGKRRQMRDVRGHKPPPETEGVDFHPPHPPECDGGFALSGALPWVQCWEGTPN
GSLTEGAYTLEGMTDKRGRCHKG

>gi568815589f:97956809_98183037|GENSCAN_predicted_CDS_2|1332_bp
atggacatgaagaggaggatccacctggagctgaggaaccggaccccggcagctgttcga
gaacttgtcttggacaattgcaaatcaaatgatggaaaaattgagggcttaacagctgaa
tttgtgaacttagagttcctcagtttaataaatgtaggcttgatctcagtttcaaatctc
cccaagctgcctaaattgaaaaagcttgaactcagtgaaaatagaatctttggaggtctg
gacatgttagctgaaaaacttccaaatctcacacatctaaacttaagtggaaataaactg
aaagatatcagcaccttggaacctttgaaaaagttagaatgtctgaaaagcctggacctc
tttaactgtgaggttaccaacctgaatgactaccgagagagtgtcttcaagctcctgccc
cagcttacctacttggatggctatgaccgagaggaccaggaagcacctgactcagatgcc
gaggtggatggtgtggatgaagaggaggaggacgaagaaggagaagatgaggaagacgag
gacgatgaggatggtgaagaagaggagtttgatgaagaagatgatgaagatgaagatgta
gaaggggatgaggacgacgatgaagtcagtgaggaggaagaagaatttggacttgatgaa
gaagatgaagatgaggatgaggatgaaggagtccagcacctgaagaatgatagtggccag
cgtggaaatgcacaaaccaccttgtgtaagatactgcaccctacccgctcctgtatcacc
accaccaggactcctggtacatactccaggacagatacaggagagttgagtcgtcctgtt
cctggcattcctatctgggctacactacctaccccatcgccagagaaagctctgttgcat
tgtgctgccatcaattttaatgagcctgatgataccgccgtgttgacagtttggaaaggg
aatcaagttactagtagacttaagctgcagccagccaaacagttgctctgcaggaaaatc
accaagattgcacacctgctgtggtaccaggcacgcaagcaggcactcgaaactgctttg
gtggatgcagcagtggggtgcactggaaaacggaggcaaatgagggatgtcagaggtcac
aagcctccaccagagactgaaggtgttgactttcaccccccacaccctccagagtgtgat
ggggggtttgctctctcaggggctttgccctgggttcagtgctgggaaggtactccaaat
ggttccttaactgaaggcgcctacactctagagggaatgacagacaaaagaggcagatgt
cataaagggtga

>gi568815589f:97956809_98183037|GENSCAN_predicted_peptide_3|380_aa
MPLELELCPGRWVGGQHPCFIIAEIGQNHQGDLDVAKRMIRMAKECGADCAKFQKSELEF
KFNRKALERPYTSKHSWGKTYGEHKRHLEFSHDQYRELQRYAEEVGIFFTASGMDELLER
LAGAKAQGWHSVLRDVCGVQLNDETSSWDVLGRVRTSKEKVLMVLVLDYSGRPMVISSGM
QSMDTMKQVYQIVKPLNPNFCFLQCTSAYPLQPEDVNLRVISEYQKLFPDIPIGYSGHET
GIAISVAAVALGAKVLERHITLDKTWKGSDHSASLEPGELAELVRSVRLVERALGSPTKQ
LLPCEMACNEKLGKSVVAKVKIPEGTILTMDMLTVKVGEPKGYPPEDIFNLVGKKVLVTV
EEDDTIMEELVDNHGKKIKS

>gi568815589f:97956809_98183037|GENSCAN_predicted_CDS_3|1143_bp
atgccgctggagctggagctgtgtcccgggcgctgggtgggcgggcaacacccgtgcttc
atcattgccgagatcggccagaaccaccagggcgacctggacgtagccaagcgcatgatc
cgcatggccaaggagtgtggggctgattgtgctaagttccagaagagtgagctagaattc
aagtttaatcggaaagccttggagaggccatacacctcgaagcattcctgggggaagacg
tacggggagcacaaacgacatctggagttcagccatgaccagtacagggagctgcagagg
tacgccgaggaggttgggatcttcttcactgcctctggcatggatgagctactggagagg
ctagcaggcgccaaagcccagggctggcacagtgttttaagagatgtctgtggagttcag
ttgaatgatgagaccagcagttgggatgtgttggggagagtcagaacctctaaagagaaa
gtgctgatggtgttggtgctggattactcaggtcgcccaatggtgatctccagtgggatg
cagtcaatggacaccatgaagcaagtttatcagatcgtgaagcccctcaaccccaacttc
tgcttcttgcagtgtaccagcgcatacccgctccagcctgaggacgtcaacctgcgggtc
atctcggaatatcagaagctctttcctgacattcccatagggtattctgggcatgaaaca
ggcatagcgatatctgtggccgcagtggctctgggggccaaggtgttggaacgtcacata
actttggacaagacctggaaggggagtgaccactcggcctcgctggagcctggagaactg
gccgagctggtgcggtcagtgcgtcttgtggagcgtgccctgggctccccaaccaagcag
ctgctgccctgtgagatggcctgcaatgagaagctgggcaagtctgtggtggccaaagtg
aaaattccggaaggcaccattctaacaatggacatgctcaccgtgaaggtgggtgagccc
aaaggctatcctcctgaagacatctttaatctagtgggcaagaaggtcctggtcactgtt
gaagaggatgacaccatcatggaagaattggtagataatcatggcaaaaaaatcaagtct
taa

>gi568815589f:97956809_98183037|GENSCAN_predicted_peptide_4|353_aa
MELSNGYSHGMGSPSSTLPTSFSVNNLAPANAESSKTWLKGKFTELRLLLDEEEALAKKF
IDKNTQLTLQVYREQADSCREQLDIMNDLSNRVWSISQEPDPVQRLQAYTATEQEMQQQM
SLGELCHPVPLSFEPVKSFFKGLVEAVESTLQTPLDIRLKESINCQLSDPSSTKPDARTP
TLDPDTMHARLRLSADRLTVRCGLLGSLGPVPVLRFDALWQVLARDCFATGRHYWEVDVQ
EAGAGWWVGAAYASLRRRGASAAARLGCNRQSWCLKRYDLEYWAFHDGQRSRLRPRDDLD
RLGVFLDYEAGVLAFYDVTGGMSHLHTFRATFQEPLYPALRLWEGAISIPRLP

>gi568815589f:97956809_98183037|GENSCAN_predicted_CDS_4|1062_bp
atggagctgagcaatggctactcccatggcatgggatctcccagctccaccctgcccact
tccttcagtgttaacaacctggctcctgctaatgcagagtcaagtaaaacctggctgaag
gggaaattcactgaactcagattactacttgacgaagaggaagcgctggccaagaaattc
attgataaaaacacgcagcttaccctccaggtgtacagggaacaagctgactcttgcaga
gagcaacttgacatcatgaatgatctctccaacagggtctggagtatcagccaggagccc
gatcctgtccagaggcttcaggcatacacggccaccgagcaggagatgcagcagcagatg
agcctcggggagctgtgccatcccgtgcccctctcctttgagcccgtcaagagcttcttt
aagggcctcgtggaagccgtggagagtacattacagacgccattggacattcgccttaag
gaaagcataaactgccagctctcagacccttccagcaccaagccagacgcgcgcacgccc
acgctggatcctgacacgatgcacgcgcgcctgcgcctgtccgccgatcgcctgacggtg
cgctgcggcctgctgggcagcctggggcccgtgcccgtgctgcggttcgacgcgctctgg
caagtgctggctcgtgactgcttcgccaccggccgccactactgggaggttgacgtgcag
gaggcgggcgccggctggtgggtgggcgcggcctacgcctcccttcggcgccgcggggcc
tcggccgccgcccgcctgggctgcaaccgccagtcctggtgcctcaagcgctacgacctt
gagtactgggccttccacgacggccagcgcagccgcctgcggccccgcgacgacctcgac
cggctcggcgtcttcctggactacgaggccggcgtcctcgccttctacgacgtgacgggc
ggcatgagccacctgcataccttccgcgccacgttccaggagccgctctacccggccctg
cggctctgggagggggccatcagcatcccccggctgccctag

>gi568815589f:97956809_98183037|GENSCAN_predicted_peptide_5|128_aa
MAGAATGSRTPGRSELVEGCGWRCPEHGDRVAELFCRRCRRCVCALCPVLGAHRGHPVGL
ALEAAVHVQKLSQECLKQLAIKKQQHIDNITQIEDATEKLKASGHFFLIPMERPTFPPRQ
KGPPIWSQ

>gi568815589f:97956809_98183037|GENSCAN_predicted_CDS_5|387_bp
atggcgggcgcggcgaccgggagccggacccctgggaggtcggagcttgtcgagggatgc
ggctggcgctgcccggagcatggcgaccgcgtggctgagctcttctgtcgccgctgccgc
cgctgcgtgtgcgcgctttgcccggtgctgggcgcgcaccgtggccaccctgtgggcctg
gcgctggaggcagcggtgcacgtgcagaaactcagccaagaatgtttaaagcagctggca
atcaagaagcagcagcacattgacaacataacccagatagaagatgccaccgagaagctc
aaggcaagtggacatttcttcctcatacccatggaaagacccacatttccccccagacaa
aagggacccccaatatggagccaatga

>gi568815589f:97956809_98183037|GENSCAN_predicted_peptide_6|528_aa
MGRLVSITSCHRQVHSRHPQQLLKMSWHPQYRSSKFRHVFGKPASKENCYDSVPITRSVH
DNHFCAVNPHFIAVVTECAGGGAFLVIPLHQTGKLDPHYPKVCGHRGNVLDVKWNPFDDF
EIASCSEDATIKIWSIPKQLLTRNLTAYRKELVGHARRVGLVEWHPTAANILFSAGYDYK
VMIWNLDTKESVITSPMSTISCHQDVILSMSFNTNGSLLATTCKDRKIRVIDPRAGTVLQ
EASYKGHRASKVLFLGNLKKLMSTGTSRWNNRQVALWDQDNLSVPLMEEDLDGSSGVLFP
FYDADTSMLYVVGKGDGNIRYYEVSADKPHLSYLTEYRSYNPQKGIGVMPKRGLDVSSCE
IFRFYKLITTKSLIEPISMIVPRRSESYQEDIYPPTAGAQPSLTAQEWLSGMNRERPIFN
SMAPASPRLLNQTEKLAAEDGWRSSSLLEEKMPRWAAEHRLEEKKTWLTNGFDVFECPPP
KTENELLQMFYRQQEEIRRLRELLTQREVQAKQLELEIKNLRMGSEQL

>gi568815589f:97956809_98183037|GENSCAN_predicted_CDS_6|1587_bp
atgggacgtctggtgagcatcaccagctgccacagacaggtgcactcccgccacccccag
cagttgctcaagatgtcatggcacccccagtaccggagctccaagttccgtcatgtcttt
ggcaaaccagccagcaaggagaactgctacgactccgtgcctatcacccgcagcgttcac
gacaaccacttctgtgccgtgaacccccacttcattgcagttgtgactgagtgtgctggt
ggaggggccttcctcgtcatccccctgcaccagacagggaagttggacccccactaccca
aaagtctgcgggcacagaggcaacgttttggatgtcaagtggaacccttttgatgatttt
gagatcgcctcctgttctgaagatgccacaattaagatctggagcatccccaagcagctg
ctgaccaggaacctcacggcctacaggaaggaactcgtgggccacgcgcgcagagtaggc
ctggtggagtggcaccccacggccgccaacatcctcttcagtgctggctatgactacaag
gtgatgatctggaacctggatacaaaggagtctgtcatcacaagccccatgagtacgatt
agctgtcaccaagatgtgatcctctccatgtccttcaacaccaacggcagcctgttggcc
accacctgcaaagaccgcaagattcgggttattgacccccgagcagggaccgtcctccag
gaggccagctacaaagggcaccgggccagcaaagtgctgtttctggggaacctgaagaag
ctgatgtccacaggcacatcccgatggaacaaccggcaggtggccttgtgggaccaggat
aacctctctgtgcctctgatggaggaggacctggacggctcctcgggcgtgctgtttccc
ttctatgacgcggacaccagcatgctctacgtggtggggaagggagatggcaacatccgc
tactacgaggtgagcgccgacaagcctcacctgagctacctgactgagtaccgctcctat
aacccacagaaggggatcggtgtcatgccaaagagaggactcgacgtgtcctcctgcgag
atcttccgcttctacaagctgatcacaaccaaaagcctcatcgagcccatctccatgatt
gtgccccggcggtcagaatcctaccaagaggacatataccctccaacagcaggggcccag
ccctccctgacggcccaggagtggctcagcgggatgaatcgagagagacctatcttcaat
tccatggccccagcctcaccccggctcttgaatcagacagaaaagctggctgcagaagat
ggctggaggtcttcctccctgttggaggagaagatgccaaggtgggcagcagaacacagg
ctggaggagaagaaaacctggctgacaaatggctttgacgttttcgaatgccccccacca
aagacagagaatgagttgctgcagatgttctaccggcaacaggaggagatccgaaggctc
cgggagctgttgacccagcgagaggtccaggccaaacagttggaactggagatcaaaaac
ttgcggatgggctcagagcagctctga