GENSCAN 1.0 Date run: 6-Nov-116 Time: 23:06:46 Sequence gi568815597f:206797833_207003056 : 205224 bp : 44.63% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 1029 1174 146 1 2 125 98 0 0.137 4.70 1.02 Intr + 11596 11676 81 0 0 130 58 6 0.046 1.63 1.03 Intr + 39126 39191 66 2 0 103 67 54 0.912 3.90 1.04 Intr + 42018 42170 153 2 0 66 56 205 0.848 15.37 1.05 Intr + 43172 43246 75 1 0 37 110 34 0.326 0.21 1.06 Intr + 45995 46104 110 1 2 122 75 2 0.484 1.38 1.07 Intr + 49436 49501 66 2 0 58 96 43 0.031 0.12 1.08 Intr + 56731 56837 107 1 2 68 80 26 0.029 -0.34 1.09 Intr + 67715 67854 140 0 2 60 83 53 0.506 2.18 1.10 Intr + 67991 68179 189 1 0 73 76 58 0.600 2.88 1.11 Intr + 68467 68532 66 0 0 77 95 21 0.634 0.80 1.12 Intr + 68652 68804 153 2 0 107 56 94 0.916 8.37 1.13 Intr + 87006 87134 129 2 0 33 87 105 0.081 5.79 1.14 Term + 87265 87405 141 0 0 52 54 103 0.981 1.23 1.15 PlyA + 88927 88932 6 1.05 2.00 Prom + 89861 89900 40 -0.96 2.01 Init + 101531 101683 153 1 0 36 101 101 0.232 4.20 2.02 Term + 105144 105227 84 2 0 112 42 118 0.980 7.25 2.03 PlyA + 105619 105624 6 1.05 3.00 Prom + 107946 107985 40 -4.76 3.01 Init + 109787 109852 66 1 0 96 -54 131 0.370 -1.21 3.02 Intr + 109865 110063 199 1 1 40 61 241 0.578 15.52 3.03 Term + 110069 110397 329 0 2 -203 52 632 0.585 25.37 3.04 PlyA + 110402 110407 6 1.05 4.07 PlyA - 110942 110937 6 1.05 4.06 Term - 111629 111490 140 0 2 97 52 42 0.577 -0.37 4.05 Intr - 112508 112378 131 1 2 144 105 88 0.773 16.44 4.04 Intr - 113979 113898 82 1 1 62 94 42 0.618 0.90 4.03 Intr - 115210 115097 114 2 0 76 78 28 0.523 1.02 4.02 Intr - 116262 115927 336 1 0 98 66 256 0.741 19.89 4.01 Init - 127406 127196 211 2 1 67 94 77 0.354 3.92 4.00 Prom - 128402 128363 40 -4.96 5.11 PlyA - 128992 128987 6 -0.45 5.10 Term - 132581 132486 96 2 0 109 42 186 0.999 14.07 5.09 Intr - 133723 133665 59 2 2 87 72 41 0.888 1.00 5.08 Intr - 133970 133839 132 0 0 101 86 83 0.895 10.02 5.07 Intr - 134745 134624 122 2 2 12 109 104 0.998 5.04 5.06 Intr - 135334 135154 181 2 1 77 110 169 0.506 16.93 5.05 Intr - 136914 136588 327 1 0 104 100 421 0.999 40.57 5.04 Intr - 137986 137654 333 2 0 91 97 553 0.999 52.04 5.03 Intr - 139919 139263 657 0 0 113 90 755 0.999 70.01 5.02 Intr - 141631 141287 345 2 0 90 86 444 0.997 39.76 5.01 Init - 142699 142657 43 1 1 69 89 121 0.842 9.88 5.00 Prom - 145757 145718 40 -3.76 6.03 PlyA - 146737 146732 6 1.05 6.02 Term - 156454 156240 215 2 2 20 38 172 0.567 2.79 6.01 Init - 158997 158868 130 0 1 89 38 105 0.631 5.91 6.00 Prom - 160162 160123 40 -4.86 7.07 PlyA - 161352 161347 6 1.05 7.06 Term - 163391 162586 806 0 2 98 40 219 0.905 11.49 7.05 Intr - 164719 164381 339 2 0 120 80 368 0.969 34.65 7.04 Intr - 167956 167883 74 0 2 70 87 33 0.185 0.45 7.03 Intr - 169294 169220 75 0 0 70 113 25 0.129 1.83 7.02 Intr - 187698 187606 93 2 0 17 95 127 0.163 5.38 7.01 Init - 203833 203772 62 1 2 88 68 21 0.109 1.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 24382 24524 143 0 2 86 68 107 0.805 8.11 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:206797833_207003056|GENSCAN_predicted_peptide_1|540_aa XRCLHTLTGVQECALRERFRTDLRVPYHSHMCTHISMCVCQCFGALFHGVFTLAQLQGKT LLANWVEYFSAQPTVLQAKDTFPNVTILSTLETLQIIKPLDVCCVTKNLLAFYVDRVFKD HQEPNPKILRKISSIANSFLYMQKTLRQCQEQRQCHCRQEATNATRVIHDNYDQAHCDPA IVVCSVFRKHTRHVLTIQCMHFVYMHTHKLRTWLLDGISEPALGNSGAHSHEGSASPDGS LPCPGNGSCSFCMANQAPGSLPGSPCYSGLTCWALTAEPGWGQNKGATTCATNSHSDSEL RPEIFSSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVI ATNLQEIRNGFSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVF KNYQTPDHYTLRKISSLANSFLTIKKDLRLCRIHLEKERSVTCELCSVTGTNVTDIDEFQ SHVTRNSSKPTTVEEQFSTEKLLRRTLFSPPVGSHMELAIASSASPESTDCLMGLEETSL >gi568815597f:206797833_207003056|GENSCAN_predicted_CDS_1|1623_bp nagcggtgcttgcacacactgacaggagtccaagaatgtgcactgagggagcgtttccgc acagatctgcgtgttccttaccactcacacatgtgcacacacatatccatgtgtgtgtgc cagtgctttggggctctgttccacggggtcttcactttagctcagcttcaagggaagact ctgttggccaactgggtagaatatttctctgcccagcccacagttctgcaagctaaggac accttcccaaatgtcactatcctgtccacattggagactctgcagatcattaagccctta gatgtgtgctgcgtgaccaagaacctcctggcgttctacgtggacagggtgttcaaggat catcaggagccaaaccccaaaatcttgagaaaaatcagcagcattgccaactctttcctc tacatgcagaaaactctgcggcaatgtcaggaacagaggcagtgtcactgcaggcaggaa gccaccaatgccaccagagtcatccatgacaactatgatcaggctcactgtgacccagcc atagttgtttgctctgtgtttcgcaaacacaccaggcatgttctcaccatacagtgtatg cactttgtgtacatgcacactcataaactgaggacttggctcttggatggcatttctgaa cctgccctggggaacagtggagcccacagccatgaagggtctgcatctccagatggctct ctgccatgtcctggcaatggctcttgtagtttctgcatggccaaccaggctccaggtagc cttcctggttctccctgctactcaggccttacctgctgggcactaacggcggagccagga tggggacagaataaaggagccacgacctgtgccaccaactcgcactcagactctgaactc agacctgaaatcttctcttcacgggaggcttggcagtttttcttactcctgtggtctcca gatttcaggcctaagatgaaagcctctagtcttgccttcagccttctctctgctgcgttt tatctcctatggactccttccactggactgaagacactcaatttgggaagctgtgtgatc gccacaaaccttcaggaaatacgaaatggattttctgagatacggggcagtgtgcaagcc aaagatggaaacattgacatcagaatcttaaggaggactgagtctttgcaagacacaaag cctgcgaatcgatgctgcctcctgcgccatttgctaagactctatctggacagggtattt aaaaactaccagacccctgaccattatactctccggaagatcagcagcctcgccaattcc tttcttaccatcaagaaggacctccggctctgtaggattcaccttgagaaagaaaggagt gtgacatgtgagctgtgctcagtaacaggaaccaatgtcactgatattgatgaattccag tcccatgttacgagaaactcctccaaaccaaccactgtggaggagcaatttagcactgag aagctgctgaggagaaccctcttctcaccaccagtggggtcacacatggagcttgcaata gctagctcagccagccctgaaagcacagactgcctcatgggcttggaagaaacatccctt tga >gi568815597f:206797833_207003056|GENSCAN_predicted_peptide_2|78_aa MVVLPCLGFTLLLWSQVSGAQGQEFHFGPCQVKGVVPQKLWEAFWAVKDTMLDVEAALTK ALGEVDILLTWMQKFYKL >gi568815597f:206797833_207003056|GENSCAN_predicted_CDS_2|237_bp atggttgtgctcccttgcctgggttttaccctgcttctctggagccaggtatcaggggcc cagggccaagaattccactttgggccctgccaagtgaagggggttgttccccagaaactg tgggaagccttctgggctgtgaaagacactatgttggacgtagaagcagctctgaccaaa gcccttggggaagtggacattcttctgacctggatgcagaaattctacaagctctga >gi568815597f:206797833_207003056|GENSCAN_predicted_peptide_3|197_aa MAGVQVLVLDGGGHLLGPLAAIVLLGWKVVVVRWEGINISGNFYRNKLKYLAFPRKRMNT NPSRGPYHFPAPSRILWHKYEACCPKRPSQAALDHLKVFDGILLPYDKKKRMVVPAALKV VRLKLTRKFAYLERLAHDGGWKYQAVITTLEEKWKEKAKFHYRKKKQLMRLRKRAEKNVE KKTDKYTEVLKTHGLLV >gi568815597f:206797833_207003056|GENSCAN_predicted_CDS_3|594_bp atggcgggggtgcaggtcctggtgcttgatggtggaggccatctcctgggccccctggcg gccatcgtactgctgggctggaaggtggttgtcgtacgctgggagggcatcaacatttct ggcaatttctacagaaacaagttgaagtacctggctttcccccgcaagcggatgaacacc aacccttcccgaggtccctaccacttcccggcccccagccgcatcttgtggcacaagtac gaggcttgctgccccaagagaccaagccaggccgccctggaccacctcaaggtgtttgac ggcatcctactgccctacgacaagaaaaagcggatggtggttcctgctgccctcaaggtc gtgcgtctgaagcttacaagaaagtttgcctatctggagcgcctggctcacgatggtggc tggaagtaccaggcagtgataaccaccctggaggagaagtggaaggagaaggccaagttc cactacaggaagaagaaacagctcatgaggctacggaaacgggccgagaagaacgtggag aagaaaactgacaaatacacagaggtcctcaagacccacgggctcctggtctga >gi568815597f:206797833_207003056|GENSCAN_predicted_peptide_4|337_aa MVCSSLADTPLGVRVPGHHGSPVSQKGEIELRREKKKEENKDVRIVKFTISPAVSLPLGP NPHDPGATAKVSGALRILPEVKVEGELGGSVTIKCPLPEMHVRIYLCREMAGSGTCGTVV STTNFIKAEYKGRVTLKQYPRKNLFLVEVTQLTESDSGVYACGAGMNTDRGKTQKVTLNV HSEYEPSWEEQPMPETPKWFHLPYLFQMPAYASSSKFVTRASKISALEGLLKPQTPSYNH HTRLHRQRALDYGSQSGREGQGFHILIPTILGLFLLALLGLVVKRAVERRKGKLRSCPDW QPRPDWAAPALGGDPSVASRAAAAAIRRAGFPAAPGP >gi568815597f:206797833_207003056|GENSCAN_predicted_CDS_4|1014_bp atggtgtgctccagcctggcagacactcccctgggagtaagagtgccagggcaccatggt agccctgtgagtcagaaaggagaaatcgagcttagacgagaaaaaaagaaagaagaaaat aaggatgtaagaattgtaaagttcacaatttcaccagcagtcagtcttcccttgggcccc aacccccatgacccaggggcaacagctaaagtatcgggggccctgaggatcctcccagaa gtaaaggtagagggggagctgggcggatcagttaccatcaagtgcccacttcctgaaatg catgtgaggatatatctgtgccgggagatggctggatctggaacatgtggtaccgtggta tccaccaccaacttcatcaaggcagaatacaagggccgagttactctgaagcaataccca cgcaagaatctgttcctagtggaggtaacacagctgacagaaagtgacagcggagtctat gcctgcggagcgggcatgaacacagaccggggaaagacccagaaagtcaccctgaatgtc cacagtgaatacgagccatcatgggaagagcagccaatgcctgagactccaaaatggttt catctgccctatttgttccagatgcctgcatatgccagttcttccaaattcgtaaccaga gcctcaaaaatctcagctctggaggggctgctcaagccccagacgcccagctacaaccac cacaccaggctgcacaggcagagagcactggactatggctcacagtctgggagggaaggc caaggatttcacatcctgatcccgaccatcctgggccttttcctgctggcacttctgggg ctggtggtgaaaagggccgttgaaaggaggaaaggtaagctccgctcctgccctgattgg cagccgcggcccgactgggcggccccagccctgggcggggacccgagtgtggcttcccgc gccgccgccgccgcgatccggagagcgggtttcccagcagctcctgggccctga >gi568815597f:206797833_207003056|GENSCAN_predicted_peptide_5|764_aa MLLFVLTCLLAVFPAISTKSPIFGPEEVNSVEGNSVSITCYYPPTSVNRHTRKYWCRQGA RGGCITLISSEGYVSSKYAGRANLTNFPENGTFVVNIAQLSQDDSGRYKCGLGINSRGLS FDVSLEVSQGPGLLNDTKVYTVDLGRTVTINCPFKTENAQKRKSLYKQIGLYPVLVIDSS GYVNPNYTGRIRLDIQGTGQLLFSVVINQLRLSDAGQYLCQAGDDSNSNKKNADLQVLKP EPELVYEDLRGSVTFHCALGPEVANVAKFLCRQSSGENCDVVVNTLGKRAPAFEGRILLN PQDKDGSFSVVITGLRKEDAGRYLCGAHSDGQLQEGSPIQAWQLFVNEESTIPRSPTVVK GVAGGSVAVLCPYNRKESKSIKYWCLWEGAQNGRCPLLVDSEGWVKAQYEGRLSLLEEPG NGTFTVILNQLTSRDAGFYWCLTNGDTLWRTTVEIKIIEGEPNLKVPGNVTAVLGETLKV PCHFPCKFSSYEKYWCKWNNTGCQALPSQDEGPSKAFVNCDENSRLVSLTLNLVTRADEG WYWCGVKQGHFYGETAAVYVAVEERKAAGSRDVSLAKADAAPDEKVLDSGFREIENKAIQ DPRLFAEEKAVADTRDQADGSRASVDSGSSEEQGGSSRALVSTLVPLGLVLAVGAVAVGV ARARHRKNVDRVSIRSYRTDISMSDFENSREFGANDNMGASSITQETSLGGKEEFVATTE STTETKEPKKAKRSSKEEAEMAYKDFLLQSSTVAAEAQDGPQEA >gi568815597f:206797833_207003056|GENSCAN_predicted_CDS_5|2295_bp atgctgctcttcgtgctcacctgcctgctggcggtcttcccagccatctccacgaagagt cccatatttggtcccgaggaggtgaatagtgtggaaggtaactcagtgtccatcacgtgc tactacccacccacctctgtcaaccggcacacccggaagtactggtgccggcagggagct agaggtggctgcataaccctcatctcctcggagggctacgtctccagcaaatatgcaggc agggctaacctcaccaacttcccggagaacggcacatttgtggtgaacattgcccagctg agccaggatgactccgggcgctacaagtgtggcctgggcatcaatagccgaggcctgtcc tttgatgtcagcctggaggtcagccagggtcctgggctcctaaatgacactaaagtctac acagtggacctgggcagaacggtgaccatcaactgccctttcaagactgagaatgctcaa aagaggaagtccttgtacaagcagataggcctgtaccctgtgctggtcatcgactccagt ggttatgtaaatcccaactatacaggaagaatacgccttgatattcagggtactggccag ttactgttcagcgttgtcatcaaccaactcaggctcagcgatgctgggcagtatctctgc caggctggggatgattccaatagtaataagaagaatgctgacctccaagtgctaaagccc gagcccgagctggtttatgaagacctgaggggctcagtgaccttccactgtgccctgggc cctgaggtggcaaacgtggccaaatttctgtgccgacagagcagtggggaaaactgtgac gtggtcgtcaacaccctggggaagagggccccagcctttgagggcaggatcctgctcaac ccccaggacaaggatggctcattcagtgtggtgatcacaggcctgaggaaggaggatgca gggcgctacctgtgtggagcccattcggatggtcagctgcaggaaggctcgcctatccag gcctggcaactcttcgtcaatgaggagtccacgattccccgcagccccactgtggtgaag ggggtggcaggaggctctgtggccgtgctctgcccctacaaccgtaaggaaagcaaaagc atcaagtactggtgtctctgggaaggggcccagaatggccgctgccccctgctggtggac agcgaggggtgggttaaggcccagtacgagggccgcctctccctgctggaggagccaggc aacggcaccttcactgtcatcctcaaccagctcaccagccgggacgccggcttctactgg tgtctgaccaacggcgatactctctggaggaccaccgtggagatcaagattatcgaagga gaaccaaacctcaaggtaccagggaatgtcacggctgtgctgggagagactctcaaggtc ccctgtcactttccatgcaaattctcctcgtacgagaaatactggtgcaagtggaataac acgggctgccaggccctgcccagccaagacgaaggccccagcaaggccttcgtgaactgt gacgagaacagccggcttgtctccctgaccctgaacctggtgaccagggctgatgagggc tggtactggtgtggagtgaagcagggccacttctatggagagactgcagccgtctatgtg gcagttgaagagaggaaggcagcggggtcccgcgatgtcagcctagcgaaggcagacgct gctcctgatgagaaggtgctagactctggttttcgggagattgagaacaaagccattcag gatcccaggctttttgcagaggaaaaggcggtggcagatacaagagatcaagccgatggg agcagagcatctgtggattccggcagctctgaggaacaaggtggaagctccagagcgctg gtctccaccctggtgcccctgggcctggtgctggcagtgggagccgtggctgtgggggtg gccagagcccggcacaggaagaacgtcgaccgagtttcaatcagaagctacaggacagac attagcatgtcagacttcgagaactccagggaatttggagccaatgacaacatgggagcc tcttcgatcactcaggagacatccctcggaggaaaagaagagtttgttgccaccactgag agcaccacagagaccaaagaacccaagaaggcaaaaaggtcatccaaggaggaagccgag atggcctacaaagacttcctgctccagtccagcaccgtggccgccgaggcccaggacggc ccccaggaagcctag >gi568815597f:206797833_207003056|GENSCAN_predicted_peptide_6|114_aa MDPIQPMREQALQGVYFSKGHSEQPTSTAMFWIAVPLAWIYDLACSLHCQQYDELSSYCL LSKGPAETHVGSREAAALGENPMLHLYRHVVITSANAPGLQTTVCLLDALHQRH >gi568815597f:206797833_207003056|GENSCAN_predicted_CDS_6|345_bp atggatcctattcaaccaatgagggagcaggctctgcaaggtgtgtacttcagcaaaggc cactcggagcagcccacctccacagctatgttctggatcgccgtgccccttgcttggatt tatgatctagcctgctcccttcactgtcagcagtatgatgagctgtcatcctactgccta ctctctaaggggccagctgaaacccatgtgggcagcagagaagcggcagccttaggagag aaccccatgctgcacctctacaggcatgtggtgatcaccagtgccaatgcccctggcctg cagaccactgtttgcctcctggatgcattgcatcaaagacattga >gi568815597f:206797833_207003056|GENSCAN_predicted_peptide_7|482_aa MADGKHPAQATVQGARLACPGERRTIQVLHGSTVKEKEVNRHPILVDAHRSRVFFAGHQQ EGGMENAPLPHTVPATRTHLRAMGTLRPSSPLCWREESSFAAPNSLKGSRLVSGEPGGAV TIQCHYAPSSVNRHQRKYWCRLGPPRWICQTIVSTNQYTHHRYRDRVALTDFPQRGLFVV RLSQLSPDDIGCYLCGIGSENNMLFLSMNLTISAGPASTLPTATPAAGELTMRSYGTASP VANRWTPGTTQTLGQGTAWDTVASTPGTSKTTASAEGRRTPGATRPAAPGTGSWAEGSVK APAPIPESPPSKSRSMSNTTEGVWEGTRSSVTNRARASKDRREMTTTKADRPREDIEGVR IALDAAKKVLGTIGPPALVSETLAWEILPQATPVSKQQSQGSIGETTPAAGMWTLGTPAA DVWILGTPAADVWTSMEAASGEGSAAGDLDAATGDRGPQATLSQTPAVGPWGPPGKESSV KR >gi568815597f:206797833_207003056|GENSCAN_predicted_CDS_7|1449_bp atggctgatggcaaacacccagcacaggcaacagttcagggagcacgtctggcctgtcct ggggaacggaggacaatccaggtcctgcatggatcaacggtcaaagagaaagaagtcaat agacatcccatcctggtggatgcccatcggagcagagtgttttttgcaggtcaccagcag gagggcgggatggaaaatgcccctcttcctcatactgtgcctgctacaaggacccatctc cgggccatgggaacactcaggccttcctcgcccctctgctggcgggaggagagctccttt gcagctccaaattcattgaagggctcaaggctggtgtcaggggagcctggaggagctgtc accatccagtgccattatgccccctcatctgtcaacaggcaccagaggaagtactggtgc cgtctggggcccccaagatggatctgccagaccattgtgtccaccaaccagtatactcac catcgctatcgtgaccgtgtggccctcacagactttccacagagaggcttgtttgtggtg aggctgtcccaactgtccccggatgacatcggatgctacctctgcggcattggaagtgaa aacaacatgctgttcttaagcatgaatctgaccatctctgcaggtcccgccagcaccctc cccacagccactccagctgctggggagctcaccatgagatcctatggaacagcgtctcca gtggccaacagatggaccccaggaaccacccagaccttaggacaggggacagcatgggac acagttgcttccactccaggaaccagcaagactacagcttcagctgagggaagacgaacc ccaggagcaaccaggccagcagctccagggacaggcagctgggcagagggttctgtcaaa gcacctgctccgattccagagagtccaccttcaaagagcagaagcatgtccaatacaaca gaaggtgtttgggagggcaccagaagctcggtgacaaacagggctagagccagcaaggac aggagggagatgacaactaccaaggctgataggccaagggaggacatagagggggtcagg atagctcttgatgcagccaaaaaggtcctaggaaccattgggccaccagctctggtctca gaaactttggcctgggaaatcctcccacaagcaacgccagtttctaagcaacaatctcag ggttccattggagaaacaactccagctgcaggcatgtggaccttgggaactccagctgca gatgtgtggatcttgggaactccagctgcagatgtgtggaccagcatggaggcagcatct ggggaaggaagcgctgcaggggacctagatgctgccactggagacagaggtccccaagca acactgagccagaccccggcagtaggaccctggggaccccctggcaaggagtcctccgtg aagcggtaa